RefineCap: Concept-Aware Refinement for Image Captioning.

Yekun Chai, Shuo Jin, Junliang Xing

2021

Automatically translating images into text requires both image scene understanding and language modeling. In this paper, we propose a novel model, termed RefineCap, that refines the output vocabulary of the language decoder using decoder-guided visual semantics and implicitly learns the mapping between visual tag words and images. The proposed Visual-Concept Refinement method allows the generator to attend to semantic details in the image, thereby generating more semantically descriptive captions. Our model outperforms previous visual-concept-based models on the MS-COCO dataset.
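
The abstract does not give the formulation of the refinement step, so the following is only an illustrative sketch of the general idea of biasing a decoder's output vocabulary toward detected visual tag words. It is not the authors' implementation. The function name `refine_vocab_logits`, its parameters, the dot-product attention, and the additive bias are all assumptions made for this example.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    x = x - np.max(x)
    e = np.exp(x)
    return e / e.sum()


def refine_vocab_logits(hidden, concept_emb, concept_ids, vocab_logits, scale=1.0):
    """Hypothetical refinement step (an assumption, not the paper's method).

    hidden:       (d,)   decoder hidden state at the current time step
    concept_emb:  (k, d) embeddings of k visual tag words detected in the image
    concept_ids:  (k,)   vocabulary index of each tag word
    vocab_logits: (V,)   unrefined decoder logits over the vocabulary
    """
    # Decoder-guided weights: how relevant each tag word is to the current state.
    attn = softmax(concept_emb @ hidden)
    # Copy so the caller's logits are left untouched.
    refined = vocab_logits.copy()
    # Add the weighted bias at the tag-word positions; np.add.at also
    # accumulates correctly if the same vocabulary index appears twice.
    np.add.at(refined, concept_ids, scale * attn)
    return refined, attn


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d, k, V = 8, 3, 20
    refined, attn = refine_vocab_logits(
        hidden=rng.normal(size=d),
        concept_emb=rng.normal(size=(k, d)),
        concept_ids=np.array([2, 7, 11]),
        vocab_logits=rng.normal(size=V),
    )
    print("concept weights:", attn)
    print("refined logits shape:", refined.shape)
```

In this sketch only the logits at the tag-word positions change, so words unrelated to the detected concepts keep their original scores.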

Keywords:

Language model
Image (mathematics)
Semantics
Vocabulary
Closed captioning
Natural language processing
Computer science
Artificial intelligence
Generator
