Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation

Chiraag Lala,Pranava Swaroop Madhyastha,Josiah Wang,Lucia Specia

Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation

2017

Chiraag Lala
Pranava Swaroop Madhyastha
Josiah Wang
Lucia Specia

Recent work on multimodal machine translation has attempted to address the problem of producing target language image descriptions based on both the source language description and the corresponding image. However, existing work has not been conclusive on the contribution of visual information. This paper presents an in-depth study of the problem by examining the differences and complementarities of two related but distinct approaches to this task: textonly neural machine translation and image captioning. We analyse the scope for improvement and the effect of different data and settings to build models for these tasks. We also propose ways of combining these two approaches for improved translation quality.

Keywords:

Machine translation
Artificial intelligence
Natural language processing
Automatic image annotation
Computer science
Example-based machine translation
Transfer-based machine translation
Closed captioning
Rule-based machine translation
Speech recognition

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations