On Compositional Generalization of Neural Machine Translation

Yafu Li, Yongjing Yin, Yulong Chen, Yue Zhang


2021


Modern neural machine translation (NMT) models have achieved competitive performance on standard benchmarks such as WMT. However, significant issues remain, including robustness and domain generalization. In this paper, we study NMT models from the perspective of compositional generalization by building a benchmark dataset, CoGnition, consisting of 216k clean and consistent sentence pairs. We quantitatively analyze the effects of various factors using compound translation error rate, and demonstrate that the NMT model fails badly on compositional generalization, although it performs remarkably well under traditional metrics.

Keywords:

Domain (software engineering)
Robustness (computer science)
Artificial intelligence
Machine translation
Sentence
Computer science
Perspective (graphical)
Generalization
Benchmark (computing)
Translation error rate
Machine learning
