Layer-Wise Coordination between Encoder and Decoder for Neural Machine Translation

Tianyu He,Xu Tan,Yingce Xia,Di He,Tao Qin,Zhibo Chen,Tie-Yan Liu

Layer-Wise Coordination between Encoder and Decoder for Neural Machine Translation

2018

Tianyu He
Xu Tan
Yingce Xia
Di He
Tao Qin
Zhibo Chen
Tie-Yan Liu

Neural Machine Translation (NMT) has achieved remarkable progress with the quick evolvement of model structures. In this paper, we propose the concept of layer-wise coordination for NMT, which explicitly coordinates the learning of hidden representations of the encoder and decoder together layer by layer, gradually from low level to high level. Furthermore, we share the parameters of each layer between the encoder and decoder to regularize and coordinate the learning. Experiments show that combined with the state-of-the-art Transformer model, layer-wise coordination achieves improvements on three IWSLT and two WMT translation tasks. More specifically, our method achieves 34.43 and 29.01 BLEU score on WMT16 English-Romanian and WMT14 English-German tasks, outperforming the Transformer baseline.

Keywords:

BLEU
Machine translation
Machine learning
Artificial intelligence
Computer science
Encoder
layer wise
Transformer
Algorithm

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

101

Citations