Some improvements in phrase-based statistical machine translation

Zhendong Yang, Wei Pang, Jinhua Du, Wei Wei, Bo Xu

2006


In statistical machine translation, many of the top-performing systems are phrase-based. This paper describes a phrase-based translation system and several improvements to it. We use more information to compute translation probabilities. The scaling factors of the log-linear model are estimated by minimum error rate training with an evaluation criterion that balances BLEU and NIST scores. We extract phrase templates from the initial phrases to address data sparseness and the distortion problem during decoding. The system produces its final output by re-ranking the n-best list of translations generated in the first pass. Experiments show that all of these refinements improve translation results.
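
To make the log-linear scoring and n-best re-ranking mentioned in the abstract concrete, here is a minimal sketch in Python. It is not the authors' implementation: the feature names (`tm`, `lm`, `wp`), the weights, and the hypotheses are all hypothetical values chosen for illustration. In a real system the weights would come from minimum error rate training rather than being set by hand.

```python
from typing import Dict, List, Tuple

# A hypothesis is a (translation, feature values) pair.
Hypothesis = Tuple[str, Dict[str, float]]


def loglinear_score(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted sum of feature values: sum_i lambda_i * h_i."""
    return sum(weights[name] * value for name, value in features.items())


def rerank(nbest: List[Hypothesis], weights: Dict[str, float]) -> List[Hypothesis]:
    """Sort an n-best list from highest to lowest log-linear score."""
    return sorted(nbest, key=lambda hyp: loglinear_score(hyp[1], weights), reverse=True)


# Hypothetical scaling factors (assumed here, not taken from the paper).
weights = {"tm": 1.0, "lm": 0.5, "wp": 0.1}

# Hypothetical 3-best list: log translation-model score, log language-model
# score, and word count for each candidate.
nbest = [
    ("candidate B", {"tm": -1.5, "lm": -6.0, "wp": 4.0}),
    ("candidate A", {"tm": -2.0, "lm": -4.0, "wp": 5.0}),
    ("candidate C", {"tm": -3.0, "lm": -3.0, "wp": 5.0}),
]

reranked = rerank(nbest, weights)
best_translation = reranked[0][0]
```

With these assumed numbers, the candidate with the best translation-model score alone ("candidate B") does not win once the language-model and length features are weighted in, which is why the choice of scaling factors matters.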

Keywords:

BLEU
Machine translation
Artificial intelligence
Speech recognition
NIST
Decoding methods
Pattern recognition
Phrase
Computer science
Word error rate
Rule-based machine translation
Evaluation of machine translation
Natural language
Natural language processing

