An Empirical Study of End-To-End Simultaneous Speech Translation Decoding Strategies

Ha Nguyen,Yannick Estève,Laurent Besacier

An Empirical Study of End-To-End Simultaneous Speech Translation Decoding Strategies

2021

Ha Nguyen
Yannick Estève
Laurent Besacier

This paper proposes a decoding strategy for end-to-end simultaneous speech translation. We leverage end-to-end models trained in offline mode and conduct an empirical study for two language pairs (English-to-German and English-to-Portuguese). We also investigate different output token granularities including characters and Byte Pair Encoding (BPE) units. The results show that the proposed decoding approach allows to control BLEU/Average Lagging trade-off along different latency regimes. Our best decoding settings achieve comparable results with a strong cascade model evaluated on the simultaneous translation track of IWSLT 2020 shared task.

Keywords:

End-to-end principle
Leverage (statistics)
Speech recognition
Decoding methods
Translation (geometry)
Speech translation
Security token
task
Computer science
Byte pair encoding

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations