Gender Bias in Multilingual Neural Machine Translation: The Architecture Matters

Marta R. Costa-jussà, Carlos Escolano, Christine Basta, Javier Ferrando, Roser Batlle, Ksenia Kharitonova

2020

Multilingual Neural Machine Translation architectures differ mainly in how many modules and parameters they share among languages. In this paper, we explore from an algorithmic perspective whether the chosen architecture, when trained on the same data, influences gender bias. Experiments on four language pairs show that Language-Specific encoder-decoders exhibit less bias than the Shared encoder-decoder architecture. Further interpretability analysis of the source embeddings and the attention shows that, in the Language-Specific case, the embeddings encode more gender information and the attention is more diverted. Both behaviors help mitigate gender bias.

Keywords:

Machine translation
Gender bias
Natural language processing
ENCODE
Artificial intelligence
Computer science
Interpretability
Architecture
