Translation Crowdsourcing: Creating a Multilingual Corpus of Online Educational Content.
2018
The present work describes a multilingual corpus of online content in the educational domain, i.e. Massive Open Online Course
material, ranging from course forum text to subtitles of online video lectures, that has been developed via large-scale crowdsourcing.
The English source text is manually translated into 11 European and BRIC languages using the CrowdFlower platform. During the
process several challenges arose which mainly involved the in-domain text genre, the large text volume, the idiosyncrasies of each
target language, the limitations of the crowdsourcing platform, as well as the quality assurance and workflow issues of the
crowdsourcing process. The corpus constitutes a product of the EU-funded TraMOOC project and is utilised in the project in order to
train, tune and test machine translation engines.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
10
References
6
Citations
NaN
KQI