Discrimination between Arabic and Latin from bilingual documents

Sofiene Haboubi, Samia Snoussi Maddouri, Hamid Amiri

2012

The electronic reading of documents is an important task in machine learning. In automatic document text recognition, discrimination between languages is one of the first steps. We are interested in processing mixed Arabic/Latin printed documents. Our method works essentially at the word level: we first extract the words, then extract structural features from each word, and then recognize the language in which it is written. Finally, we present the results of our classification approach and discuss possible improvements.
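The word-extraction and feature-extraction steps named in the abstract can be illustrated with a minimal sketch. The abstract does not say how words are segmented or which structural features are used, so everything below is an assumption made for illustration: a text line is taken to be a binarized image (rows of 0/1), words are split at runs of blank columns in the vertical projection profile, and the hypothetical helper names `column_profile`, `segment_words`, and `word_features`, the `min_gap` threshold, and the three features are not taken from the paper.

```python
from typing import Dict, List, Tuple

Image = List[List[int]]  # rows of pixels: 0 = background, 1 = ink (assumed input format)


def column_profile(img: Image) -> List[int]:
    """Count ink pixels in every column (vertical projection profile)."""
    return [sum(col) for col in zip(*img)]


def segment_words(img: Image, min_gap: int = 2) -> List[Tuple[int, int]]:
    """Split a text line into word spans [start, end).

    A new word starts whenever at least `min_gap` consecutive blank
    columns separate two inked columns (illustrative threshold).
    """
    spans: List[Tuple[int, int]] = []
    start = None      # first inked column of the current word
    last_ink = None   # most recent inked column seen
    for x, count in enumerate(column_profile(img)):
        if count == 0:
            continue
        if start is None:
            start = x
        elif x - last_ink - 1 >= min_gap:
            spans.append((start, last_ink + 1))
            start = x
        last_ink = x
    if start is not None:
        spans.append((start, last_ink + 1))
    return spans


def word_features(img: Image, span: Tuple[int, int]) -> Dict[str, int]:
    """Compute a few simple, hypothetical structural features of one word."""
    start, end = span
    profile = column_profile(img)[start:end]
    # Number of horizontally separated pieces inside the word span.
    pieces = sum(
        1 for i, c in enumerate(profile)
        if c > 0 and (i == 0 or profile[i - 1] == 0)
    )
    return {"width": end - start, "ink": sum(profile), "pieces": pieces}


# Tiny synthetic "line" with two words separated by three blank columns.
line = [
    [1, 1, 0, 1, 0, 0, 0, 1, 1, 1],
    [0, 1, 0, 1, 0, 0, 0, 1, 0, 1],
]
words = segment_words(line, min_gap=2)
features = [word_features(line, w) for w in words]
```

In a full system, feature vectors of this kind would be passed to a classifier that assigns each word to Arabic or Latin; that step is left out here because the abstract does not specify the classifier.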

Keywords:

Arabic
Linguistics
Computer science

