Challenges in Baseline Detection of Arabic Script Based Languages

Saeeda Naz, Muhammad Imran Razzak, Khizar Hayat, Muhammad Waqas Anwar, Sahib Zar Khan

2014

In this chapter, we present the challenges of baseline detection for Arabic script based languages, focusing on the Nastaliq and Naskh writing styles. Baseline detection is an important step in OCR: it directly affects the steps that follow, and an accurate baseline improves the performance and efficiency of character segmentation and feature extraction. Character recognition for Arabic script is more difficult than for Latin text because Arabic script is cursive, context sensitive, and written in several different styles. We provide a comprehensive review of baseline detection methods for the Urdu language. The aim of the chapter is to introduce the challenges that arise during baseline detection in cursive script languages written in the Nastaliq and Naskh styles.

Keywords:

Scripting language
Optical character recognition
Cursive
Urdu
Arabic script
Speech recognition
Feature extraction
Writing style
Computer science
Segmentation
