Design of a Set of Free Resources for Arabic NLP under Unitex

2013 
This paper describes the process of building a free Arabic package for the Unitex framework: we propose a test corpus, choose a tag set suited to this task, and build dictionaries that respect the LADL DELA format. We describe each of these steps, particularly the building of dictionaries, for which we designed algorithms that automatically generate verb and noun inflection graphs. We rely on the foundations of word-based inflection and define a set of themes (stems) for each lexeme. For verbs, we use five themes given by the user, and the graphs generate up to 264 inflected verbal forms; for nouns and adjectives, we use one or at most two themes, and the resulting graphs generate 63 inflected forms.
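
As a rough illustration of theme-based generation, the sketch below combines user-supplied themes with a small affix table and prints entries in the DELAF-style layout `form,lemma.POS:features`. It is a toy example under stated assumptions, not the paper's algorithm: the function `inflect`, the table `AFFIXES`, the feature codes, the transliteration, and the use of only two themes and six forms are all hypothetical. The paper's actual tag set, its five verbal themes, and its Unitex inflection graphs are not reproduced here.

```python
from typing import Dict, List, Tuple

# Hypothetical affix table: (theme name, prefix, suffix, feature code).
# Forms are transliterated; the feature codes are invented for illustration
# and are NOT the tag set chosen in the paper.
AFFIXES: List[Tuple[str, str, str, str]] = [
    ("perfective", "", "a", "P3ms"),       # kataba   'he wrote'
    ("perfective", "", "at", "P3fs"),      # katabat  'she wrote'
    ("perfective", "", "tu", "P1s"),       # katabtu  'I wrote'
    ("imperfective", "ya", "u", "I3ms"),   # yaktubu  'he writes'
    ("imperfective", "ta", "u", "I3fs"),   # taktubu  'she writes'
    ("imperfective", "'a", "u", "I1s"),    # 'aktubu  'I write'
]


def inflect(lemma: str, themes: Dict[str, str], pos: str = "V") -> List[str]:
    """Build DELAF-style lines by attaching each affix pair to its theme."""
    entries = []
    for theme_name, prefix, suffix, code in AFFIXES:
        stem = themes[theme_name]          # theme supplied by the user
        form = prefix + stem + suffix
        entries.append(f"{form},{lemma}.{pos}:{code}")
    return entries


# Two illustrative themes for the verb 'kataba' (to write).
themes = {"perfective": "katab", "imperfective": "ktub"}
for line in inflect("kataba", themes):
    print(line)
```

Keeping the affixes in a table separate from the themes mirrors the general idea that one shared inflection pattern can serve many lexemes, each contributing only its own themes.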