Computing Russian Morphological distribution patterns using RusAC Online Server

2020 
The article presents findings of distribution patterns of Russian grammatical categories computed with the help of MyStem.3 tagger and a proprietary Russian language processor, ETAP-3. The corpus of over 1.1 mln tokens compiled for the study comprises two types of academic textbooks used in Russian schools: Science and Humanities. We computed descriptive metrics of each textbooks with the help of the text analyzer RusAC (http://tykau.pythonanywhere.com/) and pursued the contrastive analysis of Science and Humanities textbook features. Significant differences of two types of the texts were found in distribution patterns of noun cases and verbs tenses, while morphological categories of nouns, adjectives, verbs, and adverbs demonstrate similarities. The specifics of grammatical patterns defined for classroom textbooks can be used in further studies on distribution of morphological patterns and text complexity of Russian academic texts.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    9
    References
    1
    Citations
    NaN
    KQI
    []