Lemmatised Wordlist of 1 m. Corpus of Contemporary Lithuanian

2016 
The lemmatised wordlist of 1 m. word Lithuanian corpus. The structure of the tab delimited text file (dazninis.txt): Headword Part of Speech Wordform Frequency of Occurrence. The data is the basis for "Frequency Dictionary of Written Lithuanian - based on 1m word morphologically annotated corpus" (A_Utka-Dazninis_zodynas.pdf). Reference: Utka. A. 2009. Dažninis rasytinis lietuvių kalbos žodynas: 1 milijono žodžių morfologiskai anotuoto tekstyno pagrindu. Kaunas: VDU leidykla, ISBN 978-9955-12-546-4
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []