How to Measure Word Length in Spoken and Written Chinese

2016 
Abstract
Choosing an appropriate measurement unit for word length is a key prerequisite for word length distribution studies, since the appropriate unit varies with the type of language or text. Taking Chinese as an example, this study explores the word length distributions of spoken and written Chinese using a data source of 20 dialogue texts (spoken language) and 20 prose texts (written language). Word length is measured in pinyin letters, phonemes, and syllables for spoken Chinese, and in strokes, components, and characters for written Chinese. To select the most appropriate measurement unit, the study draws on empirical word length distribution models, synergetic linguistic theories, and Menzerath’s law. Results show that the syllable is the most appropriate measurement unit for spoken Chinese, and the component the most appropriate measurement unit for written Chinese. Chinese word length distributions can b...
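As an illustration of how the choice of measurement unit changes a word length distribution, the following is a minimal Python sketch and not the study's own procedure. It assumes the text has already been segmented into words. The function name `word_length_distribution`, the table `TOY_STROKES`, and the sample word list are hypothetical and introduced only for this example. Counting characters also approximates counting syllables, on the assumption of one syllable per character, which ignores cases such as erhua.

```python
from collections import Counter

def word_length_distribution(words, unit_length=len):
    """Return {length: relative frequency} for a list of segmented words.

    unit_length maps a word to its length in the chosen unit.
    The default, len, counts characters.
    """
    counts = Counter(unit_length(w) for w in words)
    total = sum(counts.values())
    return {length: counts[length] / total for length in sorted(counts)}

# Hypothetical toy stroke table covering only the characters used below.
TOY_STROKES = {"人": 2, "大": 3, "好": 6}

def stroke_length(word):
    """Word length in strokes, summed over the word's characters."""
    return sum(TOY_STROKES[ch] for ch in word)

# Hypothetical, already-segmented sample (not data from the study).
words = ["大人", "好", "人", "好人"]

by_character = word_length_distribution(words)
by_stroke = word_length_distribution(words, unit_length=stroke_length)
```

With the same four words, the character-based distribution has only two length classes, while the stroke-based one spreads the words over four classes. A component-based measure would need a decomposition table in place of `TOY_STROKES`.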