Chinese Language Processing with Complex Network Theory

2008 
We defined two kinds of Chinese words network (CWN) in this paper. The nodes in the network are composed by the Chinese characters, phrases or classical idioms from authority dictionaries. Studying the network characters and giving a new evolution model to simulate the CWN, we found that the phrase construction follows random and preferential choosing method and we found 55 Chinese characters used by ten thousand distinct phrases. Finally, the Chinese words network exhibiting Small-world character will make it easy to search information as fast as English language.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    12
    References
    3
    Citations
    NaN
    KQI
    []