A Clustering Chunking Method Based on Manifold Geodesic Distance

2013 
Regarding the Chinese chunker analysis as a procedure of inner-sentence word clustering and chunker type labeling,a grammar function space is constructed at first,and then embedded in a lower dimension space by applying ISOMAP to observe the distribution feature of Chinese word in the embedding space.In the hierarchical clustering algorithm which is aiming at partitioning word into different clusters,the manifold geodesic distance is employed instead of Euclidean distance to measure the similarity between words.The algorithm facilitates the increment of Chinese chunker analysis performance under the condition of appropriate algorithm complexity.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []