A New Approach for Computing Semantic Relatedness with Wikipedia

2013 
Semantic relatedness measures are used in many natural language processing applications, and we propose a Wikipedia-based method for computing them. Unlike existing methods that focus on only a small part of Wikipedia (e.g., the infobox or hyperlinks), our method makes full use of the rich information contained in a Wikipedia page and can achieve higher accuracy within reasonable time. In our method, we first use certain special sections of the Wikipedia page (e.g., synonyms and hyponyms) to judge whether two concepts are closely related. If they are not, we then use pattern matching to determine whether they are connected through common relations (e.g., "a part of", "result in", and "is a member of"). Finally, if the relatedness score has not been computed in the preceding steps, we compute it with a method that improves on the well-known explicit semantic analysis method.
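
The three-stage cascade described above might be sketched roughly as follows. This is only an illustration under stated assumptions, not the method as implemented in the paper: the `PAGES` dictionary is an invented stand-in for parsed Wikipedia pages, the fixed scores 1.0 and 0.8 for the first two stages are arbitrary placeholders, the pattern templates are hypothetical, and the final stage uses a plain cosine over concept-weight vectors in the spirit of explicit semantic analysis rather than the improved variant the abstract refers to.

```python
import math
import re

# Invented toy data standing in for parsed Wikipedia pages (assumption).
PAGES = {
    "car": {
        "synonyms": {"automobile"},
        "hyponyms": {"sedan"},
        "text": "A car is a wheeled motor vehicle. The engine is a part of car.",
        "concepts": {"Vehicle": 0.9, "Transport": 0.6},
    },
    "engine": {
        "synonyms": {"motor"},
        "hyponyms": set(),
        "text": "An engine converts energy into motion.",
        "concepts": {"Vehicle": 0.5, "Machine": 0.8},
    },
    "road": {
        "synonyms": set(),
        "hyponyms": {"highway"},
        "text": "A road is a route between places.",
        "concepts": {"Transport": 0.7, "Infrastructure": 0.7},
    },
}

# Hypothetical relation templates modeled on the abstract's examples.
RELATION_TEMPLATES = [
    r"\b{a} is a part of {b}\b",
    r"\b{a} results? in {b}\b",
    r"\b{a} is a member of {b}\b",
]


def closely_related(a, b):
    """Stage 1: does either page list the other term as a synonym or hyponym?"""
    for x, y in ((a, b), (b, a)):
        page = PAGES.get(x)
        if page and (y in page["synonyms"] or y in page["hyponyms"]):
            return True
    return False


def pattern_related(a, b):
    """Stage 2: does any relation template match in either page's text?"""
    text = " ".join(PAGES.get(t, {}).get("text", "") for t in (a, b)).lower()
    for x, y in ((a, b), (b, a)):
        for template in RELATION_TEMPLATES:
            pattern = template.format(a=re.escape(x), b=re.escape(y))
            if re.search(pattern, text):
                return True
    return False


def cosine(u, v):
    """Cosine similarity between two sparse concept-weight vectors."""
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * math.sqrt(
        sum(w * w for w in v.values())
    )
    return dot / norm if norm else 0.0


def relatedness(a, b):
    """Return (score, stage); later stages run only if earlier ones fail."""
    if closely_related(a, b):
        return 1.0, "sections"  # placeholder score (assumption)
    if pattern_related(a, b):
        return 0.8, "pattern"  # placeholder score (assumption)
    u = PAGES.get(a, {}).get("concepts", {})
    v = PAGES.get(b, {}).get("concepts", {})
    return cosine(u, v), "esa"
```

With this toy data, `relatedness("car", "automobile")` is resolved in the first stage, `relatedness("engine", "car")` in the second, and `relatedness("car", "road")` falls through to the vector comparison. Ordering the stages from cheapest to most expensive is what would let such a cascade stay within reasonable time.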