A Document Ranking Method by Document Clustering Using Bayesian SoM and Botstrap

Jun Hyeok Choe,Seong-Hae Jeon,Jeong Hyeon Lee

A Document Ranking Method by Document Clustering Using Bayesian SoM and Botstrap

2000

The conventional Boolean retrieval systems based on vector spae model can provide the results of retrieval fast, they can't reflect exactly user's retrieval purpose including semantic information. Consequently, the results of retrieval process are very different from those users expected. This fact forces users to waste much time for finding expected documents among retrieved documents. In his paper, we designed a bayesian SOM(Self-Organizing feature Maps) in combination with bayesian statistical method and Kohonen network as a kind of unsupervised learning, then perform classifying documents depending on the semantic similarity to user query in real time. If it is difficult to observe statistical characteristics as there are less than 30 documents for clustering, the number of documents must be increased to at least 50. Also, to give high rank to the documents which is most similar to user query semantically among generalized classifications for generalized clusters, we find the similarity by means of Kohonen centroid of each document classification and adjust the secondary rank depending on the similarity.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations