Document Clustering with Cluster Refinement and Non-negative Matrix Factorization
2009
Document clustering is an important method for document analysis and is used in many different information retrieval applications. This paper proposes a new document clustering method using the clustering method based NMF (Non-negative Matrix Factorization) and refinement of documents in clusters by using coherence of cluster. The proposed method can improve the quality of document clustering because the re-assigned documents in cluster by using coherence of cluster based similarity between documents, the semantic feature matrix and the semantic variable matrix, which is used in document clustering, can represent an inherent structure of document set better. The experimental results demonstrate that the proposed method achieves better performance than other document clustering methods.
Keywords:
- Machine learning
- Correlation clustering
- Fuzzy clustering
- Clustering high-dimensional data
- Artificial intelligence
- Pattern recognition
- Cluster analysis
- FLAME clustering
- Document clustering
- Canopy clustering algorithm
- Computer science
- CURE data clustering algorithm
- Biclustering
- Hierarchical clustering
- Document-term matrix
- Data mining
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
17
References
15
Citations
NaN
KQI