Research on text feature clustering based on improved parallel genetic algorithm

2018 
Synonyms, synonyms and the strong association of semantic information increase the dimension text feature vectors, and greatly affect the efficiency and accuracy of text classification. In order to reduce the dimension of the text feature vectors, this paper presents an improved parallel genetic algorithm to sovle the text feature clustering problem. Firstly, the K-means algorithm is used to cluster the feature words. The parallel genetic algorithm is used to fine-grained the feature words. In the process of applying genetic algorithms, crossover operator is improved so that the algorithm has a global search ability and local search capability, and reduce the dependence on the initial cluster centers. Finally, the different types of feature words are analyzed and compressed, and a set of feature words of the text category and the semantic information is formed. The experimental results verify our method of text feature extraction is better.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    1
    Citations
    NaN
    KQI
    []