Text Emotion Classification Research Based on Improved Latent Semantic Analysis Algorithm

2013 
The emotion classification of text is an important research direction of text mining. Application on emotion text classification, latent semantic analysis algorithm has advantage of small occupied space, applicable to a large scale of text classifications. Compared with the traditional vector space model, latent semantic analysis algorithms reduce the search space for text classification by means of singular value decomposition for term and document matrix. Moreover, latent semantic analysis algorithms solve the problem of words with multiple meanings by analyzing the term at the semantic level. Using an improved latent semantic analysis algorithm to classify the test set by their emotion. The new cluster centroid is the average vector for each emotion category, and access to emotions classification for training dataset by calculating similarity of the average vector and test textual. The experimental results show that the improved latent semantic analysis algorithm have high precision and recall rate as same as the original algorithm, the efficiency of text emotion classification improved 4 percentage points. Keywords-Latent Semantic Analysis; Vector Space Model; Text Emotion Classification;
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    5
    References
    17
    Citations
    NaN
    KQI
    []