Research on Short Text Classification Method Based on Convolution Neural Network

2018 
Short text classification is one of the hotspots of research in Natural Language Processing. a new model of text representation is proposed in this paper (N-of-DOC), and in order to solve the problem of sparse representation in Chinese, the word2vec distributed representation is used, finally, it is applied to the improved convolution neural network model (CNN) to extract the high level features from the filter layer, the classification model is obtained by connecting the softmax classifier after the pooling layer. In the experiment, the traditional text representation model and the improved text representation model are used as the input of the original data, respectively. It acts on the model of traditional machine learning (KNN, SVM, logistic regression, naive Bayes) and the improved convolution neural network model. The results show that the proposed method can not only solve the dimension disaster and sparse problem of Chinese text vectors, but also improve the classification accuracy by 10.23% compared with traditional methods.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    1
    References
    0
    Citations
    NaN
    KQI
    []