Hybrid POS Classification Using Long Short Term Memory

2018 
A Part of Speech classifier is an important tool that is used to develop many NLP tasks. In this paper we described Deep Neural Network based architecture, that given a sentence, outputs a part-of-speech tag sequence. For Part of Speech classification, we trained a LSTM network by semi-supervised learning. We further applied a probabilistic approach to improve Part of Speech prediction of the LSTM network. Word2vec deep learning tool is used for learning high-quality distributed vector representations of words that capture a large number of precise word relationships. By using a small amount of labeled dataset of words and applying cosine similarity we assign Part of Speech tags to words. If some words don't show nearness to any word in labeled dataset, then we apply probabilistic approach for classification of unclassified words of input sentence. This hybrid neural Part of Speech classifier is evaluated over a corpus of 1000 sentences having a total of 7,400 words; part of speech classification system achieved the accuracy of 94.71 %.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    4
    References
    0
    Citations
    NaN
    KQI
    []