Concept Based Text Classification Using Labeled and Unlabeled Data

2006 
Recent work has shown improvements in text clustering and classification by integrating conceptual features extracted from background knowledge. In this paper we address the problem of text classification with labeled data and unlabeled data. We propose a Latent Bayes Ensemble model based on word-concept mapping and transductive boosting method. With the knowledge extracted from ontologies, we hope to improve the classification accuracy even with large amounts of unlabeled documents. We conducted several experiments on two well-known corpora and the results are compared with Naive Bayes and TSVM classifiers.
    • Correction
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []