Indonesian text feature extraction using gibbs sampling and mean variational inference latent dirichlet allocation

2017 
Latent Dirichlet Allocation has been developed as topic-based method which uses reasoning to determine the topics of a document. There are many methods of reasoning used for Latent Dirichlet Allocation, including the Gibbs Sampling and Mean Variational Inference, the most widely used in research. However, there have not been many studies that discuss the implementation of these methods on the Indonesian text, so analysis is needed to compare its performance in generating feature extraction. Therefore, in this paper, will be implemented the method of reasoning Gibbs Sampling and Mean Variational Inference for Latent Dirichlet Allocation on Indonesian text. The objective is determining the performance of both algorithms on Indonesian text so it can provide a reference about the better reasoning method for Latent Dirichlet Allocation on Indonesian text. The research was implemented on digital Indonesia news text data with 100 documents. The tests are conducted on feature data as the result of extraction process using three type of evaluation metric. The test results show that Gibbs Sampling has a better performance than Mean Variational Inference for Latent Dirichlet Allocation on Indonesian text.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    17
    References
    3
    Citations
    NaN
    KQI
    []