Topic Modelling and Hotel Rating Prediction based on Customer Review in Indonesia

2021 
The growth of the tourism sector and the use of online hotel booking platforms have led to the creation of textual data sources in the form of customer reviews. The motivation of this study is to add value to these reviews, using more than 50,000 samples taken from 510 hotels across Indonesia. The first added value is an understanding of the topics most discussed by hotel customers. Using the latent Dirichlet allocation (LDA) topic model, this study reveals that services, price/food, facility, comfort and location are the most discussed topics. Second, a numerical hotel rating is derived from the textual data using ridge regression. In addition, the regression coefficients indicate the sentiment of each word in the customer reviews. Finally, the output of this study is expected to be useful for customers in assessing hotel service quality and making booking decisions, and for hotel operators as additional input for management decision making.
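As an illustration of how ridge regression coefficients can be read as word-level sentiment, the following is a minimal sketch and not the study's implementation. The toy reviews, the 1 to 10 rating scale, the bag-of-words features, the fixed mean-rating intercept, the gradient-descent solver, and the names `fit_ridge` and `predict` are all assumptions made for this example; the abstract does not specify any of them.

```python
from collections import Counter

# Hypothetical toy reviews with ratings on a 1-10 scale (not data from the study).
REVIEWS = [
    ("clean room", 9.0),
    ("dirty room", 3.0),
    ("clean pool", 8.0),
    ("dirty pool", 4.0),
]


def fit_ridge(docs, ratings, alpha=1.0, lr=0.1, epochs=2000):
    """Fit ridge regression on word counts with batch gradient descent."""
    vocab = sorted({word for doc in docs for word in doc.split()})
    # Bag-of-words matrix: one row per review, one column per vocabulary word.
    X = [[float(Counter(doc.split())[word]) for word in vocab] for doc in docs]
    # Use the mean rating as a fixed intercept and fit weights to the deviations.
    intercept = sum(ratings) / len(ratings)
    y = [r - intercept for r in ratings]
    w = [0.0] * len(vocab)
    n = len(docs)
    for _ in range(epochs):
        # Residuals are computed once per epoch, so this is a full-batch step.
        residuals = [sum(wj * xj for wj, xj in zip(w, row)) - yi
                     for row, yi in zip(X, y)]
        for j in range(len(vocab)):
            # Gradient of (1/2n) * (||Xw - y||^2 + alpha * ||w||^2) w.r.t. w[j].
            grad = (sum(r * row[j] for r, row in zip(residuals, X)) + alpha * w[j]) / n
            w[j] -= lr * grad
    return dict(zip(vocab, w)), intercept


def predict(doc, weights, intercept):
    """Predict a rating as the intercept plus the weights of known words."""
    return intercept + sum(weights.get(word, 0.0) for word in doc.split())


docs, ratings = zip(*REVIEWS)
weights, intercept = fit_ridge(docs, ratings)
```

On this toy data the weight for "clean" comes out positive and the weight for "dirty" negative, while "room" and "pool" stay near zero because they appear equally in high-rated and low-rated reviews. This is the sense in which a coefficient can be interpreted as the sentiment of a word.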