Mining online reviews in Indonesia’s priority tourist destinations using sentiment analysis and text summarization approach

Published in 2017 IEEE 8th International Conference on Awareness Science and Technology (iCAST), 2017

In this modern era, online hotel reviews have a big role considering the hotel is one of the aspects in determining the competitiveness in the tourist area, but its implementation is still rare. Regarding the government’s plan to increase tourist arrivals to Indonesia, this research utilized text mining towards online hotel reviews to find useful knowledge in building the hospitality sector as an integral part of the tourism industry. Text classification technique was used to obtain sentiment information contained in review sentences through sentiment analysis, as well as clustering technique as a part of text summarization to find representative sentences that are able to describe the entire contents of the review. The main contribution of this research is to combine two techniques in text mining that have never been done before, namely the sentiment analysis and text summarization. Experiments with hotel reviews in Labuan Bajo and Bali generated surprising outcomes, where the accuracy of classification model reaches 78% and the Davies-Bouldin Index (DBI) of clustering algorithm strikes 0.071. The output of this research is expected to describe the condition of the hotel in the tourist area with a different level of tourism development so that it can contribute to improving the quality of the hotel industry as well as supporting the tourism industry in Indonesia.

Download paper here