A Comparison of Traditional Machine Learning Approaches for Supervised Feedback Classification in Bahasa Indonesia

Rusli, Andre and Suryadibrata, Alethea and Nusantara, Samiaji Bintang and Young, Julio Cristian (2020) A Comparison of Traditional Machine Learning Approaches for Supervised Feedback Classification in Bahasa Indonesia. IJNMT (International Journal of New Media Technology), 7 (1). ISSN 2355-0082

Full text not available from this repository.


The advancement of machine learning and natural language processing techniques hold essential opportunities to improve the existing software engineering activities, including the requirements engineering activity. Instead of manually reading all submitted user feedback to understand the evolving requirements of their product, developers could use the help of an automatic text classification program to reduce the required effort. Many supervised machine learning approaches have already been used in many fields of text classification and show promising results in terms of performance. This paper aims to implement NLP techniques for the basic text preprocessing, which then are followed by traditional (non-deep learning) machine learning classification algorithms, which are the Logistics Regression, Decision Tree, Multinomial Naïve Bayes, K-Nearest Neighbors, Linear SVC, and Random Forest classifier. Finally, the performance of each algorithm to classify the feedback in our dataset into several categories is evaluated using three F1 Score metrics, the macro-, micro-, and weighted-average F1 Score. Results show that generally, Logistics Regression is the most suitable classifier in most cases, followed by Linear SVC. However, the performance gap is not large, and with different configurations and requirements, other classifiers could perform equally or even better.

Item Type: Article
Subjects: 000 Computer Science, Information and General Works > 000 Computer Science, Knowledge and Systems > 006 Special Computer Methods (3D Graphics, Digital Video, Data Mining, Augmented Reality)
400 Language > 490 Other language
Divisions: Fakultas Teknik Informatika > Program Studi Informatika
Depositing User: mr admin umn
Date Deposited: 05 Oct 2021 10:23
Last Modified: 05 Oct 2021 10:23
URI: http://kc.umn.ac.id/id/eprint/18541

Actions (login required)

View Item View Item