Klasifikasi Sentimen Ulasan Ipusnas Menggunakan Metode Natural Language Processing dan Support Vector Machine
Keywords:
Analisis Sentimen, Natural Language Processing, Support Vector Machine, iPusnas, Literasi DigitalAbstract
This study investigates sentiment classification of user reviews on the iPusnas digital library application to provide an objective overview of service quality and user experience. Numerous complaints related to login failures, application crashes, and issues in accessing digital books indicate the need for a computational approach capable of processing large volumes of user feedback. The proposed method integrates Natural Language Processing (NLP) techniques with the Support Vector Machine (SVM) algorithm. The workflow consists of collecting 2,000 reviews, applying text cleaning and normalization, tokenization, stopword removal, stemming, rating-based sentiment annotation, and feature extraction using TF-IDF. The dataset was divided using a train–test split for model training and evaluation. Experimental results show that the SVM model achieves 90.1% accuracy, demonstrating strong performance in detecting negative sentiments and moderate performance for positive sentiments due to class imbalance. These findings highlight the effectiveness of NLP and SVM for extracting user perceptions and indicate the potential of this model as a decision-support tool for improving iPusnas application services. Overall, the study contributes to the advancement of digital service innovation in Indonesia.
Downloads
References
N. A. K. M. Haris, S. Mutalib, A. M. A. Malik, S. Abdul-Rahman, and S. N. K. Kamarudin, “Sentiment Classification From Reviews for Tourism Analytics,” Int. J. Adv. Intell. Informatics, vol. 9, no. 1, p. 108, 2023, doi: 10.26555/ijain.v9i1.1077.
T. Kolajo, O. Daramola, A. A. Adebiyi, and A. Seth, “A Framework for Pre-Processing of Social Media Feeds Based on Integrated Local Knowledge Base,” Inf. Process. Manag., vol. 57, no. 6, p. 102348, 2020, doi: 10.1016/j.ipm.2020.102348.
E. H. Muktafin, P. Pramono, and K. Kusrini, “Sentiments Analysis of Customer Satisfaction in Public Services Using K-Nearest Neighbors Algorithm and Natural Language Processing Approach,” Telkomnika (Telecommunication Comput. Electron. Control., vol. 19, no. 1, p. 146, 2021, doi: 10.12928/telkomnika.v19i1.17417.
A. B. P. Negara, “The Influence of Applying Stopword Removal and Smote on Indonesian Sentiment Classification,” Lontar Komput. J. Ilm. Teknol. Inf., vol. 14, no. 03, pp. 172–185, 2025, doi: 10.24843/lkjiti.2023.v14.i03.p05.
N. N. Alabid and Z. Naseer, “Summarizing Twitter Posts Regarding COVID-19 Based on N-Grams,” Indones. J. Electr. Eng. Comput. Sci., vol. 31, no. 2, p. 1008, 2023, doi: 10.11591/ijeecs.v31.i2.pp1008-1015.
I. A. Farha and W. Magdy, “A Comparative Study of Effective Approaches for Arabic Sentiment Analysis,” Inf. Process. Manag., vol. 58, no. 2, p. 102438, 2021, doi: 10.1016/j.ipm.2020.102438.
R. Guido, M. C. Groccia, and D. Conforti, “A Hyper-Parameter Tuning Approach for Cost-Sensitive Support Vector Machine Classifiers,” Soft Comput., vol. 27, no. 18, pp. 12863–12881, 2022, doi: 10.1007/s00500-022-06768-8.
M. Khairy, T. M. Mahmoud, and T. A. El‐Hafeez, “The Effect of Rebalancing Techniques on the Classification Performance in Cyberbullying Datasets,” Neural Comput. Appl., vol. 36, no. 3, pp. 1049–1065, 2023, doi: 10.1007/s00521-023-09084-w.
S. F. TAŞKIRAN, B. Türkoğlu, E. Kaya, and T. Aşuroğlu, “A Comprehensive Evaluation of Oversampling Techniques for Enhancing Text Classification Performance,” Sci. Rep., vol. 15, no. 1, 2025, doi: 10.1038/s41598-025-05791-7.
W. Rhmann, “An Empirical Study on the Class Imbalance Handling Techniques for Different Diseases,” Soft Comput., vol. 28, no. 19, pp. 11439–11456, 2024, doi: 10.1007/s00500-024-09881-y.
S. Riyanto, I. S. Sitanggang, T. Djatna, and T. D. Atikah, “Comparative Analysis Using Various Performance Metrics in Imbalanced Data for Multi-Class Text Classification,” Int. J. Adv. Comput. Sci. Appl., vol. 14, no. 6, 2023, doi: 10.14569/ijacsa.2023.01406116.
M. Reusens et al., “Evaluating Text Classification: A Benchmark Study,” Expert Syst. Appl., vol. 254, p. 124302, 2024, doi: 10.1016/j.eswa.2024.124302.
E. Elyan, C. F. Moreno‐García, and C. Jayne, “CDSMOTE: Class Decomposition and Synthetic Minority Class Oversampling Technique for Imbalanced-Data Classification,” Neural Comput. Appl., vol. 33, no. 7, pp. 2839–2851, 2020, doi: 10.1007/s00521-020-05130-z.
W. Saidi, A. E. Abderrahmani, and K. Satori, “Effective Comparative Evaluation of Sentiment Analysis Using Paired T-Test: A Performance Study of Supervised Methods,” J. Southwest Jiaotong Univ., vol. 58, no. 5, 2023, doi: 10.35741/issn.0258-2724.58.5.28.
P. Vuttipittayamongkol, E. Elyan, and A. Petrovski, “On the Class Overlap Problem in Imbalanced Data Classification,” Knowledge-Based Syst., vol. 212, p. 106631, 2021, doi: 10.1016/j.knosys.2020.106631.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Akmal Mustafa, Rudi Kurniawan, Bani Nurhakim, Puji Pramudya Marta, Khaerul Anam

This work is licensed under a Creative Commons Attribution 4.0 International License.
Universitas Harapan Medan






.png)


.png)


.png)

.png)


.png)



