Klasifikasi Sentimen Ulasan Ipusnas Menggunakan Metode Natural Language Processing dan Support Vector Machine

Authors

  • Akmal Mustafa -
  • Rudi Kurniawan STMIK IKMI Cirebon
  • Bani Nurhakim STMIK IKMI Cirebon
  • Puji Pramudya Marta STMIK IKMI Cirebon
  • Khaerul Anam STMIK IKMI Cirebon

Keywords:

Analisis Sentimen, Natural Language Processing, Support Vector Machine, iPusnas, Literasi Digital

Abstract

This study investigates sentiment classification of user reviews on the iPusnas digital library application to provide an objective overview of service quality and user experience. Numerous complaints related to login failures, application crashes, and issues in accessing digital books indicate the need for a computational approach capable of processing large volumes of user feedback. The proposed method integrates Natural Language Processing (NLP) techniques with the Support Vector Machine (SVM) algorithm. The workflow consists of collecting 2,000 reviews, applying text cleaning and normalization, tokenization, stopword removal, stemming, rating-based sentiment annotation, and feature extraction using TF-IDF. The dataset was divided using a train–test split for model training and evaluation. Experimental results show that the SVM model achieves 90.1% accuracy, demonstrating strong performance in detecting negative sentiments and moderate performance for positive sentiments due to class imbalance. These  findings highlight the effectiveness of NLP and SVM for extracting user perceptions and indicate the potential of this model as a decision-support tool for improving iPusnas application services. Overall, the study contributes to the advancement of digital service innovation in Indonesia.

Downloads

Download data is not yet available.

References

N. A. K. M. Haris, S. Mutalib, A. M. A. Malik, S. Abdul-Rahman, and S. N. K. Kamarudin, “Sentiment Classification From Reviews for Tourism Analytics,” Int. J. Adv. Intell. Informatics, vol. 9, no. 1, p. 108, 2023, doi: 10.26555/ijain.v9i1.1077.

T. Kolajo, O. Daramola, A. A. Adebiyi, and A. Seth, “A Framework for Pre-Processing of Social Media Feeds Based on Integrated Local Knowledge Base,” Inf. Process. Manag., vol. 57, no. 6, p. 102348, 2020, doi: 10.1016/j.ipm.2020.102348.

E. H. Muktafin, P. Pramono, and K. Kusrini, “Sentiments Analysis of Customer Satisfaction in Public Services Using K-Nearest Neighbors Algorithm and Natural Language Processing Approach,” Telkomnika (Telecommunication Comput. Electron. Control., vol. 19, no. 1, p. 146, 2021, doi: 10.12928/telkomnika.v19i1.17417.

A. B. P. Negara, “The Influence of Applying Stopword Removal and Smote on Indonesian Sentiment Classification,” Lontar Komput. J. Ilm. Teknol. Inf., vol. 14, no. 03, pp. 172–185, 2025, doi: 10.24843/lkjiti.2023.v14.i03.p05.

N. N. Alabid and Z. Naseer, “Summarizing Twitter Posts Regarding COVID-19 Based on N-Grams,” Indones. J. Electr. Eng. Comput. Sci., vol. 31, no. 2, p. 1008, 2023, doi: 10.11591/ijeecs.v31.i2.pp1008-1015.

I. A. Farha and W. Magdy, “A Comparative Study of Effective Approaches for Arabic Sentiment Analysis,” Inf. Process. Manag., vol. 58, no. 2, p. 102438, 2021, doi: 10.1016/j.ipm.2020.102438.

R. Guido, M. C. Groccia, and D. Conforti, “A Hyper-Parameter Tuning Approach for Cost-Sensitive Support Vector Machine Classifiers,” Soft Comput., vol. 27, no. 18, pp. 12863–12881, 2022, doi: 10.1007/s00500-022-06768-8.

M. Khairy, T. M. Mahmoud, and T. A. El‐Hafeez, “The Effect of Rebalancing Techniques on the Classification Performance in Cyberbullying Datasets,” Neural Comput. Appl., vol. 36, no. 3, pp. 1049–1065, 2023, doi: 10.1007/s00521-023-09084-w.

S. F. TAŞKIRAN, B. Türkoğlu, E. Kaya, and T. Aşuroğlu, “A Comprehensive Evaluation of Oversampling Techniques for Enhancing Text Classification Performance,” Sci. Rep., vol. 15, no. 1, 2025, doi: 10.1038/s41598-025-05791-7.

W. Rhmann, “An Empirical Study on the Class Imbalance Handling Techniques for Different Diseases,” Soft Comput., vol. 28, no. 19, pp. 11439–11456, 2024, doi: 10.1007/s00500-024-09881-y.

S. Riyanto, I. S. Sitanggang, T. Djatna, and T. D. Atikah, “Comparative Analysis Using Various Performance Metrics in Imbalanced Data for Multi-Class Text Classification,” Int. J. Adv. Comput. Sci. Appl., vol. 14, no. 6, 2023, doi: 10.14569/ijacsa.2023.01406116.

M. Reusens et al., “Evaluating Text Classification: A Benchmark Study,” Expert Syst. Appl., vol. 254, p. 124302, 2024, doi: 10.1016/j.eswa.2024.124302.

E. Elyan, C. F. Moreno‐García, and C. Jayne, “CDSMOTE: Class Decomposition and Synthetic Minority Class Oversampling Technique for Imbalanced-Data Classification,” Neural Comput. Appl., vol. 33, no. 7, pp. 2839–2851, 2020, doi: 10.1007/s00521-020-05130-z.

W. Saidi, A. E. Abderrahmani, and K. Satori, “Effective Comparative Evaluation of Sentiment Analysis Using Paired T-Test: A Performance Study of Supervised Methods,” J. Southwest Jiaotong Univ., vol. 58, no. 5, 2023, doi: 10.35741/issn.0258-2724.58.5.28.

P. Vuttipittayamongkol, E. Elyan, and A. Petrovski, “On the Class Overlap Problem in Imbalanced Data Classification,” Knowledge-Based Syst., vol. 212, p. 106631, 2021, doi: 10.1016/j.knosys.2020.106631.

Downloads

Published

2026-05-06

Issue

Section

Articles