Classification of Indonesian University Entrance Tweets Using Machine Learning

Authors

  • Angga Zakariya Universitas Negeri Surabaya https://orcid.org/0009-0000-1909-130X
  • Fahmi Hasan Firdaus Universitas Negeri Surabaya
  • Muhammad Fahril Syahputra Universitas Negeri Surabaya
  • Muchammad Abdulloh 'Ubaid
  • Wiyli Yustanti
  • Cendra Devayana Putra National Cheng Kung University
  • Monica Cinthya Universitas Negeri Surabaya

DOI:

https://doi.org/10.26740/jistel.v2n1.p64-76

Keywords:

Data Mining, Scrapping, Text Classification, Twitter, Machine Learning, Neural Network

Abstract

Entrance selection for State Universities (PTN) in Indonesia, specifically SNBP, SNBT, and Independent Selection (Mandiri), is a trending topic on social media, particularly Twitter. However, the high volume of tweets creates noise, making it difficult for prospective students to find relevant information. This study aims to classify tweets into three categories (SNBP, SNBT, Mandiri) and compare the performance of four machine learning models: Naïve Bayes, Support Vector Machine (SVM), Artificial Neural Network (ANN), and Long Short-Term Memory (LSTM). The data consists of 12,442 tweets collected using specific keywords. The methodology involves preprocessing (cleaning, normalization, stemming), feature extraction using TF-IDF, and model evaluation. The results show that SVM achieved the best performance with an accuracy of 89.41% and an F1-Score of 89.44%, outperforming the deep learning models (ANN and LSTM) for this specific dataset. These findings indicate that traditional machine learning models can be more effective for text classification with moderate dataset sizes compared to complex deep learning architectures.

Downloads

Published

2026-05-25

How to Cite

Angga Zakariya, Fahmi Hasan Firdaus, Muhammad Fahril Syahputra, Muchammad Abdulloh ’Ubaid, Wiyli Yustanti, Cendra Devayana Putra, & Monica Cinthya. (2026). Classification of Indonesian University Entrance Tweets Using Machine Learning. Journal of Intelligent System and Telecommunication, 2(1), 64–76. https://doi.org/10.26740/jistel.v2n1.p64-76
Abstract views: 1 , PDF Downloads: 7