Predicting Software Sales Performance Using Support Vector Regression (SVR) and Linear Regression Algorithms A Comparative Study on Machine Learning Approaches for Sales Forecasting

Main Article Content

Muhammad Athallah Rafi
Alvin Adam Anton Suryadarma
Hazbie Alfarhizi Syahwadana
Aji Setiawan

Abstract

Software has become an essential part of everyday life, both in the workplace and in education. Various applications such as Microsoft Office and Google Workspace are widely used to enhance productivity. As public demand for digital solutions continues to rise, software distribution through global platforms such as Amazon has also seen significant growth. However, not all software products are able to achieve high sales figures due to the lack of effective strategies in understanding consumer behavior and market demands. Therefore, accurate sales prediction plays a crucial role in supporting successful software marketing strategies.  


This study aims to predict the best-selling software on Amazon by applying two algorithms: Linear Regression and Support Vector Regression (SVR). Before implementing these algorithms, several stages were conducted, including identifying the research object, preprocessing the data—where the original dataset consisting of 2,424 rows was reduced to 1,338 rows—followed by splitting the dataset into 80% training, 10% validation, and 10% testing sets. The final stage involved developing and comparing prediction models using both the Linear Regression and SVR algorithms. The results of this study are expected to contribute to determining the most suitable algorithm for predicting software sales and to serve as a reference for future research in this field

Article Details

Section
Articles
Author Biography

Aji Setiawan, Universitas Darma Persada

Dr. Aji Setiawan, S.Kom., MMSI, is a lecturer in the Department of Informatics Engineering, Faculty of Engineering, Universitas Darma Persada, Indonesia. His research interests include information systems, data analytics, and software engineering.

References

B. Moharana, B. B. Biswal, S. Dey, M. K. Rath and S. Banerjee, "Play Store App Analysis & Rating Prediction Using Classical ML Models & Artificial Neural Network," 2023 7th International Conference On Computing, Communication, Control And Automation (ICCUBEA), Pune, India, 2023, pp. 1-5, doi: 10.1109/ICCUBEA58933.2023.10391960.

Botchkarev, A. (2019). "A new typology design of performance metrics to measure errors in machine learning regression algorithms." Interdisciplinary Journal of Information, Knowledge, and Management, 14, 45-76. https://doi.org/10.28945/4184.

Cheriyan, S., et al. (2018). "Intelligent Sales Prediction Using Machine Learning Techniques." 2018 International Conference on Computing, Power and Communication Technologies (GUCON). IEEE.DOI: 10.1109/iCCECOME.2018.8659115

Chicco, D., Warrens, M. J., & Jurman, G. (2021). "The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation." PeerJ Computer Science, 7, e623.https://doi.org/10.7717/peerj-cs.623

García, S., Luengo, J., & Herrera, F. (2016). "Data preprocessing in data mining." Springer International Publishing, 1-3.

Gumus, M., & Kiran, M. S. (2017). "Crude oil price forecasting using XGBoost, support vector regression and artificial neural networks." International Journal of Energy Economics and Policy, 7(6), 46-55. https://doi.org/10.1109/UBMK.2017.8093500

Ismail, Mustapha & Tukur, Hafsat & Friday, Mamudu. (2025). Sales Prediction using Ensemble Machine Learning Model. International Journal of Scientific Research and Modern Technology. 4. 24-35. 10.38124/ijsrmt.v4i3.350.

Kotsiantis, S. B., Kanellopoulos, D., & Pintelas, P. E. (2006). "Data preprocessing for supervised learning." International Journal of Computer Science, 1(2), 111-117.

Lu, C. J., et al. (2014). "Sales forecasting for computer products based on a variable selection scheme and support vector regression." Neurocomputing, 128, 491-499. https://doi.org/10.1016/j.neucom.2013.08.012

Pavlyshenko, B. M. (2019). "Machine-learning models for sales time series forecasting." Data, 4(1), 15. https://doi.org/10.3390/data4010015

Schneider, P., & Gupta, A. (2016). "Forecasting sales of new and existing products using consumer reviews: A machine learning approach." Information Systems Frontiers, 18, 247-263. https://doi.org/10.1016/j.ijforecast.2015.08.005

Tarta, E. N., et al. (2021). "Comparison of Linear Regression and Random Forest Algorithms for Sales Forecasting." 2021 IEEE International Conference on Automation/XXIV Congress of the Chilean Association of Automatic Control (ICA-ACCA).

Vapnik, V. (1995). The Nature of Statistical Learning Theory. Springer, New York.

Xu, Q., et al. (2019). "Product adoption and sales forecast for e-commerce: A review." Electronic Commerce Research and Applications, 36, 100869.

Wang G. (2022). Sales Forecasting for Firms based on Multiple Regression Model. In Proceedings of the International Conference on Big Data Economy and Digital Management - Volume 1: BDEDM, ISBN 978-989-758-593-7, pages 628-633. DOI: 10.5220/0011198600003440