Comparative analysis of machine learning classifiers
Article Sidebar
Issue Vol. 38 (2026)
-
Comparative analysis of Next.js and Astro frameworks
Patryk Gieda, Marek Miłosz1-5
-
Behavioral analysis of ransomware threats to ESXi Hypervisors: a machine learning-based predictive model
Upakar Bhatta6-10
-
Performance and usability evaluation of a VR virtual museum application
Kamil Gabrysiewicz, Krzysztof Dziedzic11-18
-
Comparative analysis of selected data visualization methods
Damian Węcławski, Radosław Tomczyk, Paweł Powroźnik19-25
-
A performance comparison of web programming interfaces: GraphQL, gRPC and Thrift
Piotr Rożek, Mariusz Dzieńkowski26-31
-
A review of security mechanisms in electronic payment systems
Omniah ALibrahim, Suhair Alshehri32-42
-
Clustering methods in machine learning
Bartłomiej Głuszczak, Paweł Powroźnik43-50
-
Comparative analysis of interpretable artificial intelligence methods
Aleksandra Kuszewska, Małgorzata Charytanowicz51-58
-
Comparative analysis of machine learning classifiers
Łukasz Krukowski, Grzegorz Kozieł59-65
-
Analysis of the impact of machine learning algorithms on the quality of generated sounds
Krzysztof Pedrycz, Mateusz Pikula66-72
-
Comparative analysis of the functionalities of applications supporting the self-control process of anticoagulation therapy
Marcin Furmaga, Vitalii Baida73-80
-
Application of machine learning for predicting Formula 1 race results
Sylwia Krzysztoń, Jakub Smołka81-86
-
Analysis of latency reduction and performance improvement methods in selected VR applications
Mateusz Czapczyński, Krzysztof Dziedzic87-94
-
Integrating deep learning image analysis into Web GIS applications: A Hybrid Flask - Spring Boot architecture for automated place detection
Medjon HYSENAJ95-101
Main Article Content
DOI
Authors
Abstract
This study presents a comparative analysis of five machine learning classification algorithms: support vector machine (SVM), multilayer perceptron (MLP), classification and regression tree (CART), k-nearest neighbors algorithm (K-NN), and naive Bayes classifier (NB) across four datasets from various domains. Using nested cross-validation, the research evaluated classifier performance on Heart Disease, German Credit, Spambase, and Online Shoppers Purchasing Intention datasets. Results demonstrated that no single classifier consistently outperformed others across all datasets and selection should be based on dataset characteristics and application requirements. Dataset characteristics emerged as the primary factor influencing performance, with class imbalance proving particularly problematic. Training efficiency analysis revealed that simpler algorithms can maintain competitive performance with lower computational costs.
Keywords:
References
[1] D. A. Omondiagbe, S. Veeramani, A. S. Sidhu, Machine Learning Classification Techniques for Breast Cancer Diagnosis, IOP Conf Ser Mater Sci Eng 495 (2019) 012033, https://doi.org/10.1088/1757-899X/495/1/012033. DOI: https://doi.org/10.1088/1757-899X/495/1/012033
[2] R. M. Rahman, F. Afroz, Comparison of Various Classification Techniques Using Different Data Mining Tools for Diabetes Diagnosis, Journal of Software Engineering and Applications 6 (2013) 85–97, https://doi.org/10.4236/jsea.2013.63013. DOI: https://doi.org/10.4236/jsea.2013.63013
[3] V. Kunwar, K. Chandel, A. S. Sabitha, A. Bansal, Chronic Kidney Disease analysis using data mining classification techniques, Proceedings of the 2016 6th International Conference - Cloud System and Big Data Engineering (2016) 300–305, https://doi.org/10.1109/CONFLUENCE.2016.7508132. DOI: https://doi.org/10.1109/CONFLUENCE.2016.7508132
[4] C. S. Dangare, S. S. Apte, Improved Study of Heart Disease Prediction System using Data Mining Classification Techniques, Int J Comput Appl 47 (2012) 44–48, https://doi.org/10.5120/7228-0076. DOI: https://doi.org/10.5120/7228-0076
[5] C. B. C. Latha, S. C. Jeeva, Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques, Inform Med Unlocked 16 (2019) 100203, https://doi.org/10.1016/j.imu.2019.100203. DOI: https://doi.org/10.1016/j.imu.2019.100203
[6] I. Kurt, M. Ture, A. T. Kurum, Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease, Expert Syst Appl 34 (2008) 366–374, https://doi.org/10.1016/j.eswa.2006.09.004. DOI: https://doi.org/10.1016/j.eswa.2006.09.004
[7] D. Khanna, R. Sahu, V. Baths, B. Deshpande, Comparative Study of Classification Techniques (SVM, Logistic Regression and Neural Networks) to Predict the Prevalence of Heart Disease, Int J Mach Learn Comput 5 (2015) 414–419, https://doi.org/10.7763/ijmlc.2015.v5.544. DOI: https://doi.org/10.7763/IJMLC.2015.V5.544
[8] M. Ture, I. Kurt, A. Turhan Kurum, K. Ozdamar, Comparing classification techniques for predicting essential hypertension, Expert Syst Appl 29 (2005) 583–588, https://doi.org/10.1016/j.eswa.2005.04.014. DOI: https://doi.org/10.1016/j.eswa.2005.04.014
[9] P. R. Harper, A review and comparison of classification algorithms for medical decision making, Health Policy 71 (2005) 315–331, https://doi.org/10.1016/j.healthpol.2004.05.002. DOI: https://doi.org/10.1016/j.healthpol.2004.05.002
[10] D. Gupta, U. Ghose, A comparative study of classification algorithms for forecasting rainfall, In 2015 4th International Conference on Reliability, Infocom Technologies and Optimization: Trends and Future Directions (2015) 1–6, https://doi.org/10.1109/ICRITO.2015.7359273. DOI: https://doi.org/10.1109/ICRITO.2015.7359273
[11] B. Heung, H. C. Ho, J. Zhang, A. Knudby, C. E. Bulmer, M. G. Schmidt, An overview and comparison of machine-learning techniques for classification purposes in digital soil mapping, Geoderma 265 (2016) 62–77, https://doi.org/10.1016/j.geoderma.2015.11.014. DOI: https://doi.org/10.1016/j.geoderma.2015.11.014
[12] P. K. Srivastava, D. Han, M. A. Rico-Ramirez, M. Bray, T. Islam, Selection of classification techniques for land use/land cover change investigation, Advances in Space Research 50 (2012) 1250–1265, https://doi.org/10.1016/j.asr.2012.06.032. DOI: https://doi.org/10.1016/j.asr.2012.06.032
[13] M. Park, S. Chai, Constructing a User-Centered Fake News Detection Model by Using Classification Algorithms in Machine Learning Techniques, IEEE Access 11 (2023) 71517–71527, https://doi.org/10.1109/ACCESS.2023.3294613. DOI: https://doi.org/10.1109/ACCESS.2023.3294613
[14] M. Shafiq, X. Yu, A. A. Laghari, L. Yao, N. K. Karn, F. Abdessamia, Network Traffic Classification techniques and comparative analysis using Machine Learning algorithms, In 2016 2nd IEEE International Conference on Computer and Communications (2016) 2451–2455, https://doi.org/10.1109/COMPCOMM.2016.7925139. DOI: https://doi.org/10.1109/CompComm.2016.7925139
[15] M. Thangaraj, M. Sivakami, Text classification techniques: A literature review, Interdisciplinary Journal of Information, Knowledge, and Management 13 (2018) 117–135, https://doi.org/10.28945/4066. DOI: https://doi.org/10.28945/4066
[16] S. Shakya, Analysis of Artificial Intelligence based Image Classification Techniques, Journal of Innovative Image Processing 2 (2020) 44–54, https://doi.org/10.36548/jiip.2020.1.005. DOI: https://doi.org/10.36548/jiip.2020.1.005
[17] P. Herrera, A. Yeterian, F. Gouyon, Automatic Classification of Drum Sounds: A Comparison of Feature Selection Methods and Classification Techniques, In: C. Anagnostopoulou, M. Ferrand, A. Smaill, (eds) Music and Artificial Intelligence. ICMAI 2002. Lecture Notes in Computer Science, Springer 2445 (2002) 69–80, https://doi.org/10.1007/3-540-45722-4_8. DOI: https://doi.org/10.1007/3-540-45722-4_8
[18] A. L. B. Miranda, L. P. F. Garcia, A. C. P. L. F. Carvalho, A. C. Lorena, Use of Classification Algorithms in Noise Detection and Elimination, In: E. Corchado, X. Wu, E. Oja, Á. Herrero, B. Baruque, (eds) Hybrid Artificial Intelligence Systems. HAIS 2009. Lecture Notes in Computer Science, Springer 5572 (2009) 417–424, https://doi.org/10.1007/978-3-642-02319-4_50. DOI: https://doi.org/10.1007/978-3-642-02319-4_50
[19] M. Mayilvaganan, D. Kalpanadevi, Comparison of classification techniques for predicting the performance of students academic environment, In 2014 International Conference on Communication and Network Technologies (2014) 113–118, https://doi.org/10.1109/CNT.2014.7062736. DOI: https://doi.org/10.1109/CNT.2014.7062736
[20] K. Bunkar, U. K. Singh, B. Pandya, R. Bunkar, Data mining: Prediction for performance improvement of graduate students using classification, In 2012 Ninth International Conference on Wireless and Optical Communications Networks (2012) 1–5, https://doi.org/10.1109/WOCN.2012.6335530. DOI: https://doi.org/10.1109/WOCN.2012.6335530
[21] M. C. Yu, Multi-criteria ABC analysis using artificial-intelligence-based classification techniques, Expert Syst Appl 38 (2011) 3416–3421, https://doi.org/10.1016/j.eswa.2010.08.127. DOI: https://doi.org/10.1016/j.eswa.2010.08.127
[22] C. Ioannidis, F. Pasiouras, C. Zopounidis, Assessing bank soundness with classification techniques, Omega 38 (2010) 345–357, https://doi.org/10.1016/j.omega.2009.10.009. DOI: https://doi.org/10.1016/j.omega.2009.10.009
[23] Heart Disease - UCI Machine Learning Repository, https://archive.ics.uci.edu/dataset/45/heart+disease [05.08.2025].
[24] Statlog (German Credit Data) - UCI Machine Learning Repository, https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data [06.08.2025].
[25] Spambase - UCI Machine Learning Repository, https://archive.ics.uci.edu/dataset/94/spambase [06.08.2025].
[26] Online Shoppers Purchasing Intention Dataset - UCI Machine Learning Repository, https://archive.ics.uci.edu/dataset/468/online+shoppers+purchasing+intention+dataset [11.08.2025].
[27] G. C. Cawley, N. L.C. Talbot, On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation, The Journal of Machine Learning Research 11 (2010) 2079–2107, https://doi.org/10.5555/1756006.1859921.
Article Details
Abstract views: 10

