The effectiveness of machine learning in detecting phishing websites

Main Article Content

DOI

Jacek Łukasz Wilk-Jakubowski

jwilk@tu.kielce.pl

https://orcid.org/0000-0003-1275-948X
Aleksandra Sikora

asikora@tu.kielce.pl

Dawid Maciejski

dawidmaciejski0@proton.me

https://orcid.org/0009-0003-6426-2552

Abstract

Phishing poses a significant risk in the field of digital security, requiring effective methods for identifying fraudulent websites. This study evaluated the performance of nine machine learning classification models in the context of phishing website detection. Two different input datasets were prepared: the first included the full HTML code, while the second was based on a set of features extracted from that code. The analysis revealed that models trained on the extracted features achieved nearly twice the detection performance compared to those operating on raw HTML code. The use of majority voting further improved classification effectiveness. The study results confirm that proper feature selection and the integration of outputs from multiple models significantly enhance the effectiveness of systems for detecting online threats.

Keywords:

phishing, machine learning, website classification, feature analysis, network security, threat detection

References

Article Details

Wilk-Jakubowski, J. Łukasz, Sikora, A., & Maciejski, D. (2025). The effectiveness of machine learning in detecting phishing websites. Informatyka, Automatyka, Pomiary W Gospodarce I Ochronie Środowiska, 15(3), 105–109. https://doi.org/10.35784/iapgos.8143
Author Biographies

Jacek Łukasz Wilk-Jakubowski, Kielce University of Technology, Department of Information Systems

He is an associate professor at the Kielce University of Technology, Faculty of Electrical Engineering, Automatic Control and Computer Science, Department of Information Systems. He was awarded the doctor of technical science degree (with the specialization in ICT, Teleinformatics, Data Transmission and Signal Processing) and doctor of science (habilitation) degree in the Informatics and Computer Science discipline. He is the author of several inventions that have been granted protection by the Patent Office, participant of many national and international conferences and projects, and laureate of several awards, among others for patents. He is the author more than 90 scientific publications (including 5 monographs, 5 chapters in monographs, as well as more than 80 papers).

http://orcid.org/0000-0003-1275-948X

Aleksandra Sikora, Kielce University of Technology, Department of Computer Science, Electronics and Electrical Engineering

She is an assistant professor at the Kielce University of Technology in the Faculty of Electrical Engineering, Automatic Control and Computer Science, Department of Computer Science, Electronics and Electrical Engineering. She has participated in numerous IT projects integrating scientific research with student education in areas such as cybersecurity, big data, machine learning, and digital metrology.

https://orcid.org/0009-0003-6426-2552

Dawid Maciejski, Kielce University of Technology, Faculty of Electrical Engineering, Automatic Control and Computer Science

A graduate of second-cycle studies in Computer Science, specializing in Cybersecurity, at the Faculty of Electrical Engineering, Automation, and Computer Science at the Kielce University of Technology, completed in 2025. His main areas of interest include developing machine learning methods, issues related to computer networks, and user security on the Internet.