Detection of suspicious facial objects in neutral ATMs using deep learning architectures based on YOLOV8 and Faster R-CNN

Marco Manuel ARAGON PAUCAR; Kelvin Yhonson FERNANDEZ ACERO; Erasmo SULLA ESPINOZA

doi:10.35784/acs_8906

Published: Jun 30, 2026

Issue Vol. 22 No. 2 (2026)

Authors

Marco Manuel ARAGON PAUCAR

maragonp@unsa.edu.pe

Universidad Nacional de San Agustín de Arequipa, Peru

https://orcid.org/0009-0009-8805-1251

Kelvin Yhonson FERNANDEZ ACERO

kfernandezac@unsa.edu.pe

Universidad Nacional de San Agustín de Arequipa, Peru

https://orcid.org/0009-0006-3352-1330

Erasmo SULLA ESPINOZA

esullae@unsa.edu.pe

Universidad Nacional de San Agustín de Arequipa, Peru

https://orcid.org/0000-0002-1223-1223

Abstract

This study presents an automatic system for detecting suspicious facial objects at neutral automated teller machines (ATMs) and compares two deep learning architectures, YOLOv8 and Faster R-CNN. A dataset was built from real ATM surveillance videos and complementary images; frames were extracted and annotated for masks, hats, and glasses. Both models were trained with the same preprocessing pipeline and evaluated with standard object detection metrics (precision, recall, F1-score, Intersection over Union (IoU), and mean Average Precision (mAP)) to analyze their performance under real surveillance conditions. The results show that YOLOv8 achieves higher precision and therefore produces fewer false positives, while Faster R-CNN achieves higher recall and higher mAP@0.5 in several classes, indicating greater sensitivity to partially visible objects. A decision logic was also integrated to automatically classify each scene as NORMAL or SUSPECT based on the combined presence of face-occluding elements. The implementation demonstrates that computer vision systems can complement security mechanisms in neutral ATMs by providing early detection of potential risks and enabling real-time remote monitoring.
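The evaluation metrics and the scene-level decision named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Detection` type, the label strings, the 0.5 confidence threshold, and the rule "SUSPECT when at least two distinct occluding classes are present" are all assumptions chosen for illustration, since the abstract says only that the decision depends on the combined presence of face-occluding elements.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Detection:
    """One detector output (hypothetical structure, not from the paper)."""
    label: str                               # assumed labels: "mask", "hat", "glasses"
    score: float                             # detector confidence in [0, 1]
    box: tuple[float, float, float, float]   # (x1, y1, x2, y2) in pixels


def iou(a, b):
    """Intersection over Union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def precision_recall_f1(tp, fp, fn):
    """Precision, recall and F1 from true/false positive and false negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def classify_scene(detections, min_score=0.5, min_classes=2):
    """Label a frame SUSPECT when enough distinct occluding classes are present.

    Both thresholds are illustrative assumptions, not values from the paper.
    """
    present = {d.label for d in detections if d.score >= min_score}
    return "SUSPECT" if len(present) >= min_classes else "NORMAL"


if __name__ == "__main__":
    frame = [
        Detection("mask", 0.91, (40, 60, 120, 130)),
        Detection("hat", 0.78, (35, 10, 125, 55)),
    ]
    print(classify_scene(frame))
    print(iou((0, 0, 2, 2), (1, 1, 3, 3)))
    print(precision_recall_f1(tp=8, fp=2, fn=2))
```

Under this assumed rule, a frame with only glasses detected stays NORMAL, while a mask combined with a hat is flagged. A deployed system would tune the rule and thresholds against labelled surveillance footage.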

Keywords:

object detection, face occlusion, YOLOv8, Faster R-CNN, ATM security

Sustainable Development Goals (SDG)

16 - Peace, justice and strong institutions

ARAGON PAUCAR, M. M., FERNANDEZ ACERO, K. Y., & SULLA ESPINOZA, E. (2026). Detection of suspicious facial objects in neutral ATMs using deep learning architectures based on YOLOV8 and Faster R-CNN. Applied Computer Science, 22(2), 16–32. https://doi.org/10.35784/acs_8906
