Evaluating the effectiveness of selected tools in recognizing emotions from facial photos

Main Article Content

DOI

Klaudiusz Wierzbowski

s95605@pollub.edu.pl

https://orcid.org/0009-0007-0229-3272

Abstract

Emotion recognition from facial images has become a key area in computer vision and affective computing. Deep learning models such as convolutional neural networks and vision transformers have shown high potential in this domain. In this study, the performance of two representative architectures, ResNet-50, a convolutional neural networks based model, and ViT-B/16, a transformer-based model, is evaluated on the widely used Facial Expression Recognition 2013 dataset. Both models are trained using data augmentation and regularization techniques to enhance generalization. Their effectiveness is assessed using metrics including accuracy, precision, recall, and F1-score, alongside a detailed examination of confusion matrices. The observed differences in classification performance across emotion categories highlight the influence of architectural design on model behavior. The obtained results serve as a reference point for selecting appropriate deep learning architectures.

Keywords:

convolutional neural networks, vision transformers, emotion recognition

References

Article Details

Wierzbowski, K. (2025). Evaluating the effectiveness of selected tools in recognizing emotions from facial photos. Journal of Computer Sciences Institute, 37, 443–450. https://doi.org/10.35784/jcsi.7973