ELSHAMY, Ghada, Marco ALFONSE, Islam HEGAZY, and Mostafa AREF. “A Multi-Modal Transformer-Based Model for Generative Visual Dialog System”. Applied Computer Science 21, no. 1 (March 31, 2025): 1–17. Accessed April 11, 2025. https://ph.pollub.pl/index.php/acs/article/view/6856.