[1]
ELSHAMY, G. et al. 2025. A multi-modal transformer-based model for generative visual dialog system.
Applied Computer Science
. 21, 1 (Mar. 2025), 1–17. DOI:https://doi.org/10.35784/acs_6856.