1.
Aderinokun A. Unified Multimodal Transformers: Improving Vision-Language Models with Knowledge-Guided Attention Mechanisms. MZJAI [Internet]. 2024 Sep. 8 [cited 2025 Jul. 18];1(2). Available from: http://mzresearch.com/index.php/MZJAI/article/view/272