[1]
O. Whitaker, “A Survey on Multimodal Foundation Models: Architectures, Training Paradigms, and Emerging Applications”, JCTS, vol. 5, no. 2, Feb. 2026.