Optimizing Multimodal LLMs for Egocentric Video Understanding: A Solution for the HD-EPIC VQA Challenge
Arxiv (CVPR 2025 EgoVis Workshop), 2026
Sicheng Yang*, Yukai Huang*, Shitong Sun, Weitong Cai, Jiankang Deng, Jifei Song, Zhensong Zhang. (2026). "Optimizing Multimodal LLMs for Egocentric Video Understanding: A Solution for the HD-EPIC VQA Challenge." Arxiv.
