Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Yuhao Dong PRO
THUdyh
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding authored a paper 10 days ago
From Pixels to Words -- Towards Native One-Vision Models at Scale upvoted a paper 10 days ago
GEM: Generative Supervision Helps Embodied Intelligence