ATLAS: Learning to Optimally Memorize the Context at Test Time Paper • 2505.23735 • Published 8 days ago • 22
FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian Paper • 2505.22759 • Published 9 days ago • 20
VidText: Towards Comprehensive Evaluation for Video Text Understanding Paper • 2505.22810 • Published 9 days ago • 20
Backdoor Cleaning without External Guidance in MLLM Fine-tuning Paper • 2505.16916 • Published 15 days ago • 16
VideoGameQA-Bench: Evaluating Vision-Language Models for Video Game Quality Assurance Paper • 2505.15952 • Published 16 days ago • 19