Submitted by BestWishYsh 22 UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation · 12 authors 1
Submitted by zelaix 17 VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments · 8 authors 1
Submitted by AnonMegumi 17 MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs · 9 authors 1
Submitted by xyliu6 11 SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis · 6 authors 1
Submitted by luojunyu 11 FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation · 13 authors 2
Submitted by Lingaaaaaaa 10 Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning · 5 authors 1
Submitted by gentaiscool 3 Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability · 20 authors 1
Submitted by erjui 3 PCoreSet: Effective Active Learning through Knowledge Distillation from Vision-Language Models · 5 authors 2
Submitted by yiren98 2 RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers · 5 authors 1
Submitted by Hila 2 FlowMo: Variance-Based Flow Guidance for Coherent Motion in Video Generation · 4 authors 1
Submitted by amazingj 1 M^3FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset · 6 authors 1
Submitted by hyungjoochae 1 One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL · 9 authors 1
Submitted by danielmisrael 1 Accelerating Diffusion LLMs via Adaptive Parallel Decoding · 3 authors 1
Submitted by zhaoruiyang - Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework · 8 authors 1