Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published 7 days ago • 41
Has GPT-5 Achieved Spatial Intelligence? An Empirical Study Paper • 2508.13142 • Published 2 days ago • 23
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published 6 days ago • 46
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published 6 days ago • 133
Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning Paper • 2508.09726 • Published 7 days ago • 11
Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models Paper • 2508.05613 • Published 13 days ago • 16
AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving Paper • 2508.09889 • Published 7 days ago • 30
Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery Paper • 2508.08401 • Published 9 days ago • 37
Story2Board: A Training-Free Approach for Expressive Storyboard Generation Paper • 2508.09983 • Published 7 days ago • 61
Adversarial Video Promotion Against Text-to-Video Retrieval Paper • 2508.06964 • Published 11 days ago • 9
Train Long, Think Short: Curriculum Learning for Efficient Reasoning Paper • 2508.08940 • Published 8 days ago • 22
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent Paper • 2508.05748 • Published 13 days ago • 114
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning Paper • 2508.07101 • Published 11 days ago • 13
Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning Paper • 2508.08221 • Published 9 days ago • 39