LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published 15 days ago • 50
EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety Paper • 2504.09689 • Published Apr 13 • 7
Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion Paper • 2504.11447 • Published Apr 15 • 5
Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning Paper • 2505.16270 • Published May 22 • 6
On Path to Multimodal Historical Reasoning: HistBench and HistAgent Paper • 2505.20246 • Published May 26
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution Paper • 2505.20286 • Published May 26 • 7