SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training Paper • 2506.05301 • Published 1 day ago • 36
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development Paper • 2506.05010 • Published 1 day ago • 38
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics Paper • 2506.04308 • Published 2 days ago • 32
FlexPainter: Flexible and Multi-View Consistent Texture Generation Paper • 2506.02620 • Published 3 days ago • 9
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models Paper • 2506.05176 • Published 1 day ago • 22
Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes Paper • 2506.00227 • Published 7 days ago • 9
Robustness in Both Domains: CLIP Needs a Robust Text Encoder Paper • 2506.03355 • Published 3 days ago • 6
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Paper • 2506.04225 • Published 2 days ago • 21
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning Paper • 2506.04207 • Published 2 days ago • 41
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation Paper • 2506.03139 • Published 3 days ago • 13
LayerFlow: A Unified Model for Layer-aware Video Generation Paper • 2506.04228 • Published 2 days ago • 13
RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions Paper • 2506.03448 • Published 3 days ago • 4
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published 7 days ago • 112