StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs Paper • 2506.03077 • Published 3 days ago • 15
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models Paper • 2505.23656 • Published 8 days ago • 22
Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts Paper • 2506.05229 • Published about 24 hours ago • 31
RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS Paper • 2506.02751 • Published 3 days ago • 3
Contextual Integrity in LLMs via Reasoning and Reinforcement Learning Paper • 2506.04245 • Published 8 days ago • 3
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers Paper • 2506.00830 • Published 6 days ago • 3
Rectified Point Flow: Generic Point Cloud Pose Estimation Paper • 2506.05282 • Published about 23 hours ago • 3
FreeTimeGS: Free Gaussians at Anytime and Anywhere for Dynamic Scene Reconstruction Paper • 2506.05348 • Published about 23 hours ago • 4
Geometry-Editable and Appearance-Preserving Object Compositon Paper • 2505.20914 • Published 10 days ago • 4
MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale Paper • 2506.04405 • Published 2 days ago • 4
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 1 day ago • 16
Inference-Time Hyper-Scaling with KV Cache Compression Paper • 2506.05345 • Published about 23 hours ago • 14
FlexPainter: Flexible and Multi-View Consistent Texture Generation Paper • 2506.02620 • Published 3 days ago • 9
Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting Paper • 2506.05327 • Published about 23 hours ago • 10
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs Paper • 2506.05328 • Published about 23 hours ago • 19
MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning Paper • 2506.05331 • Published about 23 hours ago • 12
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs Paper • 2506.05344 • Published about 23 hours ago • 14
Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations Paper • 2506.04633 • Published 1 day ago • 15
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models Paper • 2506.05176 • Published 1 day ago • 20