Images are Worth Variable Length of Representations Paper • 2506.03643 • Published 3 days ago • 2 • 2
BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations Paper • 2506.02587 • Published 4 days ago • 2 • 2
FlexPainter: Flexible and Multi-View Consistent Texture Generation Paper • 2506.02620 • Published 4 days ago • 9 • 2
RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS Paper • 2506.02751 • Published 4 days ago • 3 • 2
Geometry-Editable and Appearance-Preserving Object Composition Paper • 2505.20914 • Published 11 days ago • 5 • 2
StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs Paper • 2506.03077 • Published 4 days ago • 15 • 2
What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-training Paper • 2506.00981 • Published 6 days ago • 1 • 2
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models Paper • 2505.23656 • Published 9 days ago • 23 • 2
SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios Paper • 2506.02444 • Published 4 days ago • 1 • 3
Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights Paper • 2506.02865 • Published 4 days ago • 27 • 2
Diffusion-Based Generative Models for 3D Occupancy Prediction in Autonomous Driving Paper • 2505.23115 • Published 9 days ago • 2 • 2
Autoregressive Images Watermarking through Lexical Biasing: An Approach Resistant to Regeneration Attack Paper • 2506.01011 • Published 6 days ago • 8 • 2
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers Paper • 2506.00830 • Published 6 days ago • 4 • 2
Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach Paper • 2506.03238 • Published 4 days ago • 1 • 2