What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-training Paper • 2506.00981 • Published 5 days ago • 1
PATS: Proficiency-Aware Temporal Sampling for Multi-View Sports Skill Assessment Paper • 2506.04996 • Published 1 day ago • 1
BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations Paper • 2506.02587 • Published 3 days ago • 2
Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning Paper • 2506.05278 • Published 1 day ago • 3
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design Paper • 2506.04734 • Published 1 day ago • 6
Autoregressive Images Watermarking through Lexical Biasing: An Approach Resistant to Regeneration Attack Paper • 2506.01011 • Published 5 days ago • 8
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 1 day ago • 20
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World? Paper • 2506.05287 • Published 1 day ago • 12
VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos Paper • 2506.05349 • Published 1 day ago • 18
Inference-Time Hyper-Scaling with KV Cache Compression Paper • 2506.05345 • Published 1 day ago • 16
StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs Paper • 2506.03077 • Published 3 days ago • 15
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs Paper • 2506.05328 • Published 1 day ago • 19
Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights Paper • 2506.02865 • Published 3 days ago • 27
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development Paper • 2506.05010 • Published 1 day ago • 38
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics Paper • 2506.04308 • Published 2 days ago • 32
Sounding that Object: Interactive Object-Aware Image to Audio Generation Paper • 2506.04214 • Published 2 days ago • 1
FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning Paper • 2506.02515 • Published 4 days ago • 2
HTSC-2025: A Benchmark Dataset of Ambient-Pressure High-Temperature Superconductors for AI-Driven Critical Temperature Prediction Paper • 2506.03837 • Published 2 days ago • 3
DLP: Dynamic Layerwise Pruning in Large Language Models Paper • 2505.23807 • Published 10 days ago • 4