MultiRef: Controllable Image Generation with Multiple Visual References Paper • 2508.06905 • Published 11 days ago • 13
MultiRef: Controllable Image Generation with Multiple Visual References Paper • 2508.06905 • Published 11 days ago • 13 • 1
MolmoAct: Action Reasoning Models that can Reason in Space Paper • 2508.07917 • Published 9 days ago • 38
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent Paper • 2508.05748 • Published 13 days ago • 114
Are We on the Right Way for Assessing Document Retrieval-Augmented Generation? Paper • 2508.03644 • Published 15 days ago • 24
Are We on the Right Way for Assessing Document Retrieval-Augmented Generation? Paper • 2508.03644 • Published 15 days ago • 24 • 2
LaTCoder: Converting Webpage Design to Code with Layout-as-Thought Paper • 2508.03560 • Published 15 days ago • 21
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10 • 32
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test Paper • 2506.21551 • Published Jun 26 • 28
Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency Paper • 2506.08343 • Published Jun 10 • 49
Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency Paper • 2506.08343 • Published Jun 10 • 49 • 2
MixSet Collection Benchmark dataset and model checkpoints of paper "LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?" • 2 items • Updated Jun 11
LiveVQA Collection Dataset, benchmark and model checkpoints from paper LiveVQA. • 5 items • Updated Jul 7 • 2