Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability Paper • 2506.01789 • Published 3 days ago • 10
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models? Paper • 2410.07571 • Published Oct 10, 2024 • 2
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents Paper • 2505.15277 • Published 15 days ago • 98
FREESON: Retriever-Free Retrieval-Augmented Reasoning via Corpus-Traversing MCTS Paper • 2505.16409 • Published 14 days ago • 2
The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think Paper • 2505.10185 • Published 21 days ago • 25
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Paper • 2504.17192 • Published Apr 24 • 110
Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators Paper • 2503.19877 • Published Mar 25 • 1
SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap Paper • 2309.12382 • Published Sep 21, 2023
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis Paper • 1904.01906 • Published Apr 3, 2019
Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models Paper • 2305.15080 • Published May 24, 2023
Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation Paper • 2401.06591 • Published Jan 12, 2024 • 4
On Web-based Visual Corpus Construction for Visual Document Understanding Paper • 2211.03256 • Published Nov 7, 2022 • 1
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning Paper • 2406.11823 • Published Jun 17, 2024
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models? Paper • 2410.07571 • Published Oct 10, 2024 • 2