System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts Paper • 2505.18962 • Published 13 days ago • 12
Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering Paper • 2306.09996 • Published Jun 16, 2023
Benchmarking Vision Language Models for Cultural Understanding Paper • 2407.10920 • Published Jul 15, 2024
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding Paper • 2306.08832 • Published Jun 15, 2023
Rendering-Aware Reinforcement Learning for Vector Graphics Generation Paper • 2505.20793 • Published 10 days ago • 11
FACT: Examining the Effectiveness of Iterative Context Rewriting for Multi-fact Retrieval Paper • 2410.21012 • Published Oct 28, 2024
R$^3$Mem: Bridging Memory Retention and Retrieval via Reversible Compression Paper • 2502.15957 • Published Feb 21
GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks Paper • 2504.12764 • Published Apr 17 • 41
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction Paper • 2503.15661 • Published Mar 19 • 2
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 285
DNA Bench: When Silence is Smarter -- Benchmarking Over-Reasoning in Reasoning LLMs Paper • 2503.15793 • Published Mar 20
LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces Paper • 2503.01894 • Published Feb 27 • 2
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3 • 39
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3 • 39
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3 • 39
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3 • 39
SynthCypher: A Fully Synthetic Data Generation Framework for Text-to-Cypher Querying in Knowledge Graphs Paper • 2412.12612 • Published Dec 17, 2024 • 4