Submitted by ambean 26 Clinical knowledge in LLMs does not translate to human interactions · 11 authors 5 5
Submitted by lgy0404 22 LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects · 18 authors 4
Submitted by akhaliq 18 Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory · 5 authors 2
Submitted by judge 18 SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning · 8 authors 19 2
Submitted by QizhiPei 17 CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges · 9 authors 8 4
Submitted by cloudcatcher2 13 Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency · 7 authors 10 3
Submitted by iofu728 9 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention · 11 authors 1.11k 2
Submitted by soujanyaporia 7 NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks · 8 authors 164 2
Submitted by AaronZ345 6 Versatile Framework for Song Generation with Prompt-based Control · 11 authors 2
Submitted by renqiux0302 6 TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving · 13 authors 2
Submitted by FocusV857 5 ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers · 5 authors 0 2
Submitted by observerw 4 ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development · 6 authors 2