new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Apr 29

Submitted by

wanghaofan

RepText: Rendering Visual Text via Replicating

·
8 authors

Submitted by

ambean

Clinical knowledge in LLMs does not translate to human interactions

·
11 authors

Submitted by

lgy0404

LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

·
18 authors

Submitted by

akhaliq

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

·
5 authors

Submitted by

judge

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

·
8 authors

Submitted by

QizhiPei

CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges

·
9 authors

Submitted by

cloudcatcher2

Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency

·
7 authors

Submitted by

ashiq24

Group Downsampling with Equivariant Anti-aliasing

·
2 authors

Submitted by

iofu728

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

·
11 authors

Submitted by

soujanyaporia

NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks

·
8 authors

Submitted by

AaronZ345

Versatile Framework for Song Generation with Prompt-based Control

·
11 authors

2

Submitted by

renqiux0302

TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

·
13 authors

2

Submitted by

FocusV857

ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers

·
5 authors

Submitted by

observerw

ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development

·
6 authors

2