Daily Papers

by AK and the research community

Aug 11

Submitted by

xianbao

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

·
171 authors

Submitted by

RyanL22

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

·
2 authors

Submitted by

SiriusL

InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization

·
13 authors

Submitted by

Ningyu

Memp: Exploring Agent Procedural Memory

·
9 authors

Submitted by

YerbaPage

Pruning the Unsurprising: Efficient Code Reasoning via First-Token Surprisal

·
7 authors

Submitted by

JorgeeGF

Hidden Dynamics of Massive Activations in Transformer Training

·
5 authors

Submitted by

MikolajZ

GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing

·
4 authors

Submitted by

hdong51

Adapting Vision-Language Models Without Labels: A Comprehensive Survey

·
6 authors

Submitted by

KejiaRobust

MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs

·
7 authors

Submitted by

fsk515

MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh

·
9 authors

Submitted by

huxueyu

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

·
29 authors

Submitted by

LianShuQuan

UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding

·
7 authors

Submitted by

thebluser

LightSwitch: Multi-view Relighting with Material-guided Diffusion

·
3 authors

Submitted by

shijiezhou

VLM4D: Towards Spatiotemporal Awareness in Vision Language Models

·
10 authors