Daily Papers

by AK and the research community

Feb 25

Submitted by

akhaliq

VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing

·
4 authors

Submitted by

LiuXR

Thus Spake Long-Context Large Language Model

·
13 authors

Submitted by

gallilmaimon

Slamming: Training a Speech Language Model on One GPU in a Day

·
3 authors

Submitted by

Canyu

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

·
8 authors

3

Submitted by

a43992899

Audio-FLAN: A Preliminary Release

·
22 authors

2

Submitted by

Facico

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

·
7 authors

Submitted by

yulunliu

GCC: Generative Color Constancy via Diffusing a Color Checker

·
7 authors

2

Submitted by

CheeryLJH

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

·
18 authors

3

Submitted by

amphora

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

·
4 authors

2

Submitted by

akhaliq

RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers

·
6 authors

3

Submitted by

TianjinHuang

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam

·
11 authors

2

Submitted by

xw-eric

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

·
8 authors

2

Submitted by

irenesolaiman

Beyond Release: Access Considerations for Generative AI Systems

·
7 authors

4

Submitted by

akhaliq

X-Dancer: Expressive Music to Human Dance Video Generation

·
9 authors

Submitted by

xhyandwyy

Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration

·
7 authors

2

Submitted by

jianlanluo

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

·
6 authors

2

Submitted by

ProKil

Grounded Persuasive Language Generation for Automated Marketing

·
7 authors

3

Submitted by

clem

Forecasting Open-Weight AI Model Growth on Hugging Face

·
3 authors

3

Submitted by

GPaolo

TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning

·
5 authors

2

Submitted by

callanwu

Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties

·
5 authors

4

Submitted by

wenyueH

InductionBench: LLMs Fail in the Simplest Complexity Class

·
6 authors

2

Submitted by

dalime

Investigating the Impact of Quantization Methods on the Safety and Reliability of Large Language Models

·
6 authors

2

Submitted by

peterji

Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture Generation

·
10 authors

Submitted by

Nadav

Can Community Notes Replace Professional Fact-Checkers?

·
4 authors

2

Submitted by

codezakh

MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use

·
6 authors

2

Submitted by

WillHeld

Mind the Gap! Static and Interactive Evaluations of Large Audio Models

·
7 authors

2

Submitted by

zouharvi

Early-Exit and Instant Confidence Translation Quality Estimation

·
5 authors

2

Submitted by

gberton

MegaLoc: One Retrieval to Place Them All

·
2 authors

2

Submitted by

yzhuang

Self-Taught Agentic Long Context Understanding

·
10 authors

2

Submitted by

angus924

MONSTER: Monash Scalable Time Series Evaluation Repository

·
9 authors

2

Submitted by

ludolara

Diagnosing COVID-19 Severity from Chest X-Ray Images Using ViT and CNN Architectures

·
4 authors

2

Submitted by

nielsr

M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment

·
6 authors

Submitted by

ZarkLngeW

The snake in the Brownian sphere

·
4 authors