Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12 • 72
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective Paper • 2505.15045 • Published 17 days ago • 53
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Paper • 2505.16933 • Published 15 days ago • 30
LaViDa: A Large Diffusion Language Model for Multimodal Understanding Paper • 2505.16839 • Published 15 days ago • 12