ReasonFlux-Coder Collection Coding LLMs excel at both writing code and generating unit tests. • 9 items • Updated 11 days ago • 6
Enigmata Collection Resources for the Enigmata Project: https://seed-enigmata.github.io. • 4 items • Updated 10 days ago • 1
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 16 days ago • 129
LightLab: Controlling Light Sources in Images with Diffusion Models Paper • 2505.09608 • Published 22 days ago • 31
Fast Text-to-Audio Generation with Adversarial Post-Training Paper • 2505.08175 • Published 24 days ago • 22
LiveCC Collection Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025) • 8 items • Updated Apr 23 • 4
Article 17 Reasons Why Gradio Isn't Just Another UI Library By ysharma and 1 other • Apr 16 • 37
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Apr 12 • 65
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 7 items • Updated 16 days ago • 139
Open-RS Collection Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated Mar 21 • 12