ReZero: Enhancing LLM search ability by trying one-more-time Paper • 2504.11001 • Published Apr 15 • 14
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 150
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published Apr 8 • 168
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning Paper • 2504.14509 • Published Apr 20 • 51
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment Paper • 2504.15585 • Published Apr 22 • 13
Personalized Text-to-Image Generation with Auto-Regressive Models Paper • 2504.13162 • Published Apr 17 • 19
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Paper • 2504.17192 • Published Apr 24 • 110
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing Paper • 2505.02823 • Published May 5 • 5
Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper • 2505.10558 • Published 22 days ago • 15
InstanceGen: Image Generation with Instance-level Instructions Paper • 2505.05678 • Published 29 days ago • 7
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis Paper • 2505.09358 • Published 23 days ago • 24
SageAttention2++: A More Efficient Implementation of SageAttention2 Paper • 2505.21136 • Published 10 days ago • 43
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data Paper • 2505.18445 • Published 14 days ago • 63
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback Paper • 2505.17908 • Published 14 days ago • 3
Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals Paper • 2505.21062 • Published 10 days ago • 3
Jodi: Unification of Visual Generation and Understanding via Joint Modeling Paper • 2505.19084 • Published 12 days ago • 20
LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers Paper • 2505.23758 • Published 8 days ago • 23
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment Paper • 2505.18600 • Published 14 days ago • 45
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation Paper • 2506.03147 • Published 3 days ago • 55
RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers Paper • 2506.02528 • Published 4 days ago • 15
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Paper • 2506.04225 • Published 2 days ago • 21