RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions Paper • 2506.03448 • Published 3 days ago • 4
RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers Paper • 2506.02528 • Published 4 days ago • 15
DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation Paper • 2506.03123 • Published 3 days ago • 14
FlowMo: Variance-Based Flow Guidance for Coherent Motion in Video Generation Paper • 2506.01144 • Published 5 days ago • 14
Running on Zero 8 8 Solving Inverse Problems with FLAIR 🎨 Restore and enhance images using text prompts
Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers Paper • 2506.03065 • Published 3 days ago • 27
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination Paper • 2505.21925 • Published 10 days ago • 34
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment Paper • 2505.18600 • Published 13 days ago • 45