Interesting Papers - a pencaharlangit Collection

updated 1 day ago

ReZero: Enhancing LLM search ability by trying one-more-time

Paper • 2504.11001 • Published Apr 15 • 14
FonTS: Text Rendering with Typography and Style Controls

Paper • 2412.00136 • Published Nov 28, 2024
GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 97
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 150
An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 63
OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 168
DreamO: A Unified Framework for Image Customization

Paper • 2504.16915 • Published Apr 23 • 25
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Paper • 2504.14509 • Published Apr 20 • 51
Tina: Tiny Reasoning Models via LoRA

Paper • 2504.15777 • Published Apr 22 • 55
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Paper • 2504.15585 • Published Apr 22 • 13
Personalized Text-to-Image Generation with Auto-Regressive Models

Paper • 2504.13162 • Published Apr 17 • 19
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 110
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing

Paper • 2505.02823 • Published May 5 • 5
Style Customization of Text-to-Vector Generation with Image Diffusion Priors

Paper • 2505.10558 • Published 22 days ago • 15
InstanceGen: Image Generation with Instance-level Instructions

Paper • 2505.05678 • Published 29 days ago • 7
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis

Paper • 2505.09358 • Published 23 days ago • 24
SageAttention2++: A More Efficient Implementation of SageAttention2

Paper • 2505.21136 • Published 10 days ago • 43
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Paper • 2505.18445 • Published 14 days ago • 63
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback

Paper • 2505.17908 • Published 14 days ago • 3
Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals

Paper • 2505.21062 • Published 10 days ago • 3
ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published 11 days ago • 43
Jodi: Unification of Visual Generation and Understanding via Joint Modeling

Paper • 2505.19084 • Published 12 days ago • 20
D-AR: Diffusion via Autoregressive Models

Paper • 2505.23660 • Published 8 days ago • 34
LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers

Paper • 2505.23758 • Published 8 days ago • 23
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

Paper • 2505.18600 • Published 14 days ago • 45
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published 3 days ago • 55
RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers

Paper • 2506.02528 • Published 4 days ago • 15
Native-Resolution Image Synthesis

Paper • 2506.03131 • Published 3 days ago • 17
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation

Paper • 2506.04225 • Published 2 days ago • 21
Image Editing As Programs with Diffusion Models

Paper • 2506.04158 • Published 2 days ago • 16