-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper โข 2402.17485 โข Published โข 196 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper โข 2312.01841 โข Published โข 1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper โข 2311.16498 โข Published โข 1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper โข 2312.02134 โข Published โข 2
Collections
Discover the best community collections!
Collections including paper arxiv:2404.10667
-
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper โข 2404.10667 โข Published โข 20 -
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention
Paper โข 2409.01876 โข Published โข 2 -
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper โข 2312.13578 โข Published โข 29 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper โข 2312.03029 โข Published โข 26
-
sdasd112132/Vision-8B-MiniCPM-2_5-Uncensored-and-Detailed-4bit
Visual Question Answering โข 5B โข Updated โข 15 โข 32 -
102
Idefics3
๐Generate text based on an image and prompt
-
37
Vilt Vqa
๐Ask questions about images and get answers
-
vikhyatk/moondream2
Image-Text-to-Text โข 2B โข Updated โข 141k โข 1.25k
-
Rho-1: Not All Tokens Are What You Need
Paper โข 2404.07965 โข Published โข 94 -
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper โข 2404.10667 โข Published โข 20 -
Instruction-tuned Language Models are Better Knowledge Learners
Paper โข 2402.12847 โข Published โข 27 -
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper โข 2402.09353 โข Published โข 28
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper โข 2402.17485 โข Published โข 196 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper โข 2312.01841 โข Published โข 1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper โข 2311.16498 โข Published โข 1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper โข 2312.02134 โข Published โข 2
-
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper โข 2404.10667 โข Published โข 20 -
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention
Paper โข 2409.01876 โข Published โข 2 -
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper โข 2312.13578 โข Published โข 29 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper โข 2312.03029 โข Published โข 26
-
sdasd112132/Vision-8B-MiniCPM-2_5-Uncensored-and-Detailed-4bit
Visual Question Answering โข 5B โข Updated โข 15 โข 32 -
102
Idefics3
๐Generate text based on an image and prompt
-
37
Vilt Vqa
๐Ask questions about images and get answers
-
vikhyatk/moondream2
Image-Text-to-Text โข 2B โข Updated โข 141k โข 1.25k
-
Rho-1: Not All Tokens Are What You Need
Paper โข 2404.07965 โข Published โข 94 -
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper โข 2404.10667 โข Published โข 20 -
Instruction-tuned Language Models are Better Knowledge Learners
Paper โข 2402.12847 โข Published โข 27 -
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper โข 2402.09353 โข Published โข 28