Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 ⢠11 items ⢠Updated Apr 28 ⢠484
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models, including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained and instruction-tuned). ⢠37 items ⢠Updated 16 days ago ⢠38
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others ⢠26 days ago ⢠417
MAI-DS-R1 Collection MAI-DS-R1 is a DeepSeek-R1 reasoning model that has been post-trained by the Microsoft AI team. ⢠2 items ⢠Updated May 1 ⢠11
view article Article Welcome the Falcon 3 Family of Open Models! By ariG23498 ⢠Dec 17, 2024 ⢠128
HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation Paper ⢠2503.18860 ⢠Published Mar 24 ⢠6
Reconstructing Humans with a Biomechanically Accurate Skeleton Paper ⢠2503.21751 ⢠Published Mar 27 ⢠9
EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety Paper ⢠2504.09689 ⢠Published Apr 13 ⢠7
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis Paper ⢠2504.04842 ⢠Published Apr 7 ⢠36
TransMamba: Flexibly Switching between Transformer and Mamba Paper ⢠2503.24067 ⢠Published Mar 31 ⢠21
view article Article Welcome Llama 4 Maverick & Scout on Hugging Face! By burtenshaw and 6 others ⢠Apr 5 ⢠144
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. ⢠13 items ⢠Updated 38 minutes ago ⢠31
TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting Paper ⢠2503.17032 ⢠Published Mar 21 ⢠26
MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving Paper ⢠2503.16905 ⢠Published Mar 21 ⢠54