Dmitry Ryumin's picture

Dmitry Ryumin

DmitryRyumin

·

https://dmitryryumin.github.io

DmitryRyumin

AI & ML interests

Machine Learning and Applications, Multi-Modal Understanding

Recent Activity

upvoted a collection about 21 hours ago

liked a Space 4 days ago

NihalGazi/Text-To-Speech-Unlimited

liked a Space 4 days ago

OpenSound/SoloSpeech

View all activity

Organizations

DmitryRyumin's activity

upvoted a collection about 21 hours ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated Apr 28 • 484

upvoted a collection 16 days ago

Falcon-H1

Falcon-H1 Family of Hybrid-Head Language Models, including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained and instruction-tuned). • 37 items • Updated 16 days ago • 38

upvoted an article 24 days ago

Article

Vision Language Models (Better, Faster, Stronger)

By

and 4 others •

26 days ago

• 417

upvoted 4 collections about 1 month ago

Qwen3

40 items • Updated 16 days ago • 738

MAI-DS-R1

MAI-DS-R1 is a DeepSeek-R1 reasoning model that has been post-trained by the Microsoft AI team. • 2 items • Updated May 1 • 11

Kimi-Audio-7B

Kimi audio 7B models • 3 items • Updated Apr 28 • 8

Gemma 3 Release

24 items • Updated 7 days ago • 379

upvoted an article about 1 month ago

Article

Welcome the Falcon 3 Family of Open Models!

By

•

Dec 17, 2024

• 128

upvoted 3 papers about 2 months ago

HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation

Paper • 2503.18860 • Published Mar 24 • 6

Reconstructing Humans with a Biomechanically Accurate Skeleton

Paper • 2503.21751 • Published Mar 27 • 9

EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety

Paper • 2504.09689 • Published Apr 13 • 7

upvoted a collection about 2 months ago

heartsync-mbti

17 items • Updated about 1 month ago • 59

upvoted 3 papers about 2 months ago

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Paper • 2504.04842 • Published Apr 7 • 36

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published Apr 8 • 75

TransMamba: Flexibly Switching between Transformer and Mamba

Paper • 2503.24067 • Published Mar 31 • 21

upvoted an article 2 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 144

upvoted a paper 2 months ago

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1 • 52

upvoted a collection 2 months ago

MambaVision

MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 38 minutes ago • 31

upvoted 2 papers 2 months ago

TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting

Paper • 2503.17032 • Published Mar 21 • 26

MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving

Paper • 2503.16905 • Published Mar 21 • 54