Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1528
219
441
Merve Noyan
PRO
merve
Follow
mahmut632734's profile picture
38saidul's profile picture
cahya's profile picture
7530 followers
·
332 following
https://github.com/merveenoyan/smol-vision
mervenoyann
merveenoyan
merve.bsky.social
AI & ML interests
VLMs, vision & co
Recent Activity
posted
an
update
1 day ago
Qwen2.5-Omni is soooo good that people build multimodal reasoning models off of it 🥹 > https://huggingface.co/KE-Team/Ke-Omni-R-3B is open-source audio reasoning model sota on average of benchmarks, based on https://huggingface.co/Qwen/Qwen2.5-Omni-3B 🗣️ > https://huggingface.co/Haoz0206/Omni-R1 is a video reasoning model with pixel level grounding (see below) and it's super competitive ⏯️ based on https://huggingface.co/Qwen/Qwen2.5-Omni-7B
upvoted
a
changelog
1 day ago
New Inference Providers Dashboard
liked
a dataset
2 days ago
lmms-lab/multimodal-open-r1-8k-verified
View all activity
Organizations
merve
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
2 days ago
lmms-lab/multimodal-open-r1-8k-verified
Viewer
•
Updated
Jan 27
•
7.69k
•
973
•
55
liked
2 models
2 days ago
PlayHT/PlayDiffusion
Updated
8 days ago
•
84
lerobot/smolvla_base
Robotics
•
Updated
2 days ago
•
1.57k
•
93
liked
a Space
2 days ago
Running
on
Zero
17
17
Holo1 Localization
📚
Web Localization powered by Holo1
liked
4 models
2 days ago
ResembleAI/chatterbox
Text-to-Speech
•
Updated
7 days ago
•
•
647
XiaomiMiMo/MiMo-7B-RL-0530
Text Generation
•
Updated
1 day ago
•
412
•
24
tencent/HunyuanVideo-Avatar
Image-to-Video
•
Updated
9 days ago
•
157
BAAI/Video-XL-2
Video-Text-to-Text
•
Updated
about 17 hours ago
•
316
•
22
liked
3 datasets
2 days ago
Rapidata/text-2-video-human-preferences-veo3
Viewer
•
Updated
10 days ago
•
1.02k
•
476
•
13
MiniMaxAI/SynLogic
Viewer
•
Updated
1 day ago
•
49.3k
•
906
•
78
yandex/yambda
Viewer
•
Updated
about 6 hours ago
•
5.31B
•
30k
•
137
liked
4 models
3 days ago
Hcompany/Holo1-7B
Image-Text-to-Text
•
Updated
2 days ago
•
679
•
83
Hcompany/Holo1-3B
Image-Text-to-Text
•
Updated
2 days ago
•
1.07k
•
58
Haoz0206/Omni-R1
Video-Text-to-Text
•
Updated
9 days ago
•
80
•
16
KE-Team/Ke-Omni-R-3B
Audio-Text-to-Text
•
Updated
11 days ago
•
8
•
11
liked
a model
4 days ago
vidore/colqwen2-v1.0-hf
Visual Document Retrieval
•
Updated
4 days ago
•
1.83k
•
18
liked
a model
5 days ago
Qwen/Qwen2.5-Omni-7B
Any-to-Any
•
Updated
Apr 30
•
236k
•
1.64k
liked
3 datasets
5 days ago
m-a-p/OmniBench
Viewer
•
Updated
Jan 31
•
1.14k
•
391
•
8
m-a-p/OmniInstruct_v1
Viewer
•
Updated
Mar 31
•
96.1k
•
259
•
4
PKU-Alignment/align-anything
Viewer
•
Updated
Apr 5
•
69.4k
•
2.56k
•
35
Load more