Merve Noyan's picture

Merve Noyan PRO

merve

·

https://github.com/merveenoyan/smol-vision

AI & ML interests

VLMs, vision & co

Recent Activity

posted an update 1 day ago

Qwen2.5-Omni is soooo good that people build multimodal reasoning models off of it 🥹 > https://huggingface.co/KE-Team/Ke-Omni-R-3B is open-source audio reasoning model sota on average of benchmarks, based on https://huggingface.co/Qwen/Qwen2.5-Omni-3B 🗣️ > https://huggingface.co/Haoz0206/Omni-R1 is a video reasoning model with pixel level grounding (see below) and it's super competitive ⏯️ based on https://huggingface.co/Qwen/Qwen2.5-Omni-7B

upvoted a changelog 1 day ago

New Inference Providers Dashboard

liked a dataset 2 days ago

lmms-lab/multimodal-open-r1-8k-verified

View all activity

Organizations

merve's activity

liked a dataset 2 days ago

lmms-lab/multimodal-open-r1-8k-verified

Viewer • Updated Jan 27 • 7.69k • 973 • 55

liked 2 models 2 days ago

PlayHT/PlayDiffusion

Updated 8 days ago • 84

lerobot/smolvla_base

Robotics • Updated 2 days ago • 1.57k • 93

liked a Space 2 days ago

Holo1 Localization

Web Localization powered by Holo1

liked 4 models 2 days ago

ResembleAI/chatterbox

Text-to-Speech • Updated 7 days ago • • 647

XiaomiMiMo/MiMo-7B-RL-0530

Text Generation • Updated 1 day ago • 412 • 24

tencent/HunyuanVideo-Avatar

Image-to-Video • Updated 9 days ago • 157

BAAI/Video-XL-2

Video-Text-to-Text • Updated about 17 hours ago • 316 • 22

liked 3 datasets 2 days ago

Rapidata/text-2-video-human-preferences-veo3

Viewer • Updated 10 days ago • 1.02k • 476 • 13

MiniMaxAI/SynLogic

Viewer • Updated 1 day ago • 49.3k • 906 • 78

yandex/yambda

Viewer • Updated about 6 hours ago • 5.31B • 30k • 137

liked 4 models 3 days ago

Hcompany/Holo1-7B

Image-Text-to-Text • Updated 2 days ago • 679 • 83

Hcompany/Holo1-3B

Image-Text-to-Text • Updated 2 days ago • 1.07k • 58

Haoz0206/Omni-R1

Video-Text-to-Text • Updated 9 days ago • 80 • 16

KE-Team/Ke-Omni-R-3B

Audio-Text-to-Text • Updated 11 days ago • 8 • 11

liked a model 4 days ago

vidore/colqwen2-v1.0-hf

Visual Document Retrieval • Updated 4 days ago • 1.83k • 18

liked a model 5 days ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated Apr 30 • 236k • 1.64k

liked 3 datasets 5 days ago

m-a-p/OmniBench

Viewer • Updated Jan 31 • 1.14k • 391 • 8

m-a-p/OmniInstruct_v1

Viewer • Updated Mar 31 • 96.1k • 259 • 4

PKU-Alignment/align-anything

Viewer • Updated Apr 5 • 69.4k • 2.56k • 35