Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

Mixture of Experts

Misc with no match

text-embeddings-inference

Carbon Emissions

Models

1,112

Full-text search

Active filters: multimodal

Hcompany/Holo1-7B

Image-Text-to-Text • Updated 2 days ago • 679 • 83

Hcompany/Holo1-3B

Image-Text-to-Text • Updated 2 days ago • 1.07k • 58

ByteDance/Dolphin

Image-Text-to-Text • Updated 11 days ago • 3.82k • 256

BAAI/Video-XL-2

Video-Text-to-Text • Updated about 16 hours ago • 316 • 22

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated Apr 6 • 2.52M • • 939

stockmark/Stockmark-2-VL-100B-beta

Image-Text-to-Text • Updated 4 days ago • 323 • 15

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated Apr 6 • 2.92M • 397

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated Apr 30 • 236k • 1.64k

stepfun-ai/Step1X-Edit

Image-to-Image • Updated 15 days ago • 533 • 285

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated about 15 hours ago • 203k • • 474

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • Updated Apr 14 • 506k • • 381

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • Updated Apr 18 • 29.6k • 288

imageomics/bioclip-2

Zero-Shot Image Classification • Updated about 17 hours ago • 85 • 5

unsloth/Qwen2.5-Omni-7B-GGUF

Any-to-Any • Updated 9 days ago • 7.9k • 7

jinaai/jina-clip-v2

Feature Extraction • Updated Apr 28 • 41.6k • 241

Ertugrul/Qwen2.5-VL-7B-Captioner-Relaxed

Image-Text-to-Text • Updated Mar 22 • 26.7k • 20

Qwen/Qwen2.5-Omni-3B

Any-to-Any • Updated Apr 30 • 78.8k • 235

osunlp/UGround-V1-7B

Image-Text-to-Text • Updated Apr 16 • 2.03k • 16

NousResearch/Nous-Hermes-2-Vision-Alpha

Text Generation • Updated Dec 3, 2023 • 68 • 304

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • Updated Jan 12 • 544k • 423

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated Feb 6 • 1.32M • • 1.19k

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • Updated Oct 25, 2024 • 207k • 96

Qwen/Qwen2-VL-72B-Instruct

Image-Text-to-Text • Updated Feb 6 • 27.6k • • 303

unsloth/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • Updated Dec 10, 2024 • 28.6k • 81

nvidia/NVLM-D-72B

Image-Text-to-Text • Updated Jan 14 • 16k • 771

ByteDance-Seed/UI-TARS-72B-DPO

Image-Text-to-Text • Updated Jan 25 • 6.65k • 131

unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • Updated 25 days ago • 100k • 33

Qwen/Qwen2.5-VL-72B-Instruct-AWQ

Image-Text-to-Text • Updated Mar 7 • 47.4k • 54

Qwen/Qwen2.5-VL-7B-Instruct-AWQ

Image-Text-to-Text • Updated Apr 6 • 184k • 71

Mungert/Qwen2.5-VL-32B-Instruct-GGUF

Image-Text-to-Text • Updated 22 minutes ago • 3.3k • 9