Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
fal
Together AI
Cohere
Novita
Fireworks
SambaNova
Cerebras
Nebius AI Studio
Hyperbolic
Replicate
Nscale
HF Inference API
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
custom_code
4-bit precision
Eval Results
Merge
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
1,114
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
Mungert/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
about 12 hours ago
•
24.6k
•
16
Mungert/Qwen2.5-VL-3B-Instruct-GGUF
Image-Text-to-Text
•
Updated
about 12 hours ago
•
6.66k
•
16
Mungert/Qwen2.5-VL-72B-Instruct-GGUF
Image-Text-to-Text
•
Updated
about 12 hours ago
•
14.4k
•
10
huihui-ai/Qwen2.5-VL-32B-Instruct-abliterated
Image-Text-to-Text
•
Updated
21 days ago
•
1.21k
•
7
osunlp/Dreamer-7B
Image-Text-to-Text
•
Updated
Apr 9
•
2.8k
•
4
mradermacher/Qwen2.5-VL-32B-Instruct-abliterated-GGUF
Updated
21 days ago
•
1.72k
•
1
remyxai/SpaceThinker-Qwen2.5VL-3B
Image-Text-to-Text
•
Updated
1 day ago
•
2.17k
•
14
lusxvr/nanoVLM-222M
Image-Text-to-Text
•
Updated
30 days ago
•
5.23k
•
83
lmstudio-community/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
29 days ago
•
15.4k
•
1
unsloth/Qwen2.5-VL-3B-Instruct-GGUF
Image-Text-to-Text
•
Updated
26 days ago
•
7.31k
•
6
unsloth/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
26 days ago
•
15.3k
•
7
lusxvr/nanoVLM
Image-Text-to-Text
•
Updated
about 5 hours ago
•
72
•
3
kelkalot/medgemma-4b-it-sft-lora-kvasir-vqa
Updated
14 days ago
•
2
One-RL-to-See-Them-All/Orsta-7B
Image-Text-to-Text
•
Updated
3 days ago
•
720
•
7
One-RL-to-See-Them-All/Orsta-32B-0326
Image-Text-to-Text
•
Updated
3 days ago
•
142
•
4
unsloth/Qwen2.5-Omni-7B
Any-to-Any
•
Updated
10 days ago
•
34
•
4
unsloth/Qwen2.5-Omni-3B
Any-to-Any
•
Updated
9 days ago
•
54
•
3
unsloth/Qwen2.5-Omni-3B-GGUF
Any-to-Any
•
Updated
9 days ago
•
5.04k
•
6
davidelobba/TEMU-VTOFF
Image-to-Image
•
Updated
8 days ago
•
1
mlx-community/Holo1-3B-4bit
Image-Text-to-Text
•
Updated
4 days ago
•
30
•
1
sizzlebop/Holo1-7B-Q8_0-GGUF
Image-Text-to-Text
•
Updated
3 days ago
•
42
•
1
ogulcanakca/blip-itu-turkish-captions-finetuned
Image-to-Text
•
Updated
3 days ago
•
19
•
1
mradermacher/Holo1-3B-i1-GGUF
Updated
3 days ago
•
458
•
1
sujitpal/clip-imageclef
Zero-Shot Image Classification
•
Updated
Oct 31, 2023
•
16
•
3
waybarrios/guidance-based-video-grounding
Updated
Apr 1, 2023
MonoHime/mosei-senti-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
6
MonoHime/mosei-emo-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
9
MonoHime/iemocap-emo-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
9
MonoHime/mosi-senti-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
5
MonoHime/meld-emo-intermodal
Feature Extraction
•
Updated
May 18, 2023
•
9
Previous
1
2
3
4
5
...
38
Next