Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Cerebras
Nebius AI
Fireworks
Together AI
SambaNova
Novita
Groq
Nscale
+ 6
Apply filters
Models
6,455
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
openbmb/MiniCPM-V-4_5
Image-Text-to-Text
•
9B
•
Updated
2 days ago
•
55.2k
•
932
tencent/POINTS-Reader
Image-Text-to-Text
•
4B
•
Updated
2 days ago
•
827
•
60
baidu/ERNIE-4.5-VL-424B-A47B-PT
Image-Text-to-Text
•
424B
•
Updated
13 days ago
•
161k
•
90
rednote-hilab/dots.ocr
Image-Text-to-Text
•
3B
•
Updated
about 18 hours ago
•
285k
•
938
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Apr 6
•
4.21M
•
•
1.23k
merve/smol-vision
Image-Text-to-Text
•
Updated
1 day ago
•
133
YannQi/R-4B
Image-Text-to-Text
•
5B
•
Updated
10 days ago
•
60.3k
•
157
baidu/ERNIE-4.5-VL-28B-A3B-PT
Image-Text-to-Text
•
29B
•
Updated
13 days ago
•
176k
•
•
77
microsoft/kosmos-2.5
Image-Text-to-Text
•
1B
•
Updated
17 days ago
•
8.19k
•
254
baidu/ERNIE-4.5-VL-28B-A3B-Paddle
Image-Text-to-Text
•
29B
•
Updated
25 days ago
•
18k
•
37
nanonets/Nanonets-OCR-s
Image-Text-to-Text
•
4B
•
Updated
Jun 20
•
287k
•
1.51k
google/gemma-3-27b-it
Image-Text-to-Text
•
27B
•
Updated
Mar 21
•
847k
•
•
1.61k
google/medgemma-4b-it
Image-Text-to-Text
•
5B
•
Updated
Jul 9
•
91.4k
•
658
google/gemma-3-4b-it
Image-Text-to-Text
•
4B
•
Updated
Mar 21
•
1.64M
•
840
google/gemma-3n-E4B-it
Image-Text-to-Text
•
8B
•
Updated
Jul 14
•
209k
•
761
zai-org/GLM-4.5V
Image-Text-to-Text
•
108B
•
Updated
27 days ago
•
42.7k
•
•
636
microsoft/Florence-2-large
Image-Text-to-Text
•
0.8B
•
Updated
Aug 4
•
606k
•
1.66k
ByteDance-Seed/UI-TARS-1.5-7B
Image-Text-to-Text
•
8B
•
Updated
Apr 18
•
93.9k
•
391
OpenGVLab/InternVL3_5-8B
Image-Text-to-Text
•
9B
•
Updated
15 days ago
•
15.9k
•
61
huihui-ai/Huihui-MiniCPM-V-4_5-abliterated
Image-Text-to-Text
•
9B
•
Updated
5 days ago
•
7.17k
•
20
nvidia/Cosmos-Reason1-7B
Image-Text-to-Text
•
8B
•
Updated
about 1 month ago
•
333k
•
169
unsloth/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
8B
•
Updated
May 12
•
88.5k
•
63
onnx-community/FastVLM-0.5B-ONNX
Image-Text-to-Text
•
Updated
12 days ago
•
15.7k
•
71
vikhyatk/moondream2
Image-Text-to-Text
•
2B
•
Updated
Jul 7
•
198k
•
1.29k
google/gemma-3-12b-it
Image-Text-to-Text
•
12B
•
Updated
Mar 21
•
556k
•
•
520
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text
•
12B
•
Updated
Apr 11
•
84.3k
•
185
LiquidAI/LFM2-VL-1.6B
Image-Text-to-Text
•
2B
•
Updated
Aug 13
•
12.2k
•
181
AIDC-AI/Ovis2.5-9B
Image-Text-to-Text
•
9B
•
Updated
22 days ago
•
25k
•
287
Qwen/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
4B
•
Updated
Apr 6
•
3.92M
•
507
google/gemma-3n-E2B-it
Image-Text-to-Text
•
5B
•
Updated
Jul 14
•
181k
•
204
Previous
1
2
3
...
100
Next