Hugging Face
Models
11,076
Sort: Trending
Active filters: image-text-to-text
microsoft/Florence-2-large • Image-Text-to-Text • Updated Dec 8, 2024 • 738k • 1.56k
openbmb/MiniCPM-V-2_6 • Image-Text-to-Text • Updated Jan 15 • 63.5k • 976
microsoft/Phi-3.5-vision-instruct • Image-Text-to-Text • Updated Sep 26, 2024 • 1.03M • 683
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 643 • 1.67k
osunlp/UGround-V1-7B • Image-Text-to-Text • Updated Apr 16 • 2.03k • 16
5CD-AI/Vintern-1B-v3_5 • Image-Text-to-Text • Updated Feb 12 • 127k • 68
allenai/olmOCR-7B-0225-preview • Image-Text-to-Text • Updated Feb 25 • 314k • 657
unsloth/gemma-3-12b-it-GGUF • Image-Text-to-Text • Updated 26 days ago • 54.4k • 78
mlabonne/gemma-3-27b-it-abliterated • Image-Text-to-Text • Updated Mar 21 • 6.89k • 152
soob3123/amoral-gemma3-12B-v2 • Text Generation • Updated Apr 19 • 488 • 33
meta-llama/Llama-4-Maverick-17B-128E • Image-Text-to-Text • Updated Apr 9 • 1.41k • 77
moonshotai/Kimi-VL-A3B-Thinking • Image-Text-to-Text • Updated Apr 20 • 52.8k • 410
OpenGVLab/InternVL3-78B • Image-Text-to-Text • Updated 9 days ago • 1.19M • 187
Skywork/SkyCaptioner-V1 • Video-Text-to-Text • Updated Apr 25 • 804 • 41
meta-llama/Llama-Guard-4-12B • Image-Text-to-Text • Updated Apr 29 • 64.8k • 41
prithivMLmods/docscopeOCR-7B-050425-exp-GGUF • Image-Text-to-Text • Updated 4 days ago • 185 • 3
microsoft/git-base • Image-to-Text • Updated Apr 24, 2023 • 369k • 98
Salesforce/blip2-opt-2.7b • Image-Text-to-Text • Updated Feb 3 • 915k • 378
llava-hf/LLaVA-NeXT-Video-7B-hf • Video-Text-to-Text • Updated Jan 27 • 159k • 101
OpenGVLab/InternVL2-1B • Image-Text-to-Text • Updated Mar 25 • 38.6k • 73
5CD-AI/Vintern-1B-v2 • Image-Text-to-Text • Updated Jan 17 • 2.58k • 72
Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • Updated Jan 12 • 544k • 423
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • Updated Feb 6 • 1.32M • 1.19k
mistralai/Pixtral-12B-2409 • Image-Text-to-Text • Updated Dec 26, 2024 • 645
stepfun-ai/GOT-OCR2_0 • Image-Text-to-Text • Updated Feb 4 • 150k • 1.48k
Qwen/Qwen2-VL-72B-Instruct • Image-Text-to-Text • Updated Feb 6 • 27.6k • 303
meta-llama/Llama-3.2-11B-Vision • Image-Text-to-Text • Updated Sep 27, 2024 • 30.6k • 519
meta-llama/Llama-3.2-90B-Vision-Instruct • Image-Text-to-Text • Updated Mar 4 • 13.2k • 342
unsloth/Llama-3.2-11B-Vision-Instruct • Image-Text-to-Text • Updated Dec 10, 2024 • 28.6k • 81
nvidia/NVLM-D-72B • Image-Text-to-Text • Updated Jan 14 • 16k • 771