Hugging Face
Models: 87
Sort: Trending
Active filters: VLM
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1 • Image-Text-to-Text • Updated 1 day ago • 1.82k • 75
prithivMLmods/Qwen2-VL-OCR-2B-Instruct • Image-Text-to-Text • Updated May 2 • 74.7k • 80
Efficient-Large-Model/NVILA-Lite-8B-stage2 • Text Generation • Updated Jan 6 • 48 • 1
nvidia/Eagle2-2B • Image-Text-to-Text • Updated Apr 27 • 3.71k • 28
mradermacher/Qwen2-VL-OCR-2B-Instruct-GGUF • Updated 6 days ago • 501 • 2
prithivMLmods/Callisto-OCR3-2B-Instruct • Image-Text-to-Text • Updated May 2 • 439 • 5
mradermacher/ImageQuality-R1-v1-i1-GGUF • Updated 27 days ago • 11.5k • 1
lusxvr/nanoVLM-222M • Image-Text-to-Text • Updated 30 days ago • 5.23k • 83
nvidia/VILA-HD-8B-PS3-1.5K-SigLIP • Image-Text-to-Text • Updated 2 days ago • 31 • 1
nvidia/VILA-HD-8B-PS3-4K-SigLIP • Image-Text-to-Text • Updated 2 days ago • 27 • 1
One-RL-to-See-Them-All/Orsta-7B • Image-Text-to-Text • Updated 3 days ago • 720 • 7
One-RL-to-See-Them-All/Orsta-32B-0326 • Image-Text-to-Text • Updated 3 days ago • 142 • 4
Efficient-Large-Model/VILA-13b • Text Generation • Updated Mar 4, 2024 • 57 • 20
Efficient-Large-Model/VILA-7b • Text Generation • Updated Mar 4, 2024 • 162 • 26
Efficient-Large-Model/VILA-7b-4bit-awq • Text Generation • Updated Mar 4, 2024 • 28 • 2
Efficient-Large-Model/VILA-13b-4bit-awq • Text Generation • Updated Mar 4, 2024 • 22 • 2
Efficient-Large-Model/VILA-2.7b • Text Generation • Updated Mar 4, 2024 • 108 • 15
TIGER-Lab/Mantis-bakllava-7b • Image-Text-to-Text • Updated May 18, 2024 • 19 • 5
TIGER-Lab/Mantis-llava-7b • Image-Text-to-Text • Updated May 18, 2024 • 109 • 15
Efficient-Large-Model/VILA1.5-3b • Text Generation • Updated Jul 18, 2024 • 15k • 27
Efficient-Large-Model/VILA1.5-13b • Text Generation • Updated Jul 18, 2024 • 4.46k • 3
Efficient-Large-Model/Llama-3-VILA1.5-8B • Text Generation • Updated Aug 16, 2024 • 2.15k • 32
Efficient-Large-Model/VILA1.5-40b • Text Generation • Updated Jul 18, 2024 • 434 • 17
Efficient-Large-Model/VILA1.5-3b-s2 • Text Generation • Updated Jul 18, 2024 • 66 • 1
Efficient-Large-Model/VILA1.5-3b-AWQ • Text Generation • Updated Jul 18, 2024 • 42 • 5
Efficient-Large-Model/VILA1.5-3b-s2-AWQ • Text Generation • Updated Jul 18, 2024 • 14 • 1
Efficient-Large-Model/Llama-3-VILA1.5-8b-AWQ • Text Generation • Updated Jul 18, 2024 • 21 • 7
Efficient-Large-Model/VILA1.5-13b-AWQ • Text Generation • Updated Jul 18, 2024 • 45 • 3
Efficient-Large-Model/VILA1.5-40b-AWQ • Text Generation • Updated Jul 18, 2024 • 41 • 3
RussRobin/SpatialBot-3B-LoRA • Visual Question Answering • Updated Sep 5, 2024 • 2 • 3