Models: 15
Sort: Trending
Active filters: 4-bit precision
cmod/firefunction-v1-GGUF • Updated Feb 25, 2024
Esperanto/gemma-2b-it-kvc-AWQ-int4-onnx • Updated Dec 12, 2024 • 6
Esperanto/gemma-7b-it-kvc-AWQ-int4-onnx • Updated Dec 12, 2024 • 6
Esperanto/mistral-7b-Instruct-v0.2-kvc-AWQ-int4-onnx • Updated Dec 12, 2024 • 7 • 1
Esperanto/phi3-mini-4k-instruct-kvc-AWQ-int4-onnx • Updated Dec 12, 2024 • 6
amornpan/openthaigpt-MedChatModelv11 • Text Generation • Updated Mar 10 • 39 • 2
Esperanto/llama-3.2-3B-Instruct-kvc-AWQ-int4-onnx • Text Generation • Updated Dec 12, 2024 • 9
Esperanto/llama-3.2-1B-Instruct-kvc-AWQ-int4-onnx • Text Generation • Updated Dec 12, 2024
Esperanto/llama3.1-8b-Instruct-kvc-AWQ-int4-onnx • Text Generation • Updated Dec 12, 2024 • 9
Esperanto/mistral-7b-kvc-AWQ-int4-onnx • Updated Dec 18, 2024 • 3
amornpan/openthaigpt1.5-14b-MedChatModelV1 • Text Generation • Updated Nov 4, 2024 • 14 • 1
akkikiki/j-moshi-ext-mlx-q4 • Updated Feb 12 • 39
ruslanmv/granite-3.1-2b-Reasoning-4bit • Text Generation • Updated Feb 11 • 12
amornpan/V3_qwen2.5-32b-med-thai-optimized • Text Generation • Updated Mar 14 • 1
korarishi/rishi-2-2b-it • Text Generation • Updated 28 days ago • 39 • 1