Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

4-bit precision

text-generation-inference

Inference Endpoints

4-bit precision

Misc with no match

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

15

Full-text search

Active filters: 4-bit precision

cmod/firefunction-v1-GGUF

Updated Feb 25, 2024

Esperanto/gemma-2b-it-kvc-AWQ-int4-onnx

Updated Dec 12, 2024 • 6

Esperanto/gemma-7b-it-kvc-AWQ-int4-onnx

Updated Dec 12, 2024 • 6

Esperanto/mistral-7b-Instruct-v0.2-kvc-AWQ-int4-onnx

Updated Dec 12, 2024 • 7 • 1

Esperanto/phi3-mini-4k-instruct-kvc-AWQ-int4-onnx

Updated Dec 12, 2024 • 6

amornpan/openthaigpt-MedChatModelv11

Text Generation • Updated Mar 10 • 39 • 2

Esperanto/llama-3.2-3B-Instruct-kvc-AWQ-int4-onnx

Text Generation • Updated Dec 12, 2024 • 9

Esperanto/llama-3.2-1B-Instruct-kvc-AWQ-int4-onnx

Text Generation • Updated Dec 12, 2024

Esperanto/llama3.1-8b-Instruct-kvc-AWQ-int4-onnx

Text Generation • Updated Dec 12, 2024 • 9

Esperanto/mistral-7b-kvc-AWQ-int4-onnx

Updated Dec 18, 2024 • 3

amornpan/openthaigpt1.5-14b-MedChatModelV1

Text Generation • Updated Nov 4, 2024 • 14 • 1

akkikiki/j-moshi-ext-mlx-q4

Updated Feb 12 • 39

ruslanmv/granite-3.1-2b-Reasoning-4bit

Text Generation • Updated Feb 11 • 12

amornpan/V3_qwen2.5-32b-med-thai-optimized

Text Generation • Updated Mar 14 • 1

korarishi/rishi-2-2b-it

Text Generation • Updated 28 days ago • 39 • 1