Edit Models filters

Inference Providers

HF Inference API

Misc

arxiv: 2501.15383

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

37

Full-text search

Active filters: 2501.15383

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated 3 days ago • 71.5k • • 644

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated 3 days ago • 26.7k • • 324

Qwen/Qwen3-30B-A3B-Instruct-2507

Text Generation • 31B • Updated 3 days ago • 446k • 494

Qwen/Qwen3-30B-A3B-Thinking-2507

Text Generation • 31B • Updated 3 days ago • 107k • 236

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • 15B • Updated Jan 29 • 11.9k • • 317

Mungert/Qwen3-30B-A3B-Instruct-2507-GGUF

Text Generation • 31B • Updated 4 days ago • 15.8k • 1

Qwen/Qwen2.5-7B-Instruct-1M

Text Generation • 8B • Updated Jan 29 • 1.42M • • 346

async0x42/Qwen2.5-7B-Instruct-1M-exl2_4.65bpw

Text Generation • Updated Jan 29 • 1

async0x42/Qwen2.5-14B-Instruct-1M-exl2_4.65bpw

Text Generation • Updated Jan 29 • 1

ReadyArt/Qwen2.5-7B-Instruct-1M_EXL2_4.0bpw_H8

Text Generation • Updated Jan 29 • 7

ReadyArt/Qwen2.5-7B-Instruct-1M_EXL2_4.65bpw_H8

Text Generation • Updated Jan 29 • 7

ReadyArt/Qwen2.5-7B-Instruct-1M_EXL2_5.0bpw_H8

Text Generation • Updated Jan 29 • 7

ReadyArt/Qwen2.5-7B-Instruct-1M_EXL2_6.0bpw_H8

Text Generation • Updated Jan 29 • 7

ReadyArt/Qwen2.5-7B-Instruct-1M_EXL2_8.0bpw_H8

Text Generation • Updated Jan 29 • 8

ZeroXClem/Qwen2.5-7B-CelestialHarmony-1M

Text Generation • 8B • Updated Feb 8 • 5 • 7

remymenard/Qwen2.5-7B-Instruct-1M-ct2-int8

Text Generation • Updated Feb 3 • 1

QuantFactory/Qwen2.5-14B-Instruct-1M-GGUF

Text Generation • 15B • Updated Feb 8 • 59 • 3

QuantFactory/Qwen2.5-7B-Instruct-1M-GGUF

Text Generation • 8B • Updated Feb 9 • 108 • 3

professorf/Qwen2.5-7B-Instruct-1M-gguf

Text Generation • 8B • Updated Feb 17 • 8

AightBits/Qwen2.5-14B-Instruct-1M-8.0bpw-h8-exl2

Text Generation • Updated Feb 19 • 3

AightBits/Qwen2.5-7B-Instruct-1M-8.0bpw-h8-exl2

Text Generation • Updated Feb 19 • 1

Mungert/Qwen2.5-14B-Instruct-1M-GGUF

Text Generation • 15B • Updated 10 days ago • 1.36k • 5

Mungert/Qwen2.5-7B-Instruct-1M-GGUF

Text Generation • 8B • Updated 10 days ago • 981 • 6

duyntnet/Qwen2.5-14B-Instruct-1M-imatrix-GGUF

Text Generation • 15B • Updated Mar 25 • 2

RichardErkhov/Qwen_-_Qwen2.5-7B-Instruct-1M-4bits

4B • Updated Mar 27 • 2

RichardErkhov/Qwen_-_Qwen2.5-7B-Instruct-1M-8bits

8B • Updated Mar 27 • 2

Mozilla/Qwen2.5-7B-Instruct-1M-llamafile

Text Generation • Updated Apr 30 • 30

Mozilla/Qwen2.5-14B-Instruct-1M-llamafile

Text Generation • Updated Apr 30 • 26

limcheekin/Qwen2.5-7B-Instruct-1M-rk3588-1.1.4

Text Generation • Updated Apr 17 • 2

limcheekin/Qwen2.5-14B-Instruct-1M-rk3588-1.1.4

Text Generation • Updated Apr 18 • 2