Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

Reinforcement Learning

Inference Endpoints

text-generation-inference

Misc with no match

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

10

Full-text search

Active filters: Reinforcement Learning

HUANG1993/GreedRL-VRP-pretrained-v1

Reinforcement Learning • Updated Apr 26, 2023 • 4

Hawk91/PongNoFrameskip-v4_DQN

Updated Aug 21, 2023 • 1

ledmands/ALE-Pacman-v5

Reinforcement Learning • Updated Jun 2, 2024 • 62 • 1

Daemontatox/Cogito-R1

Text Generation • Updated Feb 19 • 11 • 5

mradermacher/Cogito-R1-GGUF

Updated Feb 12 • 75

mradermacher/Cogito-R1-i1-GGUF

Updated Feb 13 • 425

mrlijun/SMR-R1

Updated Apr 2 • 3 • 2

omreab/SoccerTwos

Updated Apr 3 • 2

mradermacher/SMR-R1-GGUF

Updated Apr 12 • 56

mradermacher/SMR-R1-i1-GGUF

Updated Apr 12 • 56