Hugging Face

Models: 10

Active filters: Reinforcement Learning

HUANG1993/GreedRL-VRP-pretrained-v1

Reinforcement Learning • Updated Apr 26, 2023 • 4

Hawk91/PongNoFrameskip-v4_DQN

Updated Aug 21, 2023 • 1

ledmands/ALE-Pacman-v5

Reinforcement Learning • Updated Jun 2, 2024 • 62 • 1

Daemontatox/Cogito-R1

Text Generation • Updated Feb 19 • 11 • 5

mradermacher/Cogito-R1-GGUF

Updated Feb 12 • 75

mradermacher/Cogito-R1-i1-GGUF

Updated Feb 13 • 425

mrlijun/SMR-R1

Updated Apr 2 • 3 • 2

omreab/SoccerTwos

Updated Apr 3 • 2

mradermacher/SMR-R1-GGUF

Updated Apr 12 • 56

mradermacher/SMR-R1-i1-GGUF

Updated Apr 12 • 56