Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
SambaNova
Replicate
Cerebras
Nebius AI Studio
Fireworks
Together AI
Cohere
Nscale
fal
Novita
Hyperbolic
HF Inference API
Misc
Reset Misc
Reinforcement Learning
Inference Endpoints
text-generation-inference
Eval Results
Misc with no match
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
10
Full-text search
Edit filters
Sort: Trending
Active filters:
Reinforcement Learning
Clear all
HUANG1993/GreedRL-VRP-pretrained-v1
Reinforcement Learning
•
Updated
Apr 26, 2023
•
4
Hawk91/PongNoFrameskip-v4_DQN
Updated
Aug 21, 2023
•
1
ledmands/ALE-Pacman-v5
Reinforcement Learning
•
Updated
Jun 2, 2024
•
62
•
1
Daemontatox/Cogito-R1
Text Generation
•
Updated
Feb 19
•
11
•
5
mradermacher/Cogito-R1-GGUF
Updated
Feb 12
•
75
mradermacher/Cogito-R1-i1-GGUF
Updated
Feb 13
•
425
mrlijun/SMR-R1
Updated
Apr 2
•
3
•
2
omreab/SoccerTwos
Updated
Apr 3
•
2
mradermacher/SMR-R1-GGUF
Updated
Apr 12
•
56
mradermacher/SMR-R1-i1-GGUF
Updated
Apr 12
•
56