TheJoeZenOne/qwen-3b-reasoning
Tags: PyTorch, GGUF, qwen2, unsloth, trl, grpo, conversational
License: apache-2.0
qwen-3b-reasoning/README.md
TheJoeZenOne, "Trained with Unsloth", commit b2103b4 (verified), 5 months ago
99 Bytes
metadata
license: apache-2.0
base_model:
- Qwen/Qwen2.5-VL-3B-Instruct
tags:
- unsloth
- trl
- grpo