Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
6
kui
kuikui
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
upvoted
an
article
about 1 month ago
Putting RL back in RLHF
updated
a model
about 1 month ago
kuikui/Qwen2-0.5B-GRPO-test
published
a model
about 1 month ago
kuikui/Qwen2-0.5B-GRPO-test
View all activity
Organizations
None yet
models
3
Sort: Recently updated
kuikui/Qwen2-0.5B-GRPO-test
Updated
Jul 13
kuikui/qwen-2.5-3b-r1-countdown
Updated
Jul 12
kuikui/gpt2-wikitext2
Text Generation
•
Updated
Sep 28, 2023
•
5
datasets
0
None public yet