Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
8
Xuandong Zhao
Xuandong
Follow
Six-dollar-Cola's profile picture
1 follower
·
4 following
https://xuandongzhao.github.io/
xuandongzhao
XuandongZhao
xuandong-zhao-a3270610b
AI & ML interests
None yet
Recent Activity
updated
a model
4 days ago
sunblaze-ucb/Qwen2.5-3B-GRPO-MATH-1EPOCH
updated
a model
4 days ago
sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH
authored
a paper
11 days ago
Learning to Reason without External Rewards
View all activity
Organizations
Papers
7
arxiv:
2505.19590
arxiv:
2504.04715
arxiv:
2410.06172
arxiv:
2401.17256
Expand 7 papers
spaces
1
Running
1
Unigram-Watermark
👀
models
4
Sort:Â Recently updated
Xuandong/Qwen2.5-SPO-7B-3e6
Updated
23 days ago
Xuandong/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
Mar 7
Xuandong/HPD-TinyBERT-F128
Feature Extraction
•
Updated
May 10, 2022
•
39
•
1
Xuandong/HPD-MiniLM-F128
Feature Extraction
•
Updated
May 10, 2022
•
9
datasets
0
None public yet