Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Xiangyu's picture
6 8

Xiangyu

xixy
·
https://xixy.github.io/

AI & ML interests

None yet

Recent Activity

commented on a paper 4 days ago
Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
authored a paper 11 days ago
Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
upvoted a paper 11 days ago
Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
View all activity

Organizations

None yet

xixy's activity

commented a paper 4 days ago

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

Paper • 2505.21067 • Published 11 days ago • 3 •
1
commented a paper 18 days ago

Model Merging in Pre-training of Large Language Models

Paper • 2505.12082 • Published 21 days ago • 35 •
5
commented a paper 7 months ago

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization

Paper • 2411.06208 • Published Nov 9, 2024 • 21 •
8
commented a paper 11 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 102 •
6
commented a paper about 1 year ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 94 •
16
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs