Xiangyu's picture

6 8

Xiangyu

xixy

·

https://xixy.github.io/

AI & ML interests

None yet

Recent Activity

commented on a paper 4 days ago

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

authored a paper 11 days ago

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

upvoted a paper 11 days ago

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

View all activity

Organizations

None yet

xixy's activity

commented a paper 4 days ago

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

Paper • 2505.21067 • Published 11 days ago • 3 •

commented a paper 18 days ago

Model Merging in Pre-training of Large Language Models

Paper • 2505.12082 • Published 21 days ago • 35 •

commented a paper 7 months ago

IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization

Paper • 2411.06208 • Published Nov 9, 2024 • 21 •

commented a paper 11 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 102 •

commented a paper about 1 year ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 94 •