Xiangyu's picture

6 8

Xiangyu

xixy

·

https://xixy.github.io/

AI & ML interests

None yet

Recent Activity

commented on a paper 4 days ago

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

authored a paper 11 days ago

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

upvoted a paper 11 days ago

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

View all activity

Organizations

None yet

xixy's activity

upvoted a paper 11 days ago

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

Paper • 2505.17652 • Published 15 days ago • 6

upvoted a paper 17 days ago

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Paper • 2505.14464 • Published 18 days ago • 8

upvoted a collection about 1 month ago

Qwen3

40 items • Updated 17 days ago • 739

upvoted a paper 3 months ago

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity

Paper • 2503.01506 • Published Mar 3 • 9

upvoted a paper 9 months ago

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

Paper • 2408.16293 • Published Aug 29, 2024 • 27

upvoted 3 papers about 1 year ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 131

CodeShell Technical Report

Paper • 2403.15747 • Published Mar 23, 2024 • 1

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 94