Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains'
Yi Su
virtuoussy
AI & ML interests
None yet
Recent Activity
new activity
about 1 month ago
virtuoussy/Qwen2.5-7B-Instruct-RLVR:如何使用
new activity
about 1 month ago
virtuoussy/Qwen2.5-7B-Instruct-RLVR:Improve language tag
liked
a dataset
about 2 months ago
zwhe99/DeepMath-103K
Organizations
None yet