Qian Liu

SivilTaram

http://siviltaram.github.io/

AI & ML interests

Cooking cool things

SivilTaram's activity

liked a dataset 1 day ago

Skywork/Skywork-OR1-RL-Data

Viewer • Updated 9 days ago • 119k • 1.46k • 37

upvoted a collection 3 days ago

Qwen3

40 items • Updated 17 days ago • 738

upvoted 2 papers 4 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 4 days ago • 127

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 7 days ago • 112

upvoted a paper 14 days ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published 15 days ago • 30

upvoted a paper 16 days ago

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published 16 days ago • 32

authored a paper 16 days ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published 17 days ago • 22

upvoted a paper 17 days ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published 17 days ago • 22

upvoted 3 papers 18 days ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published 21 days ago • 3

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 182

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 22 days ago • 78

updated 3 models 30 days ago

SVRL/verl-scalable-0504_Qwen3-4B-Base_webinstruct-verified

Updated 23 days ago

SVRL/verl-scalable-0504_Qwen3-4B-Base_webinstruct-verified

Updated 23 days ago

SVRL/verl-scalable-0504_Qwen3-4B-Base_webinstruct-verified

Updated 23 days ago