9 90 22

Jiaheng Liu

CheeryLJH

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

MiMo-VL Technical Report

upvoted a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

upvoted a paper 4 days ago

HardTests: Synthesizing High-Quality Test Cases for LLM Coding

View all activity

Organizations

CheeryLJH's activity

upvoted a paper 1 day ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published 3 days ago • 62

upvoted a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 4 days ago • 127

upvoted a paper 4 days ago

HardTests: Synthesizing High-Quality Test Cases for LLM Coding

Paper • 2505.24098 • Published 8 days ago • 41

upvoted 2 papers 8 days ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published 11 days ago • 84

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 9 days ago • 116

upvoted a paper 10 days ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published 10 days ago • 91

upvoted 2 papers 11 days ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published 14 days ago • 85

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published 14 days ago • 59

upvoted 2 papers 15 days ago

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published 16 days ago • 39

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published 16 days ago • 51

upvoted a paper 16 days ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published 16 days ago • 85

upvoted 2 papers 17 days ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published 17 days ago • 22

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published 17 days ago • 129

upvoted a paper 19 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 182

liked a Space 23 days ago

SD3.5 M Flow GRPO

⚡

Generate images from text prompts

upvoted a paper 24 days ago

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published 25 days ago • 26

authored a paper 28 days ago

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published 29 days ago • 78

upvoted a paper 28 days ago

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published 29 days ago • 78

upvoted 2 papers about 1 month ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2 • 35

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published Apr 30 • 47