1 57 7

Shijie Geng

makitanikaze

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

upvoted a paper 5 days ago

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

upvoted a paper 5 days ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

View all activity

Organizations

None yet

upvoted 4 papers 5 days ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published 9 days ago • 39

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

Paper • 2508.07976 • Published 9 days ago • 45

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published 6 days ago • 140

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models

Paper • 2508.10751 • Published 6 days ago • 23

upvoted 2 papers 23 days ago

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Paper • 2507.17512 • Published 28 days ago • 36

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 28 days ago • 289

upvoted 4 papers about 1 month ago

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

Paper • 2507.13344 • Published Jul 17 • 55

upvoted 2 papers about 2 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 73

Ark: An Open-source Python-based Framework for Robot Learning

Paper • 2506.21628 • Published Jun 24 • 15

upvoted 8 papers 2 months ago

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

Paper • 2506.11928 • Published Jun 13 • 24

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10 • 101

Resa: Transparent Reasoning Models via SAEs

Paper • 2506.09967 • Published Jun 11 • 22

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Paper • 2506.10954 • Published Jun 12 • 51

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11 • 98

Magistral

Paper • 2506.10910 • Published Jun 12 • 63

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5 • 129

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4 • 44

Shijie Geng

AI & ML interests

Recent Activity

Organizations

makitanikaze's activity