Shan Chen

shanchen

https://shanchen.dev/

AI & ML interests

I train and eval pretty ok

Recent Activity

liked a model about 22 hours ago

deepseek-ai/DeepSeek-V3.1-Base

685B • Updated 1 day ago • 605

liked a dataset 19 days ago

BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated Apr 18 • 1.79M • 4.23k • 92

liked a dataset 21 days ago

zifeng-ai/TrialPanorama-benchmark

Viewer • Updated May 14 • 152k • 242 • 3

updated a model 27 days ago

shanchen/Qwen2-GRPO-test1.5

Text Generation • 2B • Updated 27 days ago • 7

published 2 models 27 days ago

shanchen/Qwen2-GRPO-test1.5

Text Generation • 2B • Updated 27 days ago • 7

shanchen/Qwen2-GRPO-test3

Updated 27 days ago

updated a model 27 days ago

shanchen/Qwen2-GRPO-test

Text Generation • 0.5B • Updated 27 days ago • 4

published a model 27 days ago

shanchen/Qwen2-GRPO-test

Text Generation • 0.5B • Updated 27 days ago • 4

updated a model 27 days ago

shanchen/Qwen2-0.5B-GRPO-test

Text Generation • 0.5B • Updated 27 days ago • 9

published a model 27 days ago

shanchen/Qwen2-0.5B-GRPO-test

Text Generation • 0.5B • Updated 27 days ago • 9

liked a model about 2 months ago

polaris-73/ds7b_grpo_math_faithful_step200

8B • Updated Jul 2 • 8 • 1

upvoted a paper 2 months ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 112

authored 3 papers 3 months ago

MedBrowseComp: Benchmarking Medical Deep Research and Computer Use

Paper • 2505.14963 • Published May 20 • 2

Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models

Paper • 2505.13774 • Published May 19 • 1

When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy

Paper • 2505.22888 • Published May 28 • 6

upvoted 3 papers 3 months ago

Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models

Paper • 2505.13774 • Published May 19 • 1

MedBrowseComp: Benchmarking Medical Deep Research and Computer Use

Paper • 2505.14963 • Published May 20 • 2

When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy

Paper • 2505.22888 • Published May 28 • 6

updated 2 datasets 3 months ago

shanchen/combine_multilingual

Viewer • Updated Jun 4 • 2.1k • 11

shanchen/aime_2025_multilingual

Viewer • Updated Jun 4 • 330 • 151
