
Dai

Yinpei · https://yinpeidai.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
published a model 21 days ago
Yinpei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
published a model 21 days ago
Yinpei/Qwen2.5-1.5B-Open-R1-Distill

Organizations

  • University of Michigan
  • Situated Language and Embodied Dialogue Lab

Collections 1

offline-rl
  • Robotic Offline RL from Internet Videos via Value-Function Pre-Training

    Paper • 2309.13041 • Published Sep 22, 2023 • 8

Papers 6

arxiv:2409.14674
arxiv:2310.07968
arxiv:2305.13040
arxiv:2111.14592

Models 9

Yinpei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated 21 days ago

Yinpei/Qwen2.5-1.5B-Open-R1-Distill

Updated 21 days ago

Yinpei/runs_ckpt

Updated May 1

Yinpei/real_h5dy

Updated Apr 22

Yinpei/racer-visuomotor-policy-simple

Updated Oct 8, 2024

Yinpei/racer-visuomotor-policy-rich

Updated Oct 8, 2024

Yinpei/racer-llava-llama3-lora-simple

Updated Oct 8, 2024 • 1

Yinpei/racer-llava-llama3-lora-rich-betterswitch

Updated Oct 4, 2024 • 2

Yinpei/racer-llava-llama3-lora-rich

Updated Oct 4, 2024 • 1

Datasets 2

Yinpei/lerobot_data_collection

Viewer • Updated May 1 • 274k • 12

Yinpei/reticle_sample_data

Viewer • Updated May 1 • 18.2k • 37