Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sohyun An's picture
3

Sohyun An

sohyunan
mhsonkyle's profile picture
·

AI & ML interests

None yet

Organizations

None yet

Collections 1

LLM Reasoning
  • Large Language Models Think Too Fast To Explore Effectively

    Paper • 2501.18009 • Published Jan 29 • 24
  • Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

    Paper • 2501.18585 • Published Jan 30 • 61
LLM Reasoning
  • Large Language Models Think Too Fast To Explore Effectively

    Paper • 2501.18009 • Published Jan 29 • 24
  • Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

    Paper • 2501.18585 • Published Jan 30 • 61

models 16

sohyunan/DeepSeek-R1-Distill-Qwen-1.5B-sft-lora

Updated Feb 20

sohyunan/DeepSeek-R1-Distill-Qwen-1.5B-sft-full

Updated Feb 20

sohyunan/gemma-2-2b-it-maze-sft-sys0.0

Text Generation • 3B • Updated Feb 8 • 1 •

sohyunan/gemma-2-2b-it-maze-sft-ctrl-sys0.5-a_star

Text Generation • 3B • Updated Feb 8 • 2 •

sohyunan/gemma-2-2b-it-maze-sft-sys1.0-a_star

Text Generation • 3B • Updated Feb 8 • 2

sohyunan/gemma-2-2b-it_controller_sft_random_grpo

Text Generation • 3B • Updated Feb 7 • 1

sohyunan/debug

Text Generation • Updated Feb 6 • 1

sohyunan/gemma-2-2b-it_controller_sft_random_grpo_lora

Updated Feb 6

sohyunan/gemma-2-2b-it_controller_grpo_lora

Updated Feb 6

sohyunan/gemma-2-2b-it_controller_sft_random

Text Generation • 3B • Updated Feb 6 • 3
View 16 models

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs