Seungone Kim

seungone

https://seungonekim.github.io/

AI & ML interests

Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment

Recent Activity

upvoted a paper about 11 hours ago

Text2Grad: Reinforcement Learning from Natural Language Feedback

liked a dataset about 12 hours ago

TIGER-Lab/WebInstruct-verified

authored a paper 3 days ago

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

seungone's activity

upvoted a paper about 11 hours ago

Text2Grad: Reinforcement Learning from Natural Language Feedback

Paper • 2505.22338 • Published 10 days ago • 7

upvoted a paper 3 days ago

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

Paper • 2506.01789 • Published 5 days ago • 12

upvoted a paper 9 days ago

Let's Predict Sentence by Sentence

Paper • 2505.22202 • Published 10 days ago • 17

upvoted a paper 17 days ago

Reasoning Models Better Express Their Confidence

Paper • 2505.14489 • Published 18 days ago • 19

upvoted a paper 22 days ago

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published 23 days ago • 25

upvoted a paper 4 months ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 59

upvoted 4 papers 5 months ago

upvoted 2 papers 6 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 48

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 49

upvoted 2 articles 7 months ago

Navigating Korean LLM Research #1: Models

Article • Oct 22, 2024 • 26

Navigating Korean LLM Research #2: Evaluation Tools

Article • Oct 23, 2024 • 8

upvoted 3 papers 8 months ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 45

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Paper • 2410.13232 • Published Oct 17, 2024 • 45

Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code

Paper • 2409.19715 • Published Sep 29, 2024 • 11

upvoted a paper 9 months ago

Consent in Crisis: The Rapid Decline of the AI Data Commons

Paper • 2407.14933 • Published Jul 20, 2024 • 12

upvoted a paper 12 months ago

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Paper • 2406.05761 • Published Jun 9, 2024 • 3

upvoted a collection 12 months ago

System Message Generalization

Collection • 11 items • Updated Jun 7, 2024 • 4