heegyu
's Collections
Reward Modeling Datasets
updated
Viewer
•
Updated
•
37.1k
•
1.82k
•
237
Viewer
•
Updated
•
169k
•
11.6k
•
1.35k
Viewer
•
Updated
•
386k
•
1.95k
•
308
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
•
164k
•
3.96k
•
142
openai/webgpt_comparisons
Viewer
•
Updated
•
19.6k
•
724
•
233
openai/summarize_from_feedback
Viewer
•
Updated
•
194k
•
1.52k
•
206
HuggingFaceH4/ultrafeedback_binarized
Viewer
•
Updated
•
187k
•
14.4k
•
294
Viewer
•
Updated
•
183k
•
596
•
289
HuggingFaceH4/stack-exchange-preferences
Viewer
•
Updated
•
10.8M
•
2.92k
•
132
HuggingFaceH4/hhh_alignment
Viewer
•
Updated
•
221
•
576
•
21
Birchlabs/openai-prm800k-stepwise-critic
Viewer
•
Updated
•
1.09M
•
258
•
44
prometheus-eval/Feedback-Collection
Viewer
•
Updated
•
100k
•
381
•
114
argilla/OpenHermesPreferences
Viewer
•
Updated
•
989k
•
456
•
206
Viewer
•
Updated
•
8.11k
•
8.07k
•
95
Viewer
•
Updated
•
21.4k
•
2.69k
•
419
Magpie-Align/Magpie-Pro-DPO-200K
Viewer
•
Updated
•
207k
•
17
•
7
argilla/magpie-ultra-v0.1
Viewer
•
Updated
•
50k
•
376
•
221