Collection related to the paper, "Training a Generally Curious Agent" (Project page: https://paprika-llm.github.io/)
Fahim Tajwar
ftajwar
AI & ML interests
LLMs, RLHF
Recent Activity
updated
a dataset
16 days ago
self-label-zanette-lab/big_math_filtered_pass_rate_between_0.3_and_0.7
published
a dataset
16 days ago
self-label-zanette-lab/big_math_filtered_pass_rate_between_0.3_and_0.7
updated
a dataset
2 months ago
self-label-zanette-lab/big_math_full_dataset