dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated 19 days ago • 12k • 66
dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated Apr 29 • 1k • 95
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated Apr 29 • 500 • 85
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated Apr 21 • 500 • 31
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_8192 Viewer • Updated Apr 19 • 12k • 26
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated Apr 19 • 12k • 38
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_32768 Viewer • Updated Apr 18 • 12k • 48