kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_4 Text Generation • Updated Dec 7, 2024 • 16
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_3 Text Generation • Updated Dec 7, 2024 • 13
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_2 Text Generation • Updated Dec 7, 2024 • 13
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_1 Text Generation • Updated Dec 7, 2024 • 13
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-bt-model-with-sigmoid Viewer • Updated about 1 month ago • 123k • 55
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-24-4096-with-bt-model-with-sigmoid Viewer • Updated about 1 month ago • 123k • 97
kaiwenw/distill-r1-qwen-1.5b-aime-25-4096-with-bt-model-with-sigmoid Viewer • Updated about 1 month ago • 123k • 59
kaiwenw/distill-r1-qwen-1.5b-aime-24-4096-with-bt-model-with-sigmoid Viewer • Updated about 1 month ago • 123k • 40
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-bt-model-wout-sigmoid Viewer • Updated about 1 month ago • 123k • 37
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-24-4096-with-bt-model-wout-sigmoid Viewer • Updated about 1 month ago • 123k • 37
kaiwenw/distill-r1-qwen-1.5b-aime-25-4096-with-bt-model-wout-sigmoid Viewer • Updated about 1 month ago • 123k • 66
kaiwenw/distill-r1-qwen-1.5b-aime-24-4096-with-bt-model-wout-sigmoid Viewer • Updated May 6 • 123k • 130
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-old-prm-indices_61440_69120 Viewer • Updated May 6 • 7.68k • 40
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-old-prm-indices_76800_84480 Viewer • Updated May 6 • 7.68k • 40