Lora & full finetune experiments on r1 distills to generate python code for math problems
Ram PRO
0-hero
AI & ML interests
All work on this profile is personal
Recent Activity
new activity
20 days ago
fffiloni/bnb-iso-skeuo-3d-icns-gen:Might need to change fal model endpoint
published
a model
27 days ago
0-hero/r1-7b-grpo-full
published
a model
27 days ago
0-hero/R1-7B-MATH-GRPO-FULL
Organizations
Collections
5
models
49

0-hero/r1-7B-grpo-v3.3-epoch-3
Updated
•
5

0-hero/r1-7B-grpo-v3.3-epoch-2
Updated
•
4

0-hero/r1-7B-grpo-v3.3-epoch-1
Updated
•
3

0-hero/r1-7B-grpo-v3.2-epoch-2
Updated
•
6

0-hero/r1-7B-grpo-v3.2-epoch-1
Updated
•
3

0-hero/r1-14B-grpo-v3.1-epoch-2
Updated
•
4

0-hero/r1-14B-grpo-v3.1-epoch-1
Updated
•
5

0-hero/r1-7B-grpo-v3.1-epoch-3
Updated
•
2

0-hero/r1-7B-grpo-v3.1-epoch-2
Updated
•
5

0-hero/r1-7B-grpo-v2-temp-1.0-60
Updated
•
6
datasets
14
0-hero/MATH
Viewer
•
Updated
•
331k
•
59
0-hero/audio-samples-fixed
Viewer
•
Updated
•
10
•
15
0-hero/distilabel-math-preference-dpo
Viewer
•
Updated
•
2.42k
•
24
0-hero/lj_speech_with_spectogram_conversations
Viewer
•
Updated
•
13.1k
•
17
•
1
0-hero/lj_speech_with_spectogram
Viewer
•
Updated
•
13.1k
•
22
•
1
0-hero/Matter-0.2-alpha
Viewer
•
Updated
•
2.52M
•
46
•
3
0-hero/Matter-0.1
Viewer
•
Updated
•
2.25M
•
84
•
53
0-hero/Matter-0.1-Slim-D
Viewer
•
Updated
•
1.32M
•
49
0-hero/Matter-0.1-Slim-C
Viewer
•
Updated
•
343k
•
36
0-hero/Matter-0.1-Slim-B
Viewer
•
Updated
•
308k
•
22
•
1