Lewis Tunstall PRO
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
new activity
about 12 hours ago
open-thoughts/OpenThoughts3-1.2M: Inconsistent eval scores between dataset card and blog post
liked
a dataset
about 12 hours ago
open-thoughts/OpenThoughts3-1.2M
liked
a model
1 day ago
futurehouse/ether0
lewtun's activity
Inconsistent eval scores between dataset card and blog post
#2 opened about 12 hours ago by lewtun

Upload HTML.zip
#2 opened 5 days ago by NOUREDDINE25
Update evals with proper pass@1 scores
#8 opened 9 days ago by lewtun

Add domain categories?
2
#6 opened about 1 month ago by lewtun

[Experiment] Training R1-Zero-like models with Open R1
🔥
👀
8
13
#20 opened 2 months ago by lewtun

about <think> and </think>
2
#9 opened 2 months ago by volcanos

Please add HF Inference Endpoint and library tags which allow easier deployment
1
#8 opened 3 months ago by SolshineMisfit

Mode changed to Model
2
#7 opened 3 months ago by Solshine

Update README.md
1
#6 opened 3 months ago by nickname100231
Omitted <think> at the start and almost 10k tokens to debug 2 JS functions
➕
2
3
#2 opened 3 months ago by operationdarkside
It seems to overthink
1
#3 opened 3 months ago by sm54
Upload dataset
#4 opened 3 months ago by lewtun

missing </think> in all subset
2
#3 opened 3 months ago by volcanos

Why is there a discrepancy between the 'Solutions' subset and the 'Solutions_py' subset?
1
#2 opened 3 months ago by waple

Update README.md
1
#1 opened 3 months ago by lhoestq

Size of the weights > 140 GB for a 32 GB model?
👍
1
1
#2 opened 3 months ago by stelterlab

Remove fp32 weights
#4 opened 3 months ago by lewtun

Remove fp32 weights
#3 opened 3 months ago by lewtun

⚠️ Chat template foot gun with DeepSeek distilled models and RL format reward function
🚀
6
6
#17 opened 4 months ago by lewtun

the finetune config of open-r1?
2
#6 opened 4 months ago by MilyFang