Leandro von Werra's picture

Leandro von Werra

lvwerra

·

https://github.com/lvwerra

AI & ML interests

NLP and RL

Recent Activity

upvoted an article 2 days ago

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

upvoted a paper 3 days ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

upvoted an article 9 days ago

CodeAgents + Structure: A Better Way to Execute Actions

View all activity

Organizations

lvwerra's activity

liked a Space 17 days ago

WikiRacing Language Models

Find answers by racing against LLM in a quiz game

liked a Space 24 days ago

Sheets

Create a dataset

liked a model about 1 month ago

ServiceNow-AI/Apriel-Nemotron-15b-Thinker

Text Generation • Updated 22 days ago • 2.67k • 87

liked a Space about 1 month ago

Computer Agent

Interact with an agent to perform web-based tasks

liked a model about 1 month ago

Qwen/Qwen3-235B-A22B

Text Generation • Updated 17 days ago • 187k • • 928

liked a Space about 1 month ago

Dia 1.6B

Generate realistic dialogue from a script, using Dia!

liked a model about 1 month ago

lldacing/flash-attention-windows-wheel

Updated 6 days ago • 161

liked 2 models about 2 months ago

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • Updated 21 days ago • 324k • 1.41k

rasbt/llama-3.2-from-scratch

Updated Apr 16 • 276

liked a Space 2 months ago

Try YourBench!

Generate a custom benchmark from any document

liked 2 Spaces 3 months ago

QwQ 32B Demo

Send text and get detailed responses

Open LLM Progress Tracker

Visualize Open vs. Proprietary LLM Progress

liked 2 Spaces 4 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

DABstep Leaderboard

DABstep Reasoning Benchmark Leaderboard

liked a model 5 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 690k • • 12.3k

liked 2 Spaces 6 months ago

Jupyter Agent

Generate code solutions interactively

Scaling test-time compute

Enhance math problem solving by scaling test-time compute

liked 2 datasets 6 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 20 • 34

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 39.1k • 177

liked a Space 7 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

Evaluate multilingual models using FineTasks