Leandro von Werra's picture

Leandro von Werra

lvwerra

·

https://github.com/lvwerra

AI & ML interests

NLP and RL

Recent Activity

upvoted an article 2 days ago

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

upvoted a paper 3 days ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

upvoted an article 9 days ago

CodeAgents + Structure: A Better Way to Execute Actions

View all activity

Organizations

lvwerra's activity

upvoted an article 2 days ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By

and 5 others •

4 days ago

• 37

upvoted a paper 3 days ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published 4 days ago • 74

upvoted an article 9 days ago

Article

CodeAgents + Structure: A Better Way to Execute Actions

By

and 1 other •

10 days ago

• 43

upvoted an article 22 days ago

Article

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

By

and 9 others •

22 days ago

• 33

upvoted 2 collections 24 days ago

🔥 Releases

Hugging Face Science team releases • 26 items • Updated 24 days ago • 1

SmolVLM2 📺 Smallest video LM ever 🤏🏻

11 items • Updated May 5 • 89

upvoted a paper about 2 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 188

upvoted a collection 2 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 522

upvoted an article 3 months ago

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 292

upvoted 2 articles 4 months ago

Article

Blazing-Fast Code Editing via Multi-Layer Speculation

By

and 3 others •

Feb 15

• 15

Article

Open R1: Update #2

By

and 6 others •

Feb 10

• 214

upvoted a paper 4 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 232

upvoted 4 articles 4 months ago

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.25k

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

By

and 5 others •

Feb 4

• 89

Article

Open-R1: Update #1

By

and 7 others •

Feb 2

• 305

Article

Welcome to Inference Providers on the Hub 🔥

By

and 6 others •

Jan 28

• 483

upvoted a paper 4 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 400

upvoted an article 4 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 862

upvoted a paper 5 months ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 63

upvoted a collection 6 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 158