The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 1 day ago • 23
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated 1 day ago • 13
Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien and 5 others • Dec 23, 2024 • 20
👩‍💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 4 items • Updated 25 days ago • 17
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated 19 days ago • 148
Article Interactive Tools for machine learning, deep learning, and math By Suzana • 12 days ago • 40
Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 23 days ago • 112
INTELLECT-2 Collection INTELLECT-2 is a 32 billion parameter language model with globally distributed reinforcement learning. • 3 items • Updated 26 days ago • 22
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers Paper • 2505.04842 • Published about 1 month ago • 12
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published Apr 15 • 28
Article Gotchas in Tokenizer Behavior Every Developer Should Know By qgallouedec • Apr 18 • 37
Article Cohere on Hugging Face Inference Providers 🔥 By burtenshaw and 6 others • Apr 16 • 126