view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! 1 day ago • 18
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published 5 days ago • 74
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • 15 days ago • 122
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 23 days ago • 112
view article Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? By danaaubakirova and 6 others • 27 days ago • 57
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published Apr 15 • 28
view article Article Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition 🤖 By thomwolf and 2 others • Apr 14 • 46
UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Paper • 2503.21620 • Published Mar 27 • 62
OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement Paper • 2503.17352 • Published Mar 21 • 23
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 108
view article Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other • Feb 25 • 162
view article Article Fixing Open LLM Leaderboard with Math-Verify By hynky and 3 others • Feb 14 • 30
view article Article DABStep: Data Agent Benchmark for Multi-step Reasoning By eggie5 and 5 others • Feb 4 • 89