Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! 1 day ago • 18
Article What if Your AI Conversations Become Public? By fdaudens • about 19 hours ago • 10
Article How to Build an MCP Server with Gradio By abidlabs and 1 other • Apr 30 • 162
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated 1 day ago • 13
Reward Bench 2 Collection Datasets, spaces, and models for Reward Bench 2 benchmark and paper! • 11 items • Updated 4 days ago • 8
Article *Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings By manu and 1 other • 5 days ago • 23
Article AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan By evijit and 2 others • 5 days ago • 12
Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • 10 days ago • 43
Article Bigger isn't always better: how to choose the most efficient model for context-specific tasks 🌱🧑🏼‍💻 By sasha • 10 days ago • 19
Article Interactive Tools for machine learning, deep learning, and math By Suzana • 12 days ago • 40
Changelog Xet is now the default storage option for new users and organizations 15 days ago • 58
Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • 15 days ago • 122
Falcon Edge series Collection A series of powerful, universal and fine-tunable small Language Models • 7 items • Updated 17 days ago • 22
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper • 2505.09568 • Published 24 days ago • 90