Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • Jun 3 • 83
Seed-X Collection A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 6 items • Updated 22 days ago • 61
Article What's going on with the Open LLM Leaderboard? By clefourrier and 3 others • Jun 23, 2023 • 43
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated Apr 30 • 18
Skywork-Reward-V2 Collection Scaling preference data curation to the extreme • 9 items • Updated Jul 4 • 23
Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning Paper • 2408.08640 • Published Aug 16, 2024 • 3
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 30 days ago • 631