Collections

Discover the best community collections!

Collections including paper arxiv:2502.09056
RL+reason model
Collection by about 18 hours ago
Typhoon R1 - ICLR 2025 SCI-FM Artifacts
Artifacts from our paper, Adapting Language-Specific LLMs to a Reasoning Mode https://arxiv.org/abs/2502.09056, accepted at ICLR 2025 SCI-FM workshop.
Reasoning, Thinking, RL and Test-Time Scaling
Collection by Apr 24
readings
Collection by about 3 hours ago