-
Let LLMs Break Free from Overthinking via Self-Braking Tuning
Paper • 2505.14604 • Published • 23 -
AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
Paper • 2505.16944 • Published • 8 -
Training Step-Level Reasoning Verifiers with Formal Verification Tools
Paper • 2505.15960 • Published • 7 -
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
Paper • 2505.15134 • Published • 6
Felix Tuma
floom
AI & ML interests
NLP
Recent Activity
updated
a collection
about 18 hours ago
PotentialApplication
liked
a model
3 days ago
Salesforce/Llama-xLAM-2-70b-fc-r
updated
a collection
5 days ago
PotentialApplication
Organizations
None yet
Collections
30
-
Atla Selene Mini: A General Purpose Evaluation Model
Paper • 2501.17195 • Published • 36 -
DeepSeek-V3 Technical Report
Paper • 2412.19437 • Published • 63 -
Optimizing Large Language Model Training Using FP4 Quantization
Paper • 2501.17116 • Published • 38 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 123
models
0
None public yet
datasets
0
None public yet