Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 4 days ago • 127 • 3
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation Paper • 2409.12941 • Published Sep 19, 2024 • 25 • 5
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? Paper • 2502.00674 • Published Feb 2 • 13 • 4
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? Paper • 2502.00674 • Published Feb 2 • 13 • 4
RouteLLM: Learning to Route LLMs with Preference Data Paper • 2406.18665 • Published Jun 26, 2024 • 6 • 1
Adaptive Margin Global Classifier for Exemplar-Free Class-Incremental Learning Paper • 2409.13275 • Published Sep 20, 2024 • 1 • 1
Protoformer: Embedding Prototypes for Transformers Paper • 2206.12710 • Published Jun 25, 2022 • 1 • 1