63 90 204

Asankhaya Sharma PRO

codelion

http://asankhaya.github.io/

AI & ML interests

AI/ML, Dev Tools and Application Security

Recent Activity

upvoted a paper 1 day ago

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

upvoted a paper 1 day ago

ATLAS: Learning to Optimally Memorize the Context at Test Time

liked a model 2 days ago

google/gemma-3-12b-it-qat-q4_0-gguf

View all activity

Organizations

codelion's activity

upvoted 2 papers 1 day ago

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Paper • 2406.10149 • Published Jun 14, 2024 • 53

ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper • 2505.23735 • Published 8 days ago • 22

upvoted a paper 2 days ago

Thinker: Learning to Think Fast and Slow

Paper • 2505.21097 • Published 10 days ago • 10

upvoted a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 4 days ago • 127

upvoted a paper 4 days ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published 7 days ago • 86

upvoted an article 4 days ago

Article

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

•

4 days ago

• 10

upvoted a paper 10 days ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published 14 days ago • 77

upvoted an article 10 days ago

Article

AutoThink: Adaptive Reasoning for Large Language Models

•

10 days ago

• 4

upvoted a paper 17 days ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published 21 days ago • 116

upvoted an article 17 days ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

•

17 days ago

• 19

upvoted an article 20 days ago

Article

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

•

20 days ago

• 5

upvoted a paper 21 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 119

upvoted a collection 24 days ago

Pivotal Token Search

Collection

Pivotal Token Search (PTS) identifies tokens in a language model's generation that significantly impact the probability of success • 9 items • Updated 24 days ago • 3

upvoted 2 papers 25 days ago

Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published 25 days ago • 45

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published 29 days ago • 24

upvoted a paper 29 days ago

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published 30 days ago • 64

upvoted 4 papers about 1 month ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 168

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published Apr 30 • 47

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30 • 45

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Paper • 2504.21776 • Published Apr 30 • 56