63 90 204

Asankhaya Sharma PRO

codelion

http://asankhaya.github.io/

AI & ML interests

AI/ML, Dev Tools and Application Security

Recent Activity

upvoted a paper 1 day ago

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

upvoted a paper 1 day ago

ATLAS: Learning to Optimally Memorize the Context at Test Time

liked a model 2 days ago

google/gemma-3-12b-it-qat-q4_0-gguf

View all activity

Organizations

codelion's activity

commented a paper 2 days ago

Thinker: Learning to Think Fast and Slow

Paper • 2505.21097 • Published 10 days ago • 10 •

commented a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 4 days ago • 127 •

commented a paper 21 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 119 •

New activity in codelion/Qwen3-0.6B-pts-steering-vectors about 1 month ago

Upload steering_vectors.jsonl

#1 opened about 1 month ago by

codelion

New activity in codelion/Qwen3-0.6B-PTS-DPO about 1 month ago

Adding `safetensors` variant of this model

#1 opened about 1 month ago by

SFconvertbot

commented a paper about 1 month ago

Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation

Paper • 2409.12941 • Published Sep 19, 2024 • 25 •

New activity in patched-codes/static-analysis-eval about 2 months ago

Add link to paper

#3 opened about 2 months ago by

nielsr

New activity in codelion/optillm 3 months ago

Implement other approach that optillm supported?

#5 opened 3 months ago by

melekuk

New activity in codelion/LogProbsVisualizer 3 months ago

Update app.py

#1 opened 3 months ago by

codelion

commented a paper 4 months ago

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published Feb 13 • 23 •

New activity in deepseek-ai/Janus-Pro-1B 4 months ago

Issue with Flash attention while running Janus-Pro-1B model locally on Mac (Solved)

#8 opened 4 months ago by

saurabhksa1

commented 2 papers 4 months ago

Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?

Paper • 2502.00674 • Published Feb 2 • 13 •

Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?

Paper • 2502.00674 • Published Feb 2 • 13 •

New activity in ProsusAI/finbert 5 months ago

What do the labels mean?

#21 opened 10 months ago by

gleni2101

commented a paper 5 months ago

RouteLLM: Learning to Route LLMs with Preference Data

Paper • 2406.18665 • Published Jun 26, 2024 • 6 •

New activity in codelion/Llama-3.2-3B-o1 5 months ago

Nice! Any chance we can have access to the unquantified model files?

#1 opened 5 months ago by

Joseph717171

New activity in Qwen/QwQ-32B-Preview 5 months ago

Can't reproduce the evaluation result of GPQA dataset

➕ 1

#47 opened 6 months ago by

Rinn000

commented 3 papers 5 months ago