Robin Williams PRO

bfuzzy1

AI & ML interests

None yet

Organizations

None yet

bfuzzy1's activity

upvoted a paper 4 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 5 days ago • 129

updated a collection 11 days ago

Nifty

39 items • Updated 11 days ago

upvoted 5 papers 11 days ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published 13 days ago • 144

Truth Neurons

Paper • 2505.12182 • Published 20 days ago • 7

dKV-Cache: The Cache for Diffusion Language Models

Paper • 2505.15781 • Published 17 days ago • 16

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published 17 days ago • 53

Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models

Paper • 2505.16134 • Published 16 days ago • 18

updated a collection 13 days ago

Nifty

39 items • Updated 11 days ago

updated a collection 16 days ago

Nifty

39 items • Updated 11 days ago

upvoted a paper 25 days ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 54

updated a collection about 1 month ago

Nifty

39 items • Updated 11 days ago

upvoted 6 papers about 1 month ago

X-Fusion: Introducing New Modality to Frozen Large Language Models

Paper • 2504.20996 • Published Apr 29 • 12

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 94

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 55

Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Paper • 2504.20157 • Published Apr 28 • 37

TesserAct: Learning 4D Embodied World Models

Paper • 2504.20995 • Published Apr 29 • 20

updated a collection about 1 month ago

Nifty

39 items • Updated 11 days ago

upvoted a paper about 1 month ago

Taming the Titans: A Survey of Efficient LLM Inference Serving

Paper • 2504.19720 • Published Apr 28 • 10