New LLM Algorithms - a BHbean Collection

BHbean 's Collections

Survey

MoE LLM Systems

LLM resource-constrained Inference

New LLM Algorithms

LLM Internal Mechanism

Prompt Engineering

Speculative Decoding

KV Cache Compression

LLM reasoning systems

New LLM Algorithms

updated Apr 4

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1 • 52