kevin1020's Collections

Efficient LLM

updated Feb 24

  • Phantom of Latent for Large Language and Vision Models

    Paper • 2409.14713 • Published Sep 23, 2024 • 30

  • SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

    Paper • 2410.13276 • Published Oct 17, 2024 • 30

  • LightThinker: Thinking Step-by-Step Compression

    Paper • 2502.15589 • Published Feb 21, 2025 • 29