Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhenyu Zhang's picture
2 6 1

Zhenyu Zhang

Kyriection
cspikachu's profile picture 21world's profile picture Titus-von-Koeller's profile picture
·
  • KyriectionZhang
  • Kyriection

AI & ML interests

Large Language Models, Efficient Machine Learning, Quantum Computing

Organizations

None yet

Kyriection's activity

upvoted a paper 3 months ago

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam

Paper • 2502.17055 • Published Feb 24 • 18
upvoted a paper 4 months ago

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

Paper • 2502.07490 • Published Feb 11 • 9
upvoted 2 papers 6 months ago

Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN

Paper • 2412.13795 • Published Dec 18, 2024 • 20

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published Dec 6, 2024 • 39
upvoted a paper 11 months ago

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11, 2024 • 34
upvoted a paper almost 2 years ago

H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Paper • 2306.14048 • Published Jun 24, 2023 • 12
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs