Hugging Face
bird-of-paradise/deepseek-mla
Tags: Text Generation, Transformers, PyTorch, English, deepseek-mla, attention-mechanism, mla, efficient-attention
arxiv: 2405.04434
License: mit
2 contributors
History: 1 commit
Latest commit: "initial commit" by bird-of-paradise (eb53dde, verified), 6 months ago
.gitattributes    Safe    1.52 kB     initial commit    6 months ago
README.md         Safe    24 Bytes    initial commit    6 months ago