ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
answerdotai/ModernBERT-base • Fill-Mask • 0.1B • Updated Jan 15 • 1.14M • 901
answerdotai/ModernBERT-large • Fill-Mask • 0.4B • Updated Jan 15 • 135k • 413
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference • Paper • 2412.13663 • Published Dec 18, 2024 • 152
CLA-Experiments
answerdotai/llama3-8b-instruct-CLA-3 • Text Generation • Updated Jul 18, 2024 • 11 • 1
answerdotai/llama3-8b-instruct-CLA-2 • Text Generation • Updated Jul 18, 2024 • 3 • 1