Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
fal
Novita
Nscale
Cohere
Hyperbolic
Cerebras
Nebius AI Studio
Fireworks
Replicate
SambaNova
Together AI
HF Inference API
Misc
Reset Misc
arxiv:
2502.10645
custom_code
Misc with no match
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
9
Full-text search
Edit filters
Sort: Trending
Active filters:
2502.10645
Clear all
BabyLM-community/babylm-interaction-baseline-simpo
Updated
about 23 hours ago
•
83
•
1
BabyLM-community/babylm-baseline-100m-gpt-bert-causal-focus
Updated
23 days ago
•
90
BabyLM-community/babylm-baseline-100m-gpt-bert-mixed
Updated
23 days ago
•
119
BabyLM-community/babylm-baseline-10m-gpt-bert-mixed
Updated
23 days ago
•
167
BabyLM-community/babylm-baseline-10m-gpt-bert-causal-focus
Updated
23 days ago
•
190
BabyLM-community/babylm-baseline-10m-gpt-bert-masked-focus
Updated
23 days ago
•
235
BabyLM-community/babylm-baseline-100m-gpt-bert-masked-focus
Updated
23 days ago
•
196
BabyLM-community/babylm-baseline-100m-gpt2
Updated
about 23 hours ago
•
132
BabyLM-community/babylm-baseline-10m-gpt2
Updated
about 23 hours ago
•
532