Data shuffled only at the document-level

BabyLM Sequence Length
community
AI & ML interests
BabyLM 2025 paper submission
Recent Activity
models
34

babylm-seqlen/mamba-4096
Updated

babylm-seqlen/opt-8192
Updated

babylm-seqlen/opt-4096
Updated

babylm-seqlen/mamba-8192
Updated
•
14

babylm-seqlen/mamba-8192-warmup
Updated
•
18

babylm-seqlen/mamba-4096-warmup
Updated
•
11

babylm-seqlen/mamba-2048-warmup
Updated
•
4

babylm-seqlen/mamba-512-warmup
Updated
•
6

babylm-seqlen/mamba-1024-warmup
Updated
•
5

babylm-seqlen/mamba-64-warmup
Updated
•
1
datasets
18
babylm-seqlen/train_100M_64
Viewer
•
Updated
•
2.56M
•
12
babylm-seqlen/train_100M_512_single_shuffle
Viewer
•
Updated
•
319k
•
18
babylm-seqlen/train_100M_8192_single_shuffle
Viewer
•
Updated
•
19.8k
•
16
babylm-seqlen/train_100M_2048_single_shuffle
Viewer
•
Updated
•
79.8k
•
16
babylm-seqlen/train_100M_16384_single_shuffle
Viewer
•
Updated
•
9.86k
•
18
babylm-seqlen/train_100M_4096_single_shuffle
Viewer
•
Updated
•
39.8k
•
15
babylm-seqlen/train_100M_256_single_shuffle
Viewer
•
Updated
•
639k
•
19
babylm-seqlen/train_100M_1024_single_shuffle
Viewer
•
Updated
•
160k
•
22
babylm-seqlen/train_100M_128_single_shuffle
Viewer
•
Updated
•
1.28M
•
24
babylm-seqlen/train_100M_64_single_shuffle
Viewer
•
Updated
•
2.56M
•
23