Training, eval suite, and model from the paper "Large Scale Transfer Learning for Tabular Data via Language Modeling" (https://arxiv.org/abs/2406.12031)
ML Foundations
non-profit
Collections
4
Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens"
- MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
  Paper • 2406.11271 • Published • 21
- mlfoundations/MINT-1T-HTML
  Viewer • Updated • 623M • 98.4k • 85
- mlfoundations/MINT-1T-ArXiv
  Viewer • Updated • 5.6M • 4.98k • 47
- mlfoundations/MINT-1T-PDF-CC-2024-18
  Updated • 14k • 19
spaces
2
models
8
mlfoundations/dclm-7b-it
Updated • 33 • 11
mlfoundations/dclm-fasttext-variants
Updated
mlfoundations/fasttext-oh-eli5
Updated • 22
mlfoundations/tabula-8b
Text Generation • Updated • 2.9k • 25
mlfoundations/dclm-it-quantized
Updated
mlfoundations/scaling
Updated • 4
mlfoundations/open_lm_7B_1.25T
Updated • 4
mlfoundations/open_lm_1B
Updated • 11
datasets
38
mlfoundations/nnetnav-live-uitars
Viewer • Updated • 59.3k • 90
mlfoundations/cua-v0-metadata
Updated • 487
mlfoundations/cua-idm-frame-data
Updated • 68
mlfoundations/tabula-8b-eval-suite
Viewer • Updated • 19.7k • 75 • 4
mlfoundations/MINT-1T-HTML
Viewer • Updated • 623M • 98.4k • 85
mlfoundations/MINT-1T-ArXiv
Viewer • Updated • 5.6M • 4.98k • 47
mlfoundations/MINT-1T-PDF-CC-2023-06
Updated • 86.3k • 2
mlfoundations/MINT-1T-PDF-CC-2023-14
Viewer • Updated • 2.13M • 40.6k • 1
mlfoundations/MINT-1T-PDF-CC-2023-23
Viewer • Updated • 2.82M • 21.4k • 1
mlfoundations/MINT-1T-PDF-CC-2023-40
Updated • 47.7k • 1