evals-for-every-language / results.json
davidpomerenke's picture
Upload from GitHub Actions: Evaluate on autotranslated GSM dataset
f3a09a2 verified
raw
history
2.7 MB
File too large to display, you can check the raw version instead.