Commit History

Upload from GitHub Actions: Improve UX and style
53d2039
verified

davidpomerenke commited on

Upload from GitHub Actions: Merge remote changes and apply terminology updates: Commercial->closed-source, Open->open-source
ebaf279
verified

davidpomerenke commited on

Upload from GitHub Actions: Use task subset for average score
b1e5b40
verified

davidpomerenke commited on

Upload from GitHub Actions: Eavaluate on 40 languages
941d5c5
verified

davidpomerenke commited on

Upload from GitHub Actions: Add math benchmarks
549360a
verified

davidpomerenke commited on

Upload from GitHub Actions: Use FLORES+ via Huggingface
913253a
verified

davidpomerenke commited on

Upload from GitHub Actions: Quick fixes
9c2c019
verified

davidpomerenke commited on

Upload from GitHub Actions: Display N/A scores as such
1e8952a
verified

davidpomerenke commited on

Only run tasks for which there is no result yet
2f9dee1

David Pomerenke commited on

Fix response when no evals data is available
32d50b0

David Pomerenke commited on

Add Global MMLU benchmark
ce2acb0

David Pomerenke commited on

Translation both from and to
731eddd

David Pomerenke commited on

Add OpenRouter metadata to models
9002fc2

David Pomerenke commited on

Run on 100 languages, adjust display
8274634

David Pomerenke commited on

Add Dockerfile
4d13673

David Pomerenke commited on

Fix world map and apply filters for it
92d8154

David Pomerenke commited on

Fix and refactor backend filtering
eb1696c

David Pomerenke commited on

Speed things up
566c57e

David Pomerenke commited on

Language selection checkboxes & filtering in backend
d91b022

David Pomerenke commited on

Basic backend setup with FastApi but without actual filtering
2c21cf7

David Pomerenke commited on