Spaces:

evalitahf
/

evalita_llm_leaderboard

Running

App Files Files Community

evalita_llm_leaderboard / src

Ctrl+K

Ctrl+K

3 contributors

History: 30 commits

rzanoli's picture

Remove author prefix from model names

831dff0 about 13 hours ago

display
Remove author prefix from model names about 13 hours ago
leaderboard
Add theoretical performance of a model that scores the highest on every individual task 4 days ago
submission
Small changes 5 months ago
about.py

10.4 kB

Added computation and display of the standard deviation across individual prompt accuracy values for each task about 1 month ago
envs.py

1.03 kB

Small changes 5 months ago
populate.py

2.68 kB

Added computation and display of the standard deviation across individual prompt accuracy values for each task about 1 month ago
tasks.py
18.8 kB

Rename prompts for LS, SU, NER, and REL about 15 hours ago