Spaces:

evalitahf
/

evalita_llm_leaderboard

Running

App Files Files Community

evalita_llm_leaderboard / src

Ctrl+K

Ctrl+K

3 contributors

History: 29 commits

rzanoli's picture

Rename prompts for LS, SU, NER, and REL

d0105c8 7 days ago

display
Add theoretical performance of a model that scores the highest on every individual task 10 days ago
leaderboard
Add theoretical performance of a model that scores the highest on every individual task 10 days ago
submission
Small changes 6 months ago
about.py

10.4 kB

Added computation and display of the standard deviation across individual prompt accuracy values for each task about 1 month ago
envs.py

1.03 kB

Small changes 5 months ago
populate.py

2.68 kB

Added computation and display of the standard deviation across individual prompt accuracy values for each task about 1 month ago
tasks.py

18.8 kB

Rename prompts for LS, SU, NER, and REL 7 days ago