Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
evalitahf
/
evalita_llm_leaderboard
like
5
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
evalita_llm_leaderboard
/
src
Ctrl+K
Ctrl+K
3 contributors
History:
30 commits
rzanoli
Remove author prefix from model names
831dff0
about 13 hours ago
display
Remove author prefix from model names
about 13 hours ago
leaderboard
Add theoretical performance of a model that scores the highest on every individual task
4 days ago
submission
Small changes
5 months ago
about.py
Safe
10.4 kB
Added computation and display of the standard deviation across individual prompt accuracy values for each task
about 1 month ago
envs.py
Safe
1.03 kB
Small changes
5 months ago
populate.py
Safe
2.68 kB
Added computation and display of the standard deviation across individual prompt accuracy values for each task
about 1 month ago
tasks.py
18.8 kB
Rename prompts for LS, SU, NER, and REL
about 15 hours ago