Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
evalitahf
/
evalita_llm_leaderboard
like
5
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
d0105c8
evalita_llm_leaderboard
/
src
Ctrl+K
Ctrl+K
3 contributors
History:
29 commits
rzanoli
Rename prompts for LS, SU, NER, and REL
d0105c8
7 days ago
display
Add theoretical performance of a model that scores the highest on every individual task
10 days ago
leaderboard
Add theoretical performance of a model that scores the highest on every individual task
10 days ago
submission
Small changes
6 months ago
about.py
Safe
10.4 kB
Added computation and display of the standard deviation across individual prompt accuracy values for each task
about 1 month ago
envs.py
Safe
1.03 kB
Small changes
5 months ago
populate.py
Safe
2.68 kB
Added computation and display of the standard deviation across individual prompt accuracy values for each task
about 1 month ago
tasks.py
Safe
18.8 kB
Rename prompts for LS, SU, NER, and REL
7 days ago