Spaces:

PatronusAI
/

TRAIL

Running

jitinpatronus commited on 28 days ago

Commit

8ba6848

verified ·

1 Parent(s): 3069ced

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -23,9 +23,12 @@ This is a Hugging Face Space that hosts a leaderboard for comparing model perfor
 ## Instructions
-1. Please refer to our GitHub repository at https://github.com/patronus-ai/trail-benchmark for step‑by‑step instructions on how to run your model with the TRAIL dataset.
-2. Compress the resulting JSON outputs into a ZIP archive whose filename begins with SWE_/GAIA_, and submit it.
-3. Once the evaluation is complete, we’ll upload the scores (this process will soon be automated).
 ## Benchmarking on TRAIL

 ## Instructions
+* Please refer to our GitHub repository at https://github.com/patronus-ai/trail-benchmark for step‑by‑step instructions on how to run your model with the TRAIL dataset.
+* Please upload a zip file containing your model outputs. The zip file should contain:
+  - One or more directories with model outputs
+  - Each directory should contain JSON files with the model's predictions
+  - Directory names should indicate the split (GAIA_ or SWE_)
+* Once the evaluation is complete, we’ll upload the scores (this process will soon be automated).
 ## Benchmarking on TRAIL