Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -23,8 +23,8 @@ This is a Hugging Face Space that hosts a leaderboard for comparing model perfor
|
|
23 |
|
24 |
## Instructions
|
25 |
|
26 |
-
1. Please refer to our GitHub repository at https://github.com/patronus-ai/trail-benchmark
|
27 |
-
2. Compress the resulting JSON outputs into a ZIP archive whose filename begins with
|
28 |
3. Once the evaluation is complete, we’ll upload the scores (this process will soon be automated).
|
29 |
|
30 |
## Benchmarking on TRAIL
|
|
|
23 |
|
24 |
## Instructions
|
25 |
|
26 |
+
1. Please refer to our GitHub repository at https://github.com/patronus-ai/trail-benchmark for step‑by‑step instructions on how to run your model with the TRAIL dataset.
|
27 |
+
2. Compress the resulting JSON outputs into a ZIP archive whose filename begins with SWE_/GAIA_, and submit it.
|
28 |
3. Once the evaluation is complete, we’ll upload the scores (this process will soon be automated).
|
29 |
|
30 |
## Benchmarking on TRAIL
|