Spaces: HASHIRUAgentX/hashiruAI
eb835bc
hashiruAI/bench
10 contributors
History: 8 commits
helloparthshah: QOL updates and refactoring. Also fixed the tool/agent budgeting (6900003, about 2 months ago)
File | Size | Last commit message | Last updated
benchmarking_connections.py | 3.47 kB | QOL updates and refactoring. Also fixed the tool/agent budgeting | about 2 months ago
benchmarking_globle.py | 5.62 kB | Add benchmarking functionality for Globle game | about 2 months ago
benchmarking_hle.py | 5.85 kB | Refactor get_last_assistant_content function to improve response handling and support various response formats | about 2 months ago
benchmarking_paper_reviews.py | 3.6 kB | Add paper benchmarking, along with dataset for it | about 2 months ago
benchmarking_wordle.py | 3.96 kB | Add benchmarking script for Wordle game | about 2 months ago