Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:

Duplicated fromย  HASHIRUAgentX/hashiruAI

guineapig
/
hashiruAI
Sleeping

App Files Files Community
Fetching metadata from the HF Docker repository...
hashiruAI / bench
Ctrl+K
Ctrl+K
  • 10 contributors
History: 8 commits
helloparthshah's picture
helloparthshah
QOL updates and refactoring. Also fixed the tool/agent budgeting
6900003 about 2 months ago
  • benchmarking_connections.py
    3.47 kB
    QOL updates and refactoring. Also fixed the tool/agent budgeting about 2 months ago
  • benchmarking_globle.py
    5.62 kB
    Add benchmarking functionality for Globle game about 2 months ago
  • benchmarking_hle.py
    5.85 kB
    Refactor get_last_assistant_content function to improve response handling and support various response formats about 2 months ago
  • benchmarking_paper_reviews.py
    3.6 kB
    Add paper benchmarking, along with dataset for it about 2 months ago
  • benchmarking_wordle.py
    3.96 kB
    Add benchmarking script for Wordle game about 2 months ago