hashiruAI / bench

Commit History

Refactor benchmarking script to implement HLE dataset performance evaluation and improve response handling
aa7e221

Kunal Pai committed

Add benchmarking script for GlobleDistanceTool via Gradio API
97e9ed5

Kunal Pai committed