Added computation and display of the standard deviation across individual prompt accuracy values for each task
67324c2 rzanoli committed on Jul 21
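For illustration only, the computation described in the commit message might look roughly like the sketch below. The function name `summarize_prompt_accuracies`, the dictionary layout, and the sample numbers are all hypothetical and are not taken from the commit. The commit message does not say whether the population or the sample standard deviation is used; this sketch assumes the population form.

```python
from statistics import mean, pstdev


def summarize_prompt_accuracies(task_accuracies):
    """Return {task: (mean_accuracy, std_accuracy)}.

    `task_accuracies` maps a task name to the list of accuracy values
    obtained with each individual prompt (hypothetical layout).
    """
    summary = {}
    for task, accuracies in task_accuracies.items():
        # Assumption: population standard deviation (divides by N).
        # Swap in statistics.stdev for the sample form (divides by N - 1).
        summary[task] = (mean(accuracies), pstdev(accuracies))
    return summary


if __name__ == "__main__":
    # Made-up accuracy values, one per prompt, for two made-up tasks.
    example = {
        "task_a": [0.50, 0.60, 0.70],
        "task_b": [0.80, 0.80],
    }
    for task, (avg, std) in summarize_prompt_accuracies(example).items():
        print(f"{task}: mean={avg:.3f} std={std:.3f}")
```

A low standard deviation suggests that a task's accuracy is stable across prompt formulations, while a high one suggests the result depends strongly on the prompt wording.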