OpsEval / data_v2 /huaweicloud_zh_mc_gen.csv
Junetheriver's picture
update leaderboard 2024-09-06
fe35dbb
raw
history blame
897 Bytes
name,zero_self_con,zero_cot_self_con,few_self_con,few_cot_self_con
Baichuan2-13B-Chat,10.0,23.33,20.0,30.0
Chatglm3-6B,13.33333333,16.66666667,6.666666667,13.33333333
Devops-Model-14B-Chat,16.67,13.33,40.0,23.33
Ernie-Bot-4.0,16.67,20.0,36.67,23.33
Gpt-3.5-Turbo,13.33,26.67,20.0,23.33
GPT-4,20.0,20.0,43.33,46.67
Internlm2-Chat-20B,13.33333333,20.0,16.66666667,
Internlm2-Chat-7B,43.33333333,23.33333333,30.0,40.0
Llama-2-13B,10.0,20.0,26.67,13.33
Llama-2-70B-Chat,3.33,20.0,23.33,16.67
Llama-2-7B,10.0,26.67,16.67,33.33
Mistral-7B,0.0,23.33,0.0,16.67
Qwen-14B-Chat,13.33,26.67,30.0,33.33
Qwen-72B-Chat,36.67,33.33,43.33,36.67
Yi-34B-Chat,40.0,30.0,46.67,43.33
Claude-3-Opus,55.0,,,
gemma_2b,26.66667,10.0,26.66667,20.0
gemma_7b,3.333333,23.33333,13.33333,30.0
Meta-Llama-3-8B-Instruct,27.5,22.5,30.0,30.0
Qwen1.5-14B-Base,20.0,33.33333,20.0,30.0
Qwen1.5-14B-Chat,26.66667,13.33333,26.66667,30.0