Ctrl+K
Upload eval_results/luckeciano/Qwen-2.5-7B-RL-AC-BigLRv3-Fast-4-v3-AdamEps6/36ec96284216458fc05f39ffe9cd8816f474938f/eval_llm/results_2025-04-18T20-21-47.947931.json with huggingface_hub
1ef235a
verified
- Qwen-2.5-1.5B-Simple-RL
- Qwen-2.5-7B-Answer-Entropy-RL-0.1
- Qwen-2.5-7B-Answer-Entropy-RL-0.4
- Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response
- Qwen-2.5-7B-Embedding-Entropy-RL-0.1
- Qwen-2.5-7B-Embedding-Entropy-RL-0.25
- Qwen-2.5-7B-Embedding-Entropy-RL-Len-Penalty
- Qwen-2.5-7B-Len-Penalty-Baseline-v2
- Qwen-2.5-7B-Len-Penalty-Baseline
- Qwen-2.5-7B-Missing-Response-RL-Baseline
- Qwen-2.5-7B-RL-AC-BigLRv3-Fast-4-v3-AdamEps6
- Qwen-2.5-7B-RL-AC-BigLRv3-Fast-4
- Qwen-2.5-7B-RL-AC-BigLRv3
- Qwen-2.5-7B-Simple-RL