license: mit | |
base_model: | |
- Qwen/Qwen2.5-7B-Instruct | |
library_name: transformers | |
**SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning** | |
[[arXiv]](https://arxiv.org/abs/2504.19162) [[Project]](https://chen-judge.github.io/SPC/) | |
**Jiaqi Chen**, Bang Zhang, Ruotian Ma, Peisong Wang, Xiaodan Liang, Zhaopeng Tu, Xiaolong Li, Kwan-Yee K. Wong. |