Unfortunately their paper (https://arxiv.org/abs/2508.08243) provides very little information on the training methodology. Maybe @Jeol would like to share?
This is a really good question, and for a long time I have suspected that this is the case. I would be curious as to how Jinx-Qwen3-32B scores on AHA vs the base.