maifoundations
/

Visionary-R1

Model card Files Files and versions Community

Model Sources

Repository: https://github.com/maifoundations/Visionary-R1
Paper: https://arxiv.org/pdf/2505.14677
Blog: https://www.maifoundations.com/blog/visionary-r1/

Uses

The model is trained based on the Qwen2.5-VL-3B-Instruct. You can follow the instructions of Qwen2.5-VL to use the checkpoints.

Citation

@article{xia2025visionary,
  title={Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning},
  author={Xia, Jiaer and Zang, Yuhang and Gao, Peng and Li, Yixuan and Zhou, Kaiyang},
  journal={arXiv preprint arXiv:2505.14677},
  year={2025}
}

Downloads last month: 6

Safetensors

Model size

4.07B params

Tensor type

BF16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for maifoundations/Visionary-R1

Base model

Qwen/Qwen2.5-VL-3B-Instruct

Finetuned

(205)

this model

Quantizations