Model Sources
- Repository: https://github.com/maifoundations/Visionary-R1
- Paper: https://arxiv.org/pdf/2505.14677
- Blog: https://www.maifoundations.com/blog/visionary-r1/
Uses
The model is trained based on the Qwen2.5-VL-3B-Instruct. You can follow the instructions of Qwen2.5-VL to use the checkpoints.
Citation
@article{xia2025visionary,
title={Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning},
author={Xia, Jiaer and Zang, Yuhang and Gao, Peng and Li, Yixuan and Zhou, Kaiyang},
journal={arXiv preprint arXiv:2505.14677},
year={2025}
}
- Downloads last month
- 6
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support