metadata
license: apache-2.0
datasets:
- kolerk/TON-Math-SFT
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen2.5-VL-7B-Instruct
pipeline_tag: image-text-to-text
This is the model cited in the paper: Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models.