Safetensors
English
qwen2_5_vl

🦁 VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

VisualSphinx is the largest fully-synthetic open-source dataset providing vision logic puzzles. It consists of over 660K automatically generated logical visual puzzles. Each logical puzzle is grounded with an interpretable rule and accompanied by both correct answers and plausible distractors.

πŸ“Š About This Model

This model is used for tagging the difficulty of our VisualSphinx-V1 synthetic dataset. To train this model, we perform GRPO on Qwen/Qwen2.5-VL-7B-Instruct using our seed dataset for 256 steps.

Downloads last month
8
Safetensors
Model size
8.29B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for VisualSphinx/VisualSphinx-Difficulty-Tagging

Finetuned
(346)
this model

Dataset used to train VisualSphinx/VisualSphinx-Difficulty-Tagging

Collection including VisualSphinx/VisualSphinx-Difficulty-Tagging