VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL Paper • 2505.23977 • Published 8 days ago • 9
VisualSphinx-V1 Collection VisualSphinx-V1 is the largest fully-synthetic open-source dataset providing vision logic puzzles. • 7 items • Updated 4 days ago • 1
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL Paper • 2505.23977 • Published 8 days ago • 9
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL Paper • 2505.23977 • Published 8 days ago • 9 • 2
zhangchenxu/Qwen2.5-Coder-7B-Instruct-taco_nosys-5k-llch_nosys-Vanilla-64-2N_step462 Updated 4 days ago • 66
zhangchenxu/Qwen2.5-Coder-7B-Instruct-taco_nosys-5k-llch_nosys-Vanilla-64-2N_step462 Updated 4 days ago • 66
zhangchenxu/Qwen2.5-Coder-7B-Instruct-taco_nosys-5k-llch_nosys-Vanilla-64-2N_step256 Updated 4 days ago • 8
zhangchenxu/Qwen2.5-Coder-7B-Instruct-taco_nosys-5k-llch_nosys-Vanilla-64-2N_step256 Updated 4 days ago • 8
zhangchenxu/Qwen2.5-Coder-7B-Instruct-taco_nosys-5k-llch_nosys-Vanilla-64-2N_step64 Updated 4 days ago • 8
zhangchenxu/Qwen2.5-Coder-7B-Instruct-taco_nosys-5k-llch_nosys-Vanilla-64-2N_step64 Updated 4 days ago • 8
Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach Paper • 2505.18882 • Published 13 days ago • 14
TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning Paper • 2505.14625 • Published 18 days ago • 13