-
38
Llama 3.2V 11B Cot
💬Generate descriptions and answers by combining text and images
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 4.49k • 154 -
Xkev/LLaVA-CoT-100k
Viewer • Updated • 98.6k • 1.34k • 96 -
LLaVA-o1: Let Vision Language Models Reason Step-by-Step
Paper • 2411.10440 • Published • 128
Guowei Xu PRO
Xkev
AI & ML interests
None yet
Recent Activity
updated
a model
about 3 hours ago
Xkev/qwen-2.5-openthoughts-10k-subset-zosft
published
a model
about 4 hours ago
Xkev/qwen-2.5-openthoughts-10k-subset-zosft
liked
a model
7 days ago
openai/gpt-oss-20b
Organizations
None yet