Model and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025]
Federico Cocchi
fede97
AI & ML interests
Multimodal LLM - Computer Vision
Recent Activity
updated
a dataset
23 days ago
aimagelab/RAID
updated
a collection
25 days ago
RAID
updated
a collection
25 days ago
RAID
Organizations
Collections
5
models
0
None public yet
datasets
5
fede97/external_test_set_v1
Viewer
•
Updated
•
340
•
27
fede97/external_data_test_example_v3
Updated
•
6
fede97/external_data_test_example
Viewer
•
Updated
•
410
•
70
fede97/external_data_test_example_v2
Viewer
•
Updated
•
410
•
54
fede97/dpo_demo
Viewer
•
Updated
•
148k
•
22