Vishal
mvish7
AI & ML interests
Multi-modal AI and Computer Vision
Recent Activity
new activity
about 12 hours ago
ktian6/NuScenes-SpatialQA:Open sourcing the dataset
commented on
a paper
14 days ago
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal
Understanding
commented on
a paper
2 months ago
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal
Understanding
Organizations
None yet
models
0
None public yet
datasets
0
None public yet