[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Peng Jin
Chat-UniVi
AI & ML interests
None yet
Recent Activity
authored
a paper
4 months ago
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video
Understanding
upvoted
a
paper
4 months ago
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video
Understanding
Organizations
None yet
Collections
3
models
12
Chat-UniVi/MoH-LLaMA3-8B
Text Generation
•
Updated
•
261
•
3
Chat-UniVi/Chat-UniVi-13B
Video-Text-to-Text
•
Updated
•
157
•
9
Chat-UniVi/Chat-UniVi-7B-v1.5
Video-Text-to-Text
•
Updated
•
32
•
2
Chat-UniVi/MoE-Plus-Plus-7B
Text Generation
•
Updated
•
324
•
5
Chat-UniVi/Chat-UniVi
Video-Text-to-Text
•
Updated
•
179
•
17
Chat-UniVi/MoH-ViT-S-75
Updated
Chat-UniVi/MoH-ViT-S-80
Updated
Chat-UniVi/MoH-ViT-B-50
Updated
Chat-UniVi/MoH-ViT-B-75
Updated
Chat-UniVi/MoH-DiT-XL-90
Updated
•
3