4 10 13

Peng Jin

Chat-UniVi

https://scholar.google.com/citations?user=HHXLexAAAAAJ&hl=en

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

authored a paper 4 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

upvoted a paper 4 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

View all activity

Organizations

None yet

Chat-UniVi's activity

upvoted a paper 2 days ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published 3 days ago • 55

authored a paper 4 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 91

upvoted a paper 4 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 91

liked a model 6 months ago

Chat-UniVi/Chat-UniVi-7B-v1.5

Video-Text-to-Text • Updated Dec 7, 2024 • 38 • 2

updated 4 models 6 months ago

liked a Space 6 months ago

115

ViewCrafter

🐨

Create a video from an image with camera motion

authored a paper 7 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 125

upvoted a paper 7 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 95

liked a dataset 8 months ago

Chat-UniVi/Chat-UniVi-Eval

Preview • Updated Nov 23, 2023 • 20 • 5

liked 3 models 8 months ago

Chat-UniVi/MoE-Plus-Plus-7B

Text Generation • Updated Dec 7, 2024 • 324 • 5

Chat-UniVi/MoH-LLaMA3-8B

Text Generation • Updated Dec 7, 2024 • 308 • 3

Chat-UniVi/MoH-DiT-XL-90

Updated Oct 17, 2024 • 3

New activity in Chat-UniVi/Chat-UniVi 8 months ago

Update pipeline tag

#1 opened 8 months ago by

nielsr

updated a model 8 months ago

Chat-UniVi/Chat-UniVi

Video-Text-to-Text • Updated Oct 22, 2024 • 168 • 17

commented a paper 8 months ago

MoH: Multi-Head Attention as Mixture-of-Head Attention

Paper • 2410.11842 • Published Oct 15, 2024 • 22 •

authored 2 papers 8 months ago

DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Paper • 2303.09867 • Published Mar 17, 2023

Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation

Paper • 2303.13399 • Published Mar 23, 2023