Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval
Kong
friedrichor
AI & ML interests
Multimodal Dialogue, Large Multimodal Model, Large Language Model
Recent Activity
new activity
about 14 hours ago
friedrichor/Unite-Instruct-Retrieval-Train:[bot] Conversion to Parquet
updated
a model
about 16 hours ago
friedrichor/Unite-Instruct-Qwen2-VL-7B
updated
a model
about 16 hours ago
friedrichor/Unite-Instruct-Qwen2-VL-2B
Organizations
Collections
1
Papers
2
models
5

friedrichor/Unite-Instruct-Qwen2-VL-7B
Feature Extraction
•
Updated
•
25

friedrichor/Unite-Instruct-Qwen2-VL-2B
Feature Extraction
•
Updated
•
7

friedrichor/Unite-Base-Qwen2-VL-7B
Feature Extraction
•
Updated
•
18

friedrichor/Unite-Base-Qwen2-VL-2B
Feature Extraction
•
Updated
•
11

friedrichor/stable-diffusion-2-1-realistic
Text-to-Image
•
Updated
•
58
•
4
datasets
9
friedrichor/TUNA-Bench
Viewer
•
Updated
•
3.43k
•
136
friedrichor/Unite-Instruct-Retrieval-Train
Viewer
•
Updated
•
1.27M
•
249
•
1
friedrichor/Unite-Base-Retrieval-Train
Viewer
•
Updated
•
6.38M
•
478
friedrichor/ActivityNet_Captions
Viewer
•
Updated
•
19.8k
•
161
•
1
friedrichor/MSVD
Viewer
•
Updated
•
1.97k
•
141
•
1
friedrichor/MSR-VTT
Viewer
•
Updated
•
17k
•
429
•
1
friedrichor/DiDeMo
Viewer
•
Updated
•
9.4k
•
1.16k
•
3
friedrichor/PhotoChat_image
Viewer
•
Updated
•
8.54k
•
101
•
2
friedrichor/PhotoChat_120_square_HQ
Viewer
•
Updated
•
120
•
34