2 14 11

Aurelien Lac

uminaty

uminaty

AI & ML interests

Computer vision, image generation, image translation, LLMs, multimodal AI

Recent Activity

new activity 4 days ago

lightonai/MonoQwen2-VL-v0.1:Add pipeline tag & library name

upvoted a paper 17 days ago

Emerging Properties in Unified Multimodal Pretraining

upvoted a paper about 2 months ago

SmolVLM: Redefining small and efficient multimodal models

View all activity

Organizations

uminaty's activity

New activity in lightonai/MonoQwen2-VL-v0.1 4 days ago

Add pipeline tag & library name

#3 opened 4 days ago by

merve

upvoted a paper 17 days ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published 18 days ago • 129

upvoted a paper about 2 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 188

liked a Space about 2 months ago

144

Vidore Leaderboard

🥇

Display visual document retrieval leaderboard

liked a model 4 months ago

Zyphra/Zonos-v0.1-hybrid

Text-to-Speech • Updated 4 days ago • 12.8k • 1.08k

upvoted a collection 4 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated Apr 28 • 484

liked 2 models 6 months ago

answerdotai/ModernBERT-base

Fill-Mask • Updated Jan 15 • 510k • 858

answerdotai/ModernBERT-large

Fill-Mask • Updated Jan 15 • 67.7k • 397

upvoted a paper 6 months ago

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Paper • 2412.09013 • Published Dec 12, 2024 • 13

New activity in lightonai/MonoQwen2-VL-v0.1 7 months ago

Model loading issues

#1 opened 7 months ago by

MaxJeblick

updated 2 models 7 months ago

lightonai/MonoQwen2-VL-v0.1

Visual Document Retrieval • Updated 4 days ago • 656 • 40

lightonai/MonoQwen2-VL-v0.1

Visual Document Retrieval • Updated 4 days ago • 656 • 40

liked a model 7 months ago

lightonai/MonoQwen2-VL-v0.1

Visual Document Retrieval • Updated 4 days ago • 656 • 40

liked a Space 8 months ago

Vision Pipeline

🌍

Query an image index to get answers

upvoted an article 10 months ago

Article

ArabicWeb24: Creating a High Quality Arabic Web-only Pre-training Dataset

•

Aug 8, 2024

• 11

upvoted a paper 11 months ago

ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 48

liked 2 models about 1 year ago

stabilityai/stable-audio-open-1.0

Text-to-Audio • Updated Apr 1 • 40.3k • 1.22k

microsoft/Phi-3-medium-128k-instruct

Text Generation • Updated Aug 20, 2024 • 10.7k • 381

upvoted a paper about 1 year ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 113

upvoted a paper over 1 year ago

World Model on Million-Length Video And Language With RingAttention

Paper • 2402.08268 • Published Feb 13, 2024 • 40