Sam Flin's picture

Sam Flin PRO

sflindrs

·

sflindrs

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

GoodEnough/NiT-XL-Models

upvoted a paper 2 days ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

upvoted a paper 2 days ago

Image Editing As Programs with Diffusion Models

View all activity

Organizations

None yet

sflindrs's activity

upvoted 3 papers 2 days ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published 4 days ago • 55

Image Editing As Programs with Diffusion Models

Paper • 2506.04158 • Published 3 days ago • 19

MiMo-VL Technical Report

Paper • 2506.03569 • Published 3 days ago • 65

upvoted 2 papers 23 days ago

Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis

Paper • 2505.09358 • Published 24 days ago • 24

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published 24 days ago • 90

upvoted a paper 24 days ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published 26 days ago • 124

upvoted a paper 26 days ago

UniVLA: Learning to Act Anywhere with Task-centric Latent Actions

Paper • 2505.06111 • Published 29 days ago • 24

upvoted a paper 29 days ago

StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

Paper • 2505.05467 • Published 30 days ago • 13

upvoted 4 papers about 1 month ago

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Paper • 2505.00703 • Published May 1 • 42

TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching

Paper • 2505.00562 • Published May 1 • 3

Improving Editability in Image Generation with Layer-wise Memory

Paper • 2505.01079 • Published May 2 • 28

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Paper • 2504.20438 • Published Apr 29 • 43

upvoted a collection about 1 month ago

MiniCPM

The MiniCPM family of LLMs and VLLMs. • 32 items • Updated Jan 19 • 70

upvoted 7 papers about 1 month ago

COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning

Paper • 2504.21850 • Published Apr 30 • 26

UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Paper • 2504.20734 • Published Apr 29 • 62

YoChameleon: Personalized Vision and Language Generation

Paper • 2504.20998 • Published Apr 29 • 11

Dynamic Camera Poses and Where to Find Them

Paper • 2504.17788 • Published Apr 24 • 5

Distilling semantically aware orders for autoregressive image generation

Paper • 2504.17069 • Published Apr 23 • 6

ViSMaP: Unsupervised Hour-long Video Summarisation by Meta-Prompting

Paper • 2504.15921 • Published Apr 22 • 7

IberBench: LLM Evaluation on Iberian Languages

Paper • 2504.16921 • Published Apr 23 • 8