Sayak Paul's picture

Sayak Paul

sayakpaul

·

https://sayak.dev

AI & ML interests

Diffusion models, representation learning

Recent Activity

updated a dataset about 1 hour ago

diffusers/benchmarks

liked a dataset 2 days ago

Rapidata/text-2-video-human-preferences-veo3

liked a Space 2 days ago

chansung/auto-diffuser-config

View all activity

Organizations

sayakpaul's activity

upvoted an article 13 days ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

By

•

Feb 14, 2020

• 37

upvoted an article 16 days ago

Article

Exploring Quantization Backends in Diffusers

By

and 2 others •

17 days ago

• 32

upvoted an article about 1 month ago

Article

Welcoming Llama Guard 4 on Hugging Face Hub

By

and 3 others •

Apr 29

• 37

upvoted 2 articles about 2 months ago

Article

CinePile 2.0 - making stronger datasets with adversarial refinement

By

and 3 others •

Oct 23, 2024

• 16

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 144

upvoted 2 papers 2 months ago

A Refined Analysis of Massive Activations in LLMs

Paper • 2503.22329 • Published Mar 28 • 14

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published Apr 3 • 78

upvoted a collection 2 months ago

SANA-1.5

SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • 6 items • Updated Apr 17 • 6

upvoted 4 articles 3 months ago

Article

Don't repeat yourself - 🤗 Transformers Design Philosophy

By

•

Apr 5, 2022

• 34

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By

and 3 others •

Mar 12

• 425

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

By

and 2 others •

Jun 24, 2024

• 194

Article

Distilling from Dialogues: Finding Meaning in LLM Interactions

By

•

Feb 25

• 4

upvoted a collection 3 months ago

Remote VAE Inference Endpoints

Models and handler code used in https://huggingface.co/blog/remote_vae • 5 items • Updated Mar 10 • 5

upvoted an article 3 months ago

Article

Remote VAEs for decoding with HF endpoints 🤗

By

and 1 other •

Feb 24

• 39

upvoted an article 4 months ago

Article

SigLIP 2: A better multilingual vision language encoder

By

and 2 others •

Feb 21

• 165

upvoted a paper 4 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 144

upvoted a collection 4 months ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 7 days ago • 148

upvoted 3 articles 4 months ago

Article

Build awesome datasets for video generation

By

and 1 other •

Feb 12

• 33

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.25k

Article

The AI tools for Art Newsletter - Issue 1

By

and 1 other •

Jan 31

• 79