Aritra Roy Gosthipaty PRO

ariG23498

AI & ML interests

Deep Representation Learning

Recent Activity

upvoted a collection about 12 hours ago
Qwen3-Embedding
updated a dataset about 12 hours ago
model-metadata/trending_models
commented on their article about 15 hours ago
KV Cache from scratch in nanoVLM

Organizations

Hugging Face, Google, Notebooks-explorers, 🧨Diffusers, PyTorch Image Models, Keras, Cohere Labs, Hugging Test Lab, Hugging Face Fellows, Probing ViTs, TrystAI, PyImageSearch, Keras Dreambooth Event, Hugging Face OSS Metrics, Blog-explorers, ZeroGPU Explorers, kotol, gg-hf, MLX Community, IBM Granite, Open Generative Fill, Social Post Explorers, Hugging Face Discord Community, nltpt, nltpt-q, qrias, Hugging Face Science, open/ acc, wut?, LLM from Scratch, s0225, gg-hf-g, llrehf, University of Science and Technology of China, Model Metadata, all things vision LMs

ariG23498's activity

commented on KV Cache from scratch in nanoVLM about 15 hours ago
commented on KV Cache from scratch in nanoVLM 2 days ago
posted an update 2 days ago
🚨 Implement KV Cache from scratch in pure PyTorch. 🚨

We have documented everything we learned while implementing KV Cache in nanoVLM. Joint work with @kashif @lusxvr @andito @pcuenq

Blog: hf.co/blog/kv-cache
upvoted an article 2 days ago
reacted to danielhanchen's post with 🔥 3 days ago
upvoted an article 3 days ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By danaaubakirova and 8 others
93
published an article 3 days ago
published an article 4 days ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By danaaubakirova and 8 others
93