Article KV Cache from scratch in nanoVLM By ariG23498 and 4 others • 3 days ago • 55
Wan2.1 14B T2V LoRAs Collection A collection of Remade's Wan2.1 14B T2V LoRAs • 20 items • Updated Mar 27 • 25
Wan2.1 14B 480p I2V LoRAs Collection A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 49 items • Updated 13 days ago • 167
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 4 items • Updated 8 days ago • 152
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space Paper • 2505.13308 • Published 18 days ago • 26
Article Mixture of Experts Explained By osanseviero and 5 others • Dec 11, 2023 • 666