š§"raw" pretrained smol_llama checkpoints - WIP š§
-
BEE-spoke-data/smol_llama-101M-GQA
Text Generation ⢠Updated ⢠905 ⢠28 -
BEE-spoke-data/smol_llama-81M-tied
Text Generation ⢠Updated ⢠97 ⢠6 -
BEE-spoke-data/smol_llama-220M-GQA
Text Generation ⢠Updated ⢠609 ⢠12 -
BEE-spoke-data/verysmol_llama-v11-KIx2
Text Generation ⢠Updated ⢠474 ⢠4