Models for LaaS - an alzhang Collection

updated Jan 27, 2024

A collection of models that we are interested in running, categorized as: (1) text-generation models for inference, and (2) smaller models that we want to fine-tune (FT).

TinyLlama/TinyLlama-1.1B-Chat-v1.0

Text Generation • 1B • Updated Mar 17, 2024 • 1.49M • 1.37k
tiiuae/falcon-7b-instruct

Text Generation • 7B • Updated Oct 12, 2024 • 142k • 1k
mistralai/Mistral-7B-Instruct-v0.2

Text Generation • 7B • Updated 27 days ago • 689k • 2.92k
meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 860 • 4.38k
microsoft/phi-2

Text Generation • 3B • Updated Apr 29, 2024 • 738k • 3.38k
google-t5/t5-small

Translation • 0.1B • Updated Jun 30, 2023 • 2.79M • 480
distilbert/distilgpt2

Text Generation • 0.1B • Updated Feb 19, 2024 • 3.61M • 562

Note: Text generation models.
google-bert/bert-base-uncased

Fill-Mask • 0.1B • Updated Feb 19, 2024 • 52.6M • 2.38k
prajjwal1/bert-tiny

Updated Oct 27, 2021 • 4.07M • 124

Note: BERT models, generally intended for fine-tuning. At inference time they are just the base encoder models and only do masked language modeling (MLM).
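To make the MLM point concrete, here is a minimal sketch of what a base BERT checkpoint does at inference: it fills in a masked token and nothing more. It assumes the `transformers` library and a backend such as PyTorch are installed. The helper names `mask_word`, `top_predictions`, and `demo` are hypothetical, chosen only for this illustration.

```python
MODEL_ID = "google-bert/bert-base-uncased"  # taken from the list above


def mask_word(sentence: str, word: str, mask_token: str = "[MASK]") -> str:
    """Replace the first occurrence of `word` with the mask token."""
    if word not in sentence:
        raise ValueError(f"{word!r} does not occur in the sentence")
    return sentence.replace(word, mask_token, 1)


def top_predictions(prompt: str, model_id: str = MODEL_ID, k: int = 3):
    """Return the k most likely fillers for the masked position as (token, score) pairs."""
    # Imported lazily so mask_word stays usable without transformers installed.
    from transformers import pipeline

    unmasker = pipeline("fill-mask", model=model_id)
    return [(r["token_str"], r["score"]) for r in unmasker(prompt, top_k=k)]


def demo() -> None:
    """Download the model (assumption: network access) and print the top fillers."""
    prompt = mask_word("Paris is the capital of France.", "capital")
    for token, score in top_predictions(prompt):
        print(f"{token}\t{score:.3f}")
```

Calling `demo()` downloads the checkpoint on first use. Anything beyond mask filling (classification, QA, and so on) would need a task head and fine-tuning, which is why these models are grouped separately from the text-generation ones.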
EfficientNetV2: Smaller Models and Faster Training

Paper • 2104.00298 • Published Apr 1, 2021 • 1

Note: Replace with the relevant models later. Ideally want: EfficientNet (v1, v2), MobileNet, YOLO (v_x), ResNet variants, FPN, Diffusion: SAM (?) or SEEM