Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
alzhang 's Collections
Models for LaaS

Models for LaaS

updated Jan 27, 2024

Collection of models that we are interested in running. Categorized by: (1) Text generation for inference, (2) Smaller models that we want to FT

Upvote
1

  • TinyLlama/TinyLlama-1.1B-Chat-v1.0

    Text Generation • 1B • Updated Mar 17, 2024 • 1.49M • • 1.37k

  • tiiuae/falcon-7b-instruct

    Text Generation • 7B • Updated Oct 12, 2024 • 142k • 1k

  • mistralai/Mistral-7B-Instruct-v0.2

    Text Generation • 7B • Updated 27 days ago • 689k • • 2.92k

  • meta-llama/Llama-2-7b

    Text Generation • Updated Apr 17, 2024 • 860 • 4.38k

  • microsoft/phi-2

    Text Generation • 3B • Updated Apr 29, 2024 • 738k • 3.38k

  • google-t5/t5-small

    Translation • 0.1B • Updated Jun 30, 2023 • 2.79M • • 480

  • distilbert/distilgpt2

    Text Generation • 0.1B • Updated Feb 19, 2024 • 3.61M • 562

    Note Text generation models.


  • google-bert/bert-base-uncased

    Fill-Mask • 0.1B • Updated Feb 19, 2024 • 52.6M • • 2.38k

  • prajjwal1/bert-tiny

    Updated Oct 27, 2021 • 4.07M • 124

    Note BERT models generally for fine-tuning. On inference, they are the base encoder models and only do MLM


  • EfficientNetV2: Smaller Models and Faster Training

    Paper • 2104.00298 • Published Apr 1, 2021 • 1

    Note Replace with the relevant models later. Ideally want: EfficientNet (v1,v2) MobileNet YOLO (v_x) Resnet variants FPN Diffusion: SAM (?) or SEEM

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs