OpenNLPLab123's Collections

TransNormerLLM

updated Jun 25, 2024

TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer

  • OpenNLPLab/TransNormerLLM3-15B-Intermediate-Checkpoints

    Text Generation • Updated Apr 7, 2024 • 69 • 15

  • OpenNLPLab/TransNormerLLM2-1B-300B

    Text Generation • Updated Feb 26, 2024 • 32 • 3

  • OpenNLPLab/TransNormerLLM2-3B-300B

    Text Generation • Updated Feb 26, 2024 • 10 • 3

  • OpenNLPLab/TransNormerLLM2-7B-300B

    Text Generation • Updated Feb 26, 2024 • 10 • 4

  • OpenNLPLab/TransNormerLLM-385M

    Text Generation • Updated Feb 26, 2024 • 46 • 9

  • OpenNLPLab/TransNormerLLM-1B

    Text Generation • Updated Feb 26, 2024 • 19 • 12

  • OpenNLPLab/TransNormerLLM-7B

    Text Generation • Updated Feb 26, 2024 • 23 • 17

  • Scaling TransNormer to 175 Billion Parameters

    Paper • 2307.14995 • Published Jul 27, 2023 • 22

  • Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

    Paper • 2401.04658 • Published Jan 9, 2024 • 28

  • Linear Attention Sequence Parallelism

    Paper • 2404.02882 • Published Apr 3, 2024 • 3

  • Scaling Laws for Linear Complexity Language Models

    Paper • 2406.16690 • Published Jun 24, 2024 • 23