Upload README.md
README.md CHANGED
@@ -18,7 +18,7 @@ By accessing this model, you are agreeing to the Llama 2 terms and conditions of

## Usage:

-CodeLlama-13B-QML is a medium-sized Language Model that requires significant computing resources to perform with inference (response) times suitable for automatic code completion. Therefore,
+CodeLlama-13B-QML is a medium-sized Language Model that requires significant computing resources to perform with inference (response) times suitable for automatic code completion. Therefore, it should be used with a GPU accelerator, either in a cloud environment such as AWS, Google Cloud, or Microsoft Azure, or locally.

Large Language Models, including CodeLlama-13B-QML, are not designed to be deployed in isolation but instead should be deployed as part of an overall AI system with additional safety guardrails as required. Developers are expected to deploy system safeguards when building AI systems.

@@ -31,8 +31,6 @@ The configuration has been thoroughly tested on Ubuntu 22.04 LTS running NVIDIA

## How to run CodeLlama-13B-QML in ollama:

-Note: These instructions are written for ollama version 0.5.7. Be aware that your computer needs significant computing resources for reasonable inference times.
-
#### 1. Install ollama
https://ollama.com/download
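As context for the installation step referenced in this diff, the sketch below shows one way to request a completion from a locally running ollama server through its HTTP generate endpoint. It is a minimal illustration, not part of the README change: the model name `codellama-13b-qml` and the prompt are placeholder assumptions, and the model card's own prompt format and setup steps should be followed in practice.

```python
# Minimal sketch: query a local ollama server for a code completion.
# Assumes ollama is installed and serving on its default port (11434),
# and that the model has been created locally under a name such as
# "codellama-13b-qml" (placeholder -- use the name given in the model card).
import json
import urllib.request

def complete(prompt: str, model: str = "codellama-13b-qml") -> str:
    """Send a single, non-streaming generate request to ollama."""
    payload = json.dumps({
        "model": model,
        "prompt": prompt,
        "stream": False,  # return one JSON object instead of a token stream
    }).encode("utf-8")
    req = urllib.request.Request(
        "http://localhost:11434/api/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

if __name__ == "__main__":
    # Placeholder prompt; the actual fill-in-the-middle prompt format
    # is described in the model card, not in this diff.
    print(complete("import QtQuick\n\nRectangle {\n"))
```

The sketch uses only the standard library and a non-streaming request to keep the example short; streaming responses or an ollama client library could be substituted where lower-latency completions are needed.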