ggerganov committed on
Commit 498da4a · verified · 1 Parent(s): 36a6107

readme : update usage

Files changed (1)
  1. README.md +1 -37
README.md CHANGED
@@ -118,41 +118,5 @@ This model was converted to GGUF format from [`nomic-ai/nomic-embed-text-v2-moe`
 Refer to the [original model card](https://huggingface.co/nomic-ai/nomic-embed-text-v2-moe) for more details on the model.
 
 ## Use with llama.cpp
-Install llama.cpp through brew (works on Mac and Linux)
 
-```bash
-brew install llama.cpp
-
-```
-Invoke the llama.cpp server or the CLI.
-
-### CLI:
-```bash
-llama-cli --hf-repo ggml-org/Nomic-Embed-Text-V2-GGUF -p "The meaning to life and the universe is"
-```
-
-### Server:
-```bash
-llama-server --hf-repo ggml-org/Nomic-Embed-Text-V2-GGUF -c 2048
-```
-
-Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggml-org/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
-
-Step 1: Clone llama.cpp from GitHub.
-```
-git clone https://github.com/ggml-org/llama.cpp
-```
-
-Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
-```
-cd llama.cpp && LLAMA_CURL=1 make
-```
-
-Step 3: Run inference through the main binary.
-```
-./llama-cli --hf-repo ggml-org/Nomic-Embed-Text-V2-GGUF -p "The meaning to life and the universe is"
-```
-or
-```
-./llama-server --hf-repo ggml-org/Nomic-Embed-Text-V2-GGUF -c 2048
-```
+See https://github.com/ggml-org/llama.cpp/pull/12466#pullrequestreview-2799984859