jukofyork lbourdois committed on
Commit 52947dc · verified · 1 Parent(s): 5b876a9

Improve language tag (#1)

- Improve language tag (7446d215e0b8a502408a74f6b0ffed83523a7506)


Co-authored-by: Loïck BOURDOIS <lbourdois@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +28 -14
README.md CHANGED
@@ -1,15 +1,29 @@
- ---
- license: apache-2.0
- base_model:
- - Qwen/Qwen2.5-0.5B-Instruct
- ---
-
- **NOTE**: See [here](https://huggingface.co/jukofyork/DeepSeek-R1-DRAFT-0.5B-GGUF/discussions/1#67e14af77a8626d6e085aaa0) for update on the version with ~3B tokens of fine-tuning applied.
-
- ---
-
- A 0.5B parameter draft model for speculative sampling for use with [deepseek-ai/DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1) created from [alamios/DeepSeek-R1-DRAFT-Qwen2.5-0.5B](https://huggingface.co/alamios/DeepSeek-R1-DRAFT-Qwen2.5-0.5B) using [transplant-vocab](https://github.com/jukofyork/transplant-vocab).
-
- **NOTE**: This is a draft model for the full-sized `DeepSeek-R1` model and not the smaller "distilled" models!
-
+ ---
+ license: apache-2.0
+ base_model:
+ - Qwen/Qwen2.5-0.5B-Instruct
+ language:
+ - zho
+ - eng
+ - fra
+ - spa
+ - por
+ - deu
+ - ita
+ - rus
+ - jpn
+ - kor
+ - vie
+ - tha
+ - ara
+ ---
+
+ **NOTE**: See [here](https://huggingface.co/jukofyork/DeepSeek-R1-DRAFT-0.5B-GGUF/discussions/1#67e14af77a8626d6e085aaa0) for update on the version with ~3B tokens of fine-tuning applied.
+
+ ---
+
+ A 0.5B parameter draft model for speculative sampling for use with [deepseek-ai/DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1) created from [alamios/DeepSeek-R1-DRAFT-Qwen2.5-0.5B](https://huggingface.co/alamios/DeepSeek-R1-DRAFT-Qwen2.5-0.5B) using [transplant-vocab](https://github.com/jukofyork/transplant-vocab).
+
+ **NOTE**: This is a draft model for the full-sized `DeepSeek-R1` model and not the smaller "distilled" models!
+
  See [jukofyork/DeepSeek-R1-DRAFT-0.5B-GGUF](https://huggingface.co/jukofyork/DeepSeek-R1-DRAFT-0.5B-GGUF) for a `F16` GGUF ready for use with `llama.cpp`.
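
The only substantive change in this commit is the new `language:` list in the README front matter. As a rough way to check such a change, the sketch below reads a list field out of the front matter and confirms that each entry is a three-letter lowercase code. It is a minimal illustration, not the Hub's own parser: `read_list_field` is a hypothetical helper, it assumes the flat `key:` / `- item` layout shown in the diff, and it avoids a YAML library so that it runs on the standard library alone.

```python
# Front matter copied from the "+" side of the diff above.
FRONT_MATTER = """\
---
license: apache-2.0
base_model:
- Qwen/Qwen2.5-0.5B-Instruct
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
---
"""


def read_list_field(text, field):
    """Return the '- item' entries listed under `field:` in the front matter.

    Hypothetical helper: handles only the simple flat layout used in this
    README, not general YAML.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no front matter block at the top of the file
    items = []
    in_field = False
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            break  # closing delimiter of the front matter
        if stripped.startswith("- "):
            if in_field:
                items.append(stripped[2:].strip())
        else:
            # Any non-item line starts a new key; track whether it is ours.
            in_field = stripped == f"{field}:"
    return items


languages = read_list_field(FRONT_MATTER, "language")
# Collect entries that are not three lowercase ASCII letters.
malformed = [c for c in languages if not (len(c) == 3 and c.isascii() and c.islower() and c.isalpha())]
print(len(languages), "language codes;", len(malformed), "malformed")
```

The same helper can be pointed at `base_model` to confirm that the existing metadata survived the edit unchanged.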