Update README.md
README.md
CHANGED
@@ -1,12 +1,14 @@
 ---
 base_model: unsloth/llama-3.2-3b-instruct-bnb-4bit
-
+datasets:
+- trl-lib/ultrafeedback_binarized
 ---
 
 # Model Card for Model ID
 
 <!-- Provide a quick summary of what the model is/does. -->
-
+The Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out). The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks. They outperform many of the available open source and closed chat models on common industry benchmarks.
+This is a version of the model that has undergone Direct Preference Optimization (DPO) training using the ultrafeedback dataset.
 
 
 ## Model Details
@@ -17,21 +19,11 @@ library_name: peft
 
 
 
-- **Developed by:**
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
+- **Developed by:** Treasure Mayowa
 - **Model type:** [More Information Needed]
 - **Language(s) (NLP):** [More Information Needed]
 - **License:** [More Information Needed]
-- **Finetuned from model [optional]:**
-
-### Model Sources [optional]
-
-<!-- Provide the basic links for the model. -->
-
-- **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
-- **Demo [optional]:** [More Information Needed]
+- **Finetuned from model [optional]:** Llama 3.2 Instruct
 
 ## Uses
 
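
The updated card says the model underwent Direct Preference Optimization (DPO) on the ultrafeedback dataset, but the commit does not include any training code. As a rough illustration of what that objective computes for one chosen/rejected pair, here is a minimal pure-Python sketch of the standard DPO loss. The function name `dpo_loss`, its argument names, and the `beta=0.1` value are assumptions made for this example and are not taken from the repository.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for a single preference pair (illustrative sketch).

    Each argument is the summed log-probability of a full response
    under either the policy being trained or the frozen reference model.
    beta is an assumed example value, not one read from this repository.
    """
    # How much more the policy likes each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp

    # Scaled preference gap between the chosen and the rejected response.
    z = beta * (chosen_margin - rejected_margin)

    # -log(sigmoid(z)) == log(1 + exp(-z)), evaluated in a stable form.
    if z >= 0:
        return math.log1p(math.exp(-z))
    return -z + math.log1p(math.exp(z))


# Hypothetical log-probabilities: the policy favors the chosen response
# more than the reference does, so the loss falls below log(2).
loss = dpo_loss(-10.0, -12.0, -11.0, -11.0)
print(loss < math.log(2))
```

When the policy and reference agree on both responses, the gap is zero and the loss equals log(2); training lowers it by widening the gap in favor of the chosen response.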