treasure4l committed on
Commit
64b174c
·
verified ·
1 Parent(s): fb37779

Update README.md

Files changed (1)
  1. README.md +6 -14
README.md CHANGED
@@ -1,12 +1,14 @@
1
  ---
2
  base_model: unsloth/llama-3.2-3b-instruct-bnb-4bit
3
- library_name: peft
 
4
  ---
5
 
6
  # Model Card for Model ID
7
 
8
  <!-- Provide a quick summary of what the model is/does. -->
9
-
 
10
 
11
 
12
  ## Model Details
@@ -17,21 +19,11 @@ library_name: peft
17
 
18
 
19
 
20
- - **Developed by:** [More Information Needed]
21
- - **Funded by [optional]:** [More Information Needed]
22
- - **Shared by [optional]:** [More Information Needed]
23
  - **Model type:** [More Information Needed]
24
  - **Language(s) (NLP):** [More Information Needed]
25
  - **License:** [More Information Needed]
26
- - **Finetuned from model [optional]:** [More Information Needed]
27
-
28
- ### Model Sources [optional]
29
-
30
- <!-- Provide the basic links for the model. -->
31
-
32
- - **Repository:** [More Information Needed]
33
- - **Paper [optional]:** [More Information Needed]
34
- - **Demo [optional]:** [More Information Needed]
35
 
36
  ## Uses
37
 
 
1
  ---
2
  base_model: unsloth/llama-3.2-3b-instruct-bnb-4bit
3
+ datasets:
4
+ - trl-lib/ultrafeedback_binarized
5
  ---
6
 
7
  # Model Card for Model ID
8
 
9
  <!-- Provide a quick summary of what the model is/does. -->
10
+ The Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained and instruction-tuned generative models in 1B and 3B sizes (text in/text out). The Llama 3.2 instruction-tuned text only models are optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks. They outperform many of the available open source and closed chat models on common industry benchmarks.
11
+ This is a version of the model that has undergone Direct Preference Optimization (DPO) training using the ultrafeedback dataset.
12
 
13
 
14
  ## Model Details
 
19
 
20
 
21
 
22
+ - **Developed by:** Treasure Mayowa
 
 
23
  - **Model type:** [More Information Needed]
24
  - **Language(s) (NLP):** [More Information Needed]
25
  - **License:** [More Information Needed]
26
+ - **Finetuned from model [optional]:** Llama 3.2 Instruct
 
 
 
 
 
 
 
 
27
 
28
  ## Uses
29