sachen committed on
Commit 3f16eeb · verified · 1 Parent(s): 9148e46

End of training

Files changed (2)
  1. README.md +2 -2
  2. training_args.bin +2 -2
README.md CHANGED
@@ -7,14 +7,14 @@ tags:
  - dpo
  - generated_from_trainer
  model-index:
- - name: meta-llama/Llama-3.2-1B-Instruct
+ - name: hw1-329x-toy-dpo
  results: []
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->
 
- # meta-llama/Llama-3.2-1B-Instruct
+ # hw1-329x-toy-dpo
 
  This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on the None dataset.
 
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:55c7540817ce0bb29607f6e2ef0dc52583c3f4be13ebeb8b57f972a77e80b37b
- size 5112
+ oid sha256:6a0be7052098d65322b117e6ee73c921a4dcf67be9ef4538e6cbef3c534b3b23
+ size 5176