About Fine-tuning

by CS437help - opened Mar 10

Discussion

CS437help

Mar 10

Great work! Thanks a lot, I really liked the model. Do you have a notebook for fine-tuning it further?
Thanks in advance.

gpengzhi

Mar 17

Thank you for your interest!

We plan to set up a GitHub repo to include all training and evaluating scripts.

Gemini

Mar 19

I am looking forward to your work. I want to try it myself. If I use SFT, is the data format like this?
{
"instruction": "Translate this from Chinese to Korean:\nChinese: ",
"input": "{value}\n",
"output": "Korean:"
},

ModelSpace

Owner Mar 19

The SFT data format is:
{
"instruction": "Translate this from Chinese to Korean:",
"input": "Chinese: {Chinese_sentence}\nKorean:",
"output": "{Korean_sentence}"
},
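
For anyone preparing their own data, here is a minimal Python sketch that builds records in the format above. The helper name `make_sft_record`, the sample sentence pairs, and the idea that other language pairs follow the same pattern are assumptions made for illustration, not something confirmed in this thread.

```python
import json


def make_sft_record(src_lang, tgt_lang, src_text, tgt_text):
    """Build one SFT record in the instruction/input/output layout shown above.

    Hypothetical helper: only the Chinese-to-Korean layout is confirmed in
    this thread; reusing it for other language pairs is an assumption.
    """
    return {
        "instruction": f"Translate this from {src_lang} to {tgt_lang}:",
        "input": f"{src_lang}: {src_text}\n{tgt_lang}:",
        "output": tgt_text,
    }


# Placeholder sentence pairs, for illustration only.
pairs = [("你好", "안녕하세요"), ("谢谢", "감사합니다")]
records = [make_sft_record("Chinese", "Korean", zh, ko) for zh, ko in pairs]

# ensure_ascii=False keeps the Chinese and Korean text readable in the output.
print(json.dumps(records, ensure_ascii=False, indent=2))
```

Note that the source sentence goes in "input" together with the trailing "Korean:" cue, and "output" holds only the translation.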

loscheris

Mar 19

Thanks for sharing the model, the translation quality is incredible! I'm eager to finetune and test on my custom data. Is there an expected date when you release the Git repo and the training script? Thanks!

gpengzhi

Mar 19

Thanks for your interest! The GitHub repo should be ready next week.

Gemini

Mar 20

Thanks for your reply, looking forward to your work!

Thanks for your reply, looking forward to your GitHub repo!

EkmekE

Mar 20

Thanks for the model. I tried it too and it is really good. As others said, a GitHub repo would be really helpful. I was trying to fine-tune the model with some instructions, but in some cases it translates the instruction itself. I tried Unsloth for fine-tuning, but there seems to be an incompatibility with the framework because of the chat template (the current chat template doesn't have add_generation_prompt). I'm excited about the repo too :).
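
A quick way to check for the chat template issue described here is to look for the `add_generation_prompt` variable in the template text. The sketch below is only a rough string check with a hypothetical helper name and made-up example templates; in practice the template string would presumably come from the tokenizer (for example its `chat_template` attribute in Hugging Face transformers), which is an assumption rather than something verified against this model.

```python
def references_generation_prompt(chat_template: str) -> bool:
    """Return True if the template text mentions add_generation_prompt.

    This is a plain substring check, not a Jinja parse, so it only tells
    you whether the variable appears anywhere in the template.
    """
    return "add_generation_prompt" in chat_template


# Hypothetical templates for illustration; not the model's real template.
template_with_flag = (
    "{% for m in messages %}{{ m['content'] }}{% endfor %}"
    "{% if add_generation_prompt %}ASSISTANT: {% endif %}"
)
template_without_flag = "{% for m in messages %}{{ m['content'] }}{% endfor %}"

for name, template in [
    ("with flag", template_with_flag),
    ("without flag", template_without_flag),
]:
    print(name, "->", references_generation_prompt(template))
```

If the check returns False, a fine-tuning framework that relies on that variable may not be able to add the assistant turn marker on its own.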

gpengzhi

Mar 21

@loscheris https://github.com/xiaomi-research/gemmax

gpengzhi

Mar 21

@EkmekE https://github.com/xiaomi-research/gemmax
