Model Card for f5-tts-hakka-finetune-with-word
Model Details
F5-TTS fine-tuned on all Formosan data (ithuan, fb ilrdf dict, klokah), including samples that consist of a single word, using IPA as input.
G2P comes from this repo.
Training Details
- learning rate: 0.00001
- batch size per gpu: 9511
- batch size type: frame
- max samples: 64
- grad accumulation steps: 1
- max grad norm: 1
- epochs: 210 (1254120 steps, current 324480)
- num warmup updates: 27040
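The step counts above imply how far training has progressed. The following is a minimal sketch of that arithmetic, assuming steps are spread evenly across epochs and that warmup updates are counted in the same units as steps (grad accumulation steps is 1). The function and variable names are illustrative and are not part of F5-TTS.

```python
# Values copied from the training details listed above.
TOTAL_EPOCHS = 210
TOTAL_STEPS = 1_254_120
CURRENT_STEP = 324_480
WARMUP_UPDATES = 27_040


def training_progress(total_epochs, total_steps, current_step, warmup_updates):
    """Derive progress figures from the step counts.

    Assumption: every epoch has the same number of steps, so
    steps_per_epoch = total_steps / total_epochs.
    """
    steps_per_epoch = total_steps / total_epochs
    return {
        "steps_per_epoch": steps_per_epoch,
        "epochs_completed": current_step / steps_per_epoch,
        "fraction_done": current_step / total_steps,
        "warmup_epochs": warmup_updates / steps_per_epoch,
    }


progress = training_progress(
    TOTAL_EPOCHS, TOTAL_STEPS, CURRENT_STEP, WARMUP_UPDATES
)

for name, value in progress.items():
    print(f"{name}: {value:.3f}")
```

Under these assumptions, the current checkpoint sits a little past epoch 54 of 210, roughly a quarter of the planned schedule, and warmup covers about the first 4.5 epochs.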
Model Sources
- Repository: https://github.com/SWivid/F5-TTS
- Paper: https://arxiv.org/abs/2410.06885
Uses
Please refer to the source repository (https://github.com/SWivid/F5-TTS).
Demo
Model tree for ithuan/f5-tts-formosan-all-finetune-with-word
Base model
SWivid/F5-TTS