---
title: MiniMax Speech Tech Report
emoji: 🎙️
colorFrom: indigo
colorTo: green
sdk: static
pinned: false
---
Here are our latest tech reports:
Citation
@misc{minimax2025minimaxspeechintrinsiczeroshottexttospeech,
title={MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder},
author={Bowen Zhang and Congchao Guo and Geng Yang and Hang Yu and Haozhe Zhang and Heidi Lei and Jialong Mai and Junjie Yan and Kaiyue Yang and Mingqi Yang and Peikai Huang and Ruiyang Jin and Sitan Jiang and Weihua Cheng and Yawei Li and Yichen Xiao and Yiying Zhou and Yongmao Zhang and Yuan Lu and Yucen He},
year={2025},
eprint={2505.07916},
archivePrefix={arXiv},
primaryClass={eess.AS},
url={https://arxiv.org/abs/2505.07916},
}