File size: 825 Bytes
e877412
31600f6
 
e877412
 
 
 
 
 
a707aa1
 
 
67e8274
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
---
title: MiniMax Speech Tech Report
emoji: 🎙️
colorFrom: indigo
colorTo: green
sdk: static
pinned: false
---

Here are our latest tech reports:

- [MiniMax Speech Tech Report](https://minimax-ai.github.io/tts_tech_report/)

Citation

```
@misc{minimax2025minimaxspeechintrinsiczeroshottexttospeech,
      title={MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder}, 
      author={Bowen Zhang, Congchao Guo, Geng Yang, Hang Yu, Haozhe Zhang, Heidi Lei, Jialong Mai, Junjie Yan, Kaiyue Yang, Mingqi Yang, Peikai Huang, Ruiyang Jin, Sitan Jiang, Weihua Cheng, Yawei Li, Yichen Xiao, Yiying Zhou, Yongmao Zhang, Yuan Lu, Yucen He},
      year={2025},
      eprint={2505.07916},
      archivePrefix={arXiv},
      primaryClass={eess.AS},
      url={https://arxiv.org/abs/2505.07916}, 
}
```