AudioX: Diffusion Transformer for Anything-to-Audio Generation Paper โข 2503.10522 โข Published Mar 13 โข 27
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model Paper โข 2305.06908 โข Published May 11, 2023 โข 6