-
Long-form music generation with latent diffusion
Paper • 2404.10301 • Published • 28 -
Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis
Paper • 2409.08628 • Published -
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis
Paper • 2402.01753 • Published -
Apollo: Band-sequence Modeling for High-Quality Audio Restoration
Paper • 2409.08514 • Published • 12
Collections
Discover the best community collections!
Collections including paper arxiv:2404.10301
-
Long-form music generation with latent diffusion
Paper • 2404.10301 • Published • 28 -
Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis
Paper • 2409.08628 • Published -
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis
Paper • 2402.01753 • Published -
Apollo: Band-sequence Modeling for High-Quality Audio Restoration
Paper • 2409.08514 • Published • 12