Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model Paper • 2311.00968 • Published Nov 2, 2023
MidiCaps -- A large-scale MIDI dataset with text captions Paper • 2406.02255 • Published Jun 4, 2024 • 1
DisfluencySpeech -- Single-Speaker Conversational Speech Dataset with Paralanguage Paper • 2406.08820 • Published Jun 13, 2024
Towards Unified Music Emotion Recognition across Dimensional and Categorical Models Paper • 2502.03979 • Published Feb 6 • 1
JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata Paper • 2502.07461 • Published Feb 11 • 1
MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection Paper • 2505.20979 • Published 10 days ago
Text2midi-InferAlign: Improving Symbolic Music Generation with Inference-Time Alignment Paper • 2505.12669 • Published 19 days ago • 1
ImprovNet -- Generating Controllable Musical Improvisations with Iterative Corruption Refinement Paper • 2502.04522 • Published Feb 6 • 1
MIRFLEX: Music Information Retrieval Feature Library for Extraction Paper • 2411.00469 • Published Nov 1, 2024 • 1
Are We There Yet? A Brief Survey of Music Emotion Prediction Datasets, Models and Outstanding Challenges Paper • 2406.08809 • Published Jun 13, 2024 • 1
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech Paper • 2410.13342 • Published Oct 17, 2024 • 1
Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction Paper • 2410.11522 • Published Oct 15, 2024 • 1
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training Paper • 2406.01018 • Published Jun 3, 2024 • 1
Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder Paper • 2211.03316 • Published Nov 7, 2022
Prevailing Research Areas for Music AI in the Era of Foundation Models Paper • 2409.09378 • Published Sep 14, 2024 • 1
BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features Paper • 2407.10462 • Published Jul 15, 2024 • 1
DeepUnifiedMom: Unified Time-series Momentum Portfolio Construction via Multi-Task Learning with Multi-Gate Mixture of Experts Paper • 2406.08742 • Published Jun 13, 2024 • 1