AMAAI Lab

university

https://dorienherremans.com

dorienherremans

amaai-lab

AI & ML interests

Audio, Music, and AI

Recent Activity

dorienh authored a paper 3 days ago

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

dorienh authored a paper 3 days ago

Mustango: Toward Controllable Text-to-Music Generation

dorienh authored a paper 3 days ago

MidiCaps -- A large-scale MIDI dataset with text captions

amaai-lab's activity

dorienh authored 20 papers 3 days ago

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

Paper • 2311.00968 • Published Nov 2, 2023

Mustango: Toward Controllable Text-to-Music Generation

Paper • 2311.08355 • Published Nov 14, 2023

MidiCaps -- A large-scale MIDI dataset with text captions

Paper • 2406.02255 • Published Jun 4, 2024 • 1

DisfluencySpeech -- Single-Speaker Conversational Speech Dataset with Paralanguage

Paper • 2406.08820 • Published Jun 13, 2024

Text2midi: Generating Symbolic Music from Captions

Paper • 2412.16526 • Published Dec 21, 2024 • 2

Towards Unified Music Emotion Recognition across Dimensional and Categorical Models

Paper • 2502.03979 • Published Feb 6 • 1

JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata

Paper • 2502.07461 • Published Feb 11 • 1

MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

Paper • 2505.20979 • Published 10 days ago

Text2midi-InferAlign: Improving Symbolic Music Generation with Inference-Time Alignment

Paper • 2505.12669 • Published 19 days ago • 1

ImprovNet -- Generating Controllable Musical Improvisations with Iterative Corruption Refinement

Paper • 2502.04522 • Published Feb 6 • 1

MIRFLEX: Music Information Retrieval Feature Library for Extraction

Paper • 2411.00469 • Published Nov 1, 2024 • 1

Are We There Yet? A Brief Survey of Music Emotion Prediction Datasets, Models and Outstanding Challenges

Paper • 2406.08809 • Published Jun 13, 2024 • 1

DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech

Paper • 2410.13342 • Published Oct 17, 2024 • 1

Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction

Paper • 2410.11522 • Published Oct 15, 2024 • 1

Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training

Paper • 2406.01018 • Published Jun 3, 2024 • 1

BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features

Paper • 2407.10462 • Published Jul 15, 2024 • 1

DeepUnifiedMom: Unified Time-series Momentum Portfolio Construction via Multi-Task Learning with Multi-Gate Mixture of Experts

Paper • 2406.08742 • Published Jun 13, 2024 • 1

Team members: 11
