Skip to the content.

I. Deep Generative Modeling


SQ-VAE

[PMLR] [arXiv] [code]

SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization (ICML2022)

FP-Diffusion

[arXiv]

Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation (previous work was to NeurIPS2022 Workshop on Score-Based Methods)

GibbsDDRM

[arXiv]

GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Linear Inverse Problems with Denoising Diffusion Restoration

Adversarially Slicing Generative Networks

[arXiv]

Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport

II. Music and Cinematic Technologies


CLIPSep

[OpenReview]

CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos (ICLR2023)

Vocal Dereverberation

[arXiv] [demo]

Unsupervised Vocal Dereverberation with Diffusion-based Generative Models (ICASSP23)

Mixing Style Transfer

[arXiv] [code] [demo]

Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects (ICASSP23)

Music Transcription

[arXiv] [code] [demo]

DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability (ICASSP23)

Singing Voice Vocoder

[arXiv] [demo]

Hierarchical Diffusion Models for Singing Voice Neural Vocoder (ICASSP23)

Distortion Effect Removal

[poster] [arXiv] [demo]

Distortion Audio Effects: Learning How to Recover the Clean Signal (ISMIR22)

Automatic Music Mixing

[poster] [arXiv] [code] [demo]

Automatic Music Mixing with Deep Learning and Out-of-Domain Data (ISMIR22)

Sound Separation

[IEEE]

Music Source Separation with Deep Equilibrium Models (ICASSP22)

Automatic DJ Transition

[arXiv] [code] [demo]

Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks (ICASSP22)

Singing Voice Conversion

[arXiv] [demo]

Robust One-Shot Singing Voice Conversion

Sound Separation

[video] [site]

Glenn Gould and Kanji Ishimaru 2021: A collaboration with AI Sound Separation after 60 years

MDX21

[site] [frontiers]

Music Demixing Challenge 2021

DCASE Challenge

[DCASE Challenge2023]

Sound Event Localization and Detection Evaluated in Real Spatial Sound Scenes

Sound Event Localization and Detection

[arXiv]

Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training (ICASSP2022)

Automatic Music Tagging

[arXiv]

AN ATTENTION-BASED APPROACH TO HIERARCHICAL MULTI-LABEL MUSIC INSTRUMENT CLASSIFICATION (ICASSP2023)

Contact

Yuki Mitsufuji (yuhki.mitsufuji@sony.com)