I. Deep Generative Modeling
SQ-VAE

[PMLR] [arXiv] [code]
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization (ICML2022)
FP-Diffusion

[arXiv]
Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation (previous work was to NeurIPS2022 Workshop on Score-Based Methods)
GibbsDDRM

[arXiv]
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Linear Inverse Problems with Denoising Diffusion Restoration
Adversarially Slicing Generative Networks

[arXiv]
Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport
II. Music and Cinematic Technologies
CLIPSep

[OpenReview]
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos (ICLR2023)
Vocal Dereverberation

[arXiv] [demo]
Unsupervised Vocal Dereverberation with Diffusion-based Generative Models (ICASSP23)
Mixing Style Transfer

[arXiv] [code] [demo]
Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects (ICASSP23)
Music Transcription

[arXiv] [code] [demo]
DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability (ICASSP23)
Singing Voice Vocoder

[arXiv] [demo]
Hierarchical Diffusion Models for Singing Voice Neural Vocoder (ICASSP23)
Distortion Effect Removal

[poster] [arXiv] [demo]
Distortion Audio Effects: Learning How to Recover the Clean Signal (ISMIR22)
Automatic Music Mixing

[poster] [arXiv] [code] [demo]
Automatic Music Mixing with Deep Learning and Out-of-Domain Data (ISMIR22)
Automatic DJ Transition

[arXiv] [code] [demo]
Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks (ICASSP22)
Sound Separation

[video] [site]
Glenn Gould and Kanji Ishimaru 2021: A collaboration with AI Sound Separation after 60 years
DCASE Challenge

[DCASE Challenge2023]
Sound Event Localization and Detection Evaluated in Real Spatial Sound Scenes
Sound Event Localization and Detection

[arXiv]
Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training (ICASSP2022)
Automatic Music Tagging

[arXiv]
AN ATTENTION-BASED APPROACH TO HIERARCHICAL MULTI-LABEL MUSIC INSTRUMENT CLASSIFICATION (ICASSP2023)