This demo page is for the paper Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription
Source code: https://github.com/sony/timbre-trap
Timbre-Trap reconstructs the CQT coefficients when the latent space switch is in the "on" state (Reconstruction). It removes timbre from the audio clips when the latent space switch is in the "off" state (Sonification).
With skip-connections, Timbre-Trap reconstructs CQT coefficients very close to those of the original audio, resulting in a very high signal-to-distortion ratio (SDR).
However, its sonification retains more timbre information than that of Timbre-Trap without skip-connections, resulting in a lower F1-score.
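For readers unfamiliar with the two metrics mentioned above, the following is a minimal sketch of how an SDR and an F1-score can be computed. It uses the basic textbook definitions and is not taken from the Timbre-Trap code base. The function names `sdr_db` and `f1_score` are hypothetical, and the exact SDR variant and F1 matching criteria used in the paper are not stated on this page, so they may differ.

```python
import math


def sdr_db(reference, estimate):
    """Basic signal-to-distortion ratio in dB (assumed textbook definition):
    10 * log10(||s||^2 / ||s - s_hat||^2)."""
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")
    signal_energy = sum(s * s for s in reference)
    error_energy = sum((s - e) ** 2 for s, e in zip(reference, estimate))
    if error_energy == 0:
        return math.inf  # perfect reconstruction
    if signal_energy == 0:
        return -math.inf  # silent reference, non-silent error
    return 10.0 * math.log10(signal_energy / error_energy)


def f1_score(true_positives, false_positives, false_negatives):
    """F1-score: harmonic mean of precision and recall, from raw counts."""
    predicted = true_positives + false_positives
    actual = true_positives + false_negatives
    if predicted == 0 or actual == 0:
        return 0.0
    precision = true_positives / predicted
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)
```

A reconstruction that stays close to the reference gives a small error energy and therefore a high SDR. A transcription with extra activations, for example from leftover timbre such as harmonics, adds false positives, which lowers precision and therefore the F1-score.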
The reconstruction and transcription produced by Timbre-Trap with and without skip-connections