Untrained Musical Sound Source Separation
by Unsupervised Density Network

A clarinet plays a single scale:

The clarinet solo is combined with 14 different samples of naturalistic background sounds (like horns honking, phones ringing, traffic passing):

An untrained, unsupervised Density Network learns, recognizes and extracts the clarinet solo: