A non-negative framework for joint modeling of spectral structure and temporal dynamics in sound mixtures