Skip to content

Data related to ICASSP 2018 publication: "Nonnegative Tensor Factorization for Source Separation of Loops in Audio"

Notifications You must be signed in to change notification settings

jblsmith/icassp2018

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ICASSP 2018: Nonnegative Tensor Factorization for Source Separation of Loops in Audio

This site contains instructions on how to generate the ground truth data related to our ICASSP 2018 publication, "Nonnegative Tensor Factorization for Source Separation of Loops in Audio" (PDF), described in detail here.

@inproceedings{smith2018nonnegative,
Address = {Calgary, AB, Canada},
Author = {Smith, Jordan B. L. and Goto, Masataka},
Booktitle = {Proceedings of the {IEEE} International Conference on Acoustics, Speech and Signal Processing},
Pages = {171--175},
Title = {Nonnegative tensor factorization for source separation of loops in audio},
Year = {2018}}

Abstract

The prevalence of exact repetition in loop-based music makes it an opportune target for source separation. Nonnegative factorization approaches have been used to model the repetition of looped content, and kernel additive modeling has leveraged periodicity within a piece to separate looped background elements. We propose a novel method of leveraging periodicity in a factorization model: we treat the two-dimensional spectrogram as a three-dimensional tensor, and use nonnegative tensor factorization to estimate the component spectral templates, rhythms and loop recurrences in a single step. Testing our method on synthesized loop-based examples, we find that our algorithm mostly exceeds the performance of competing methods, with a reduction in execution cost. We discuss limitations of the algorithm as we demonstrate its potential to analyze larger and more complex songs.

How to use

  1. Download the original dataset, which is licensed under Creative Commons Attribution-ShareAlike 4.0 International.

  2. Unzip the dataset and move the folder ("ISMIR16-EM-Patterns-Audio") to your working directory.

  3. In terminal, run:

cd icassp2018
python audio_generation.py

Source

The data we used to evaluate our algorithm were derived from "Towards Modeling and Decomposing Loop-based Electronic Music", by Patricio López-Serrano, Christian Dittmar, Jonathan Driedger and Meinard Müller. They provided a set of sample stems to create short electronic dance music (EDM) tracks.

  1. Link to original dataset

  2. Citation:

     @inproceedings{lopez-serrano2016,
     Address = {New York, NY, USA},
     Author = {L{\'o}pez-Serrano, Patricio and Dittmar, Christian and Driedger, Jonathan and M{\"u}ller, Meinard},
     Booktitle = {Proceedings of the International Society for Music Information Retrieval Conference},
     Pages = {502--508},
     Title = {Towards Modeling and Decomposing Loop-based Electronic Music},
     Year = {2016}}
    

About

Data related to ICASSP 2018 publication: "Nonnegative Tensor Factorization for Source Separation of Loops in Audio"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages