ICASSP 2018: Nonnegative Tensor Factorization for Source Separation of Loops in Audio

This site contains instructions on how to generate the ground truth data related to our ICASSP 2018 publication, "Nonnegative Tensor Factorization for Source Separation of Loops in Audio" (PDF), described in detail here.

@inproceedings{smith2018nonnegative,
Address = {Calgary, AB, Canada},
Author = {Smith, Jordan B. L. and Goto, Masataka},
Booktitle = {Proceedings of the {IEEE} International Conference on Acoustics, Speech and Signal Processing},
Pages = {171--175},
Title = {Nonnegative tensor factorization for source separation of loops in audio},
Year = {2018}}

Abstract

The prevalence of exact repetition in loop-based music makes it an opportune target for source separation. Nonnegative factorization approaches have been used to model the repetition of looped content, and kernel additive modeling has leveraged periodicity within a piece to separate looped background elements. We propose a novel method of leveraging periodicity in a factorization model: we treat the two-dimensional spectrogram as a three-dimensional tensor, and use nonnegative tensor factorization to estimate the component spectral templates, rhythms and loop recurrences in a single step. Testing our method on synthesized loop-based examples, we find that our algorithm mostly exceeds the performance of competing methods, with a reduction in execution cost. We discuss limitations of the algorithm as we demonstrate its potential to analyze larger and more complex songs.

How to use

Download the original dataset, which is licensed under Creative Commons Attribution-ShareAlike 4.0 International.
Unzip the dataset and move the folder ("ISMIR16-EM-Patterns-Audio") to your working directory.
In terminal, run:

cd icassp2018
python audio_generation.py

Source

The data we used to evaluate our algorithm were derived from "Towards Modeling and Decomposing Loop-based Electronic Music", by Patricio López-Serrano, Christian Dittmar, Jonathan Driedger and Meinard Müller. They provided a set of sample stems to create short electronic dance music (EDM) tracks.

Link to original dataset

Citation:

 @inproceedings{lopez-serrano2016,
 Address = {New York, NY, USA},
 Author = {L{\'o}pez-Serrano, Patricio and Dittmar, Christian and Driedger, Jonathan and M{\"u}ller, Meinard},
 Booktitle = {Proceedings of the International Society for Music Information Retrieval Conference},
 Pages = {502--508},
 Title = {Towards Modeling and Decomposing Loop-based Electronic Music},
 Year = {2016}}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
README.md		README.md
audio_generation.py		audio_generation.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ICASSP 2018: Nonnegative Tensor Factorization for Source Separation of Loops in Audio

Abstract

How to use

Source

About

Releases

Packages

Languages

jblsmith/icassp2018

Folders and files

Latest commit

History

Repository files navigation

ICASSP 2018: Nonnegative Tensor Factorization for Source Separation of Loops in Audio

Abstract

How to use

Source

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages