torchaudio: an audio library for PyTorch

- Supports audio I/O (loading and saving files)
  - Loads the following formats into a torch Tensor:
    - mp3, wav, aac, ogg, flac, avr, cdda, cvs/vms,
    - aiff, au, amr, mp2, mp4, ac3, avi, wmv,
    - mpeg, ircam, and any other format supported by libsox.
- Dataloaders for common audio datasets (VCTK, YesNo)
- Common audio transforms
  - Scale, PadTrim, DownmixMono, LC2CL, BLC2CBL, MuLawEncoding, MuLawExpanding
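
To give a feel for what the MuLawEncoding and MuLawExpanding transforms compute, here is a minimal pure-Python sketch of standard mu-law companding on single samples. It is not torchaudio's implementation: the function names `mu_law_encode` and `mu_law_expand`, the default of 255 quantization steps, and the rounding choice are assumptions made for illustration, and the library's transforms operate on tensors rather than on scalar floats.

```python
import math

MU = 255  # assumed quantization parameter (8-bit mu-law); illustrative default


def mu_law_encode(x, mu=MU):
    """Map a sample in [-1.0, 1.0] to an integer code in [0, mu]."""
    x = max(-1.0, min(1.0, x))  # clamp out-of-range input
    # Logarithmic compression: small amplitudes get finer resolution.
    compressed = math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)
    # Shift from [-1, 1] to [0, mu] and round to the nearest integer code.
    return int((compressed + 1) / 2 * mu + 0.5)


def mu_law_expand(code, mu=MU):
    """Approximately invert mu_law_encode, returning a float in [-1.0, 1.0]."""
    y = 2 * code / mu - 1  # back to [-1, 1]
    return math.copysign(((1 + mu) ** abs(y) - 1) / mu, y)


if __name__ == "__main__":
    for sample in (-1.0, -0.1, 0.0, 0.5, 1.0):
        code = mu_law_encode(sample)
        print(sample, code, round(mu_law_expand(code), 4))
```

The round trip is lossy by design: expanding an encoded sample returns a value close to, but generally not equal to, the original.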

Dependencies

Quick install on OSX (Homebrew):

brew install sox

Linux (Ubuntu):

sudo apt-get install sox libsox-dev libsox-fmt-all

Installation

pip install cffi
python setup.py install

Quick Usage

import torchaudio
sound, sample_rate = torchaudio.load('foo.mp3') # loads file into a tensor
torchaudio.save('foo_save.mp3', sound, sample_rate) # saves tensor to file

The API reference is located at http://pytorch.org/audio/
