- Support audio I/O (Load files, Save files)
- Load the following formats into a torch Tensor
- mp3, wav, aac, ogg, flac, avr, cdda, cvs/vms,
- aiff, au, amr, mp2, mp4, ac3, avi, wmv,
- mpeg, ircam and any other format supported by libsox.
- Load the following formats into a torch Tensor
- Dataloaders for common audio datasets (VCTK, YesNo)
- Common audio transforms
- libsox v14.3.2 or above
Quick install on OSX (Homebrew):
brew install sox
Linux (Ubuntu):
sudo apt-get install sox libsox-dev libsox-fmt-all
pip install cffi
python setup.py install
import torchaudio
sound, sample_rate = torchaudio.load('foo.mp3')
torchaudio.save('foo_save.mp3', sound, sample_rate) # saves tensor to file
API Reference is located here: http://pytorch.org/audio/