yongxuUSTC/sednn

deep_learning_for_speech_enhancement_keras_python

Deep learning based speech enhancement using Keras and Python.

Authors: YONG XU & QIUQIANG KONG

Goal:

Port the GPU-C++ project to Python code, which is much easier for the community to follow and use. The training and decoding code will be unified in the Python code, with Keras as the toolkit.

Invitation:

I would like to invite you to become a contributor to this project. Please contact me if you are interested: yong.xu.ustc@gmail.com

My final goal is to build a universal and robust deep learning based speech enhancement front end, and also to try to adapt it so that it genuinely serves the speech recognition back end.

Ref:

The original GPU-C++ code: https://github.com/yongxuUSTC/DNN-for-speech-enhancement

Please cite the following papers if you use this code:

[1] A Regression Approach to Speech Enhancement Based on Deep Neural Networks. Yong Xu, Jun Du, Li-Rong Dai and Chin-Hui Lee, IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 7-19, vol. 23, no. 1, 2015 (2018 IEEE SPS Best Paper Award, citations > 600)

[2] An Experimental Study on Speech Enhancement Based on Deep Neural Networks. Yong Xu, Jun Du, Li-Rong Dai and Chin-Hui Lee, IEEE Signal Processing Letters, pp. 65-68, vol. 21, no. 1, January 2014 (citations > 550)

[3] Multi-Objective Learning and Mask-Based Post-Processing for Deep Neural Network Based Speech Enhancement. Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai and Chin-Hui Lee, Interspeech 2015

Some DNN based speech enhancement demos:

http://staff.ustc.edu.cn/~jundu/The%20team/yongxu/demo/SE_DNN_taslp.html

http://staff.ustc.edu.cn/~jundu/The%20team/yongxu/demo/IS15.html