Spectrogram is selected as preprocessing feature of audio samples and a feature representation method based on deep residual network (Spec-ResNet) is implemented here to detect audio steganography based on AAC(Advanced Audio Coding) and MP3(MPEG-1 Audio Layer III).
First of all, we analysed existing audio steganography based on AAC and MP3. Although the embedding domain is different, the final impact of existing schemes are the subtle modifications of MDCT coefficients of the encoded audio signals in different frequency range. Spectrogram, a basic visual representation of the spectrum of frequencies of audio signals which can show energy amplitude information of different frequency bands over time and contains abundant time-frequency information about the audio signal.
Still, this is a work on progress, we will update it lately. You will find responding paper "Spec-ResNet: A General Audio Steganalysis scheme based on Deep Residual Network of Spectrogram" by Yanzhen Ren, Dengkai Liu, Qiaochu Xiong, Jianming Fu, Lina Wang in https://arxiv.org/abs/1901.06838.
- Tensorflow 1.0.0
- CUDA 8.0.61
- CuDNN 5.1.10
- Python 2.7.15.
-
-
stego with 5 relative embedding rate for AAC is provided. (2s, stereo, 1024Kbps, decoded WAV)
-
stego with approximately maxium embedding capcity for Mp3Stego is provided.(5s, mono, 705Kbps, decoded wav)
spectrogram feature extraction script is in AAC (2s, stereo, 1024Kbps, decoded WAV).rar together with a small sample subset, whose spectrogram matrix with window size
N=512
in Spectrogram.rar.
-
-
ASDIIE(extract code: z28d)
For AAC based steganographic algorithm, we choosed above three as objects. Considering the capicity of MP3stego, we choosed several emerging MP3 adaptive steganography(AHCM, EECS) with distortion minimization and UnderMp3Cover to test the detection ability. The dataset we choose was also changed. It wil be coming soon...