Cover Image for System.Linq.Enumerable+EnumerablePartition`1[System.Char]

Audio Tampering Forensics Based on Representation Learning of ENF Phase Sequence

OAI: oai:igi-global.com:302894 DOI: 10.4018/IJDCF.302894

Abstract

This paper proposes an audio tampering detection method based on the ENF phase and BI-LSTM network from the perspective of temporal feature representation learning. First, the ENF phase is obtained by discrete Fourier transform of ENF component in audio. Second, the ENF phase is divided into frames to obtain ENF phase sequence characterization, and each frame is represented as the change information of the ENF phase in a period. Then, the BI-LSTM neural network is used to train and output the state of each time step, and the difference information between real audio and tampered audio is obtained. Finally, these differences were fitted and dimensionally reduced by the fully connected network and classified by the Softmax classifier. Experimental results show that the performance of this method is better than the state-of-the-art approaches.