Robust copy-move detection and localization of digital audio based CFCC feature
Wang, Dongyu; Li, Xiaojie; Shi, Canghong; Xiong, Ling; Qian, Qing
DOI: https://doi.org/10.1007/s11042-024-19346-x
IF: 2.577
2024-05-11
Multimedia Tools and Applications
Abstract: Copy-move forgery is a common audio tampering technique in which a segment of a speech signal is copied and pasted into another region of the same signal, altering its semantics. To verify the authenticity of audio, this paper proposes a method to detect and localize audio copy-move forgery based on cochlear filter cepstral coefficients (CFCC) features. A pitch tracking algorithm is used to distinguish the voiced and unvoiced segments in the audio, and CFCC coefficients are then extracted for each voiced segment. The CFCC feature simulates the entire transmission process of signals in the cochlear basilar membrane using a wavelet transform. Finally, Pearson correlation coefficients (PCCs) and dynamic time warping (DTW) are used in combination to compare the similarity of voiced segments, and the tampered locations in the audio are determined by thresholding. In extensive experiments on relevant datasets, the algorithm achieves a precision of 98.39% and a recall of 98.00% when detecting forgery in audio without post-processing. Even for audio that has undergone various post-processing operations, precision and recall remain above 90%. Compared to existing methods, this approach not only achieves precise localization of the duplicated segments but also demonstrates superior experimental results.
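The segment-comparison step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how PCC and DTW are combined, so the sketch assumes PCC is applied to segment pairs with the same number of frames and DTW to pairs of different length. The function names, the feature-matrix layout (frames by coefficients), and the thresholds `pcc_min` and `dtw_max` are all hypothetical, and the per-segment feature matrices stand in for CFCC features that are not computed here.

```python
import numpy as np

def pearson(a, b):
    """Pearson correlation between two equal-shape feature matrices (flattened)."""
    a = np.asarray(a, dtype=float).ravel()
    b = np.asarray(b, dtype=float).ravel()
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def dtw_distance(x, y):
    """Classic DTW between two frame sequences (rows are frames).

    Local cost is the Euclidean distance between frames; the accumulated
    cost is divided by len(x) + len(y) so segments of different lengths
    are comparable (a normalisation chosen for this sketch).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n, m = len(x), len(y)
    cost = np.linalg.norm(x[:, None, :] - y[None, :, :], axis=2)
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            acc[i, j] = cost[i - 1, j - 1] + min(
                acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1]
            )
    return float(acc[n, m] / (n + m))

def find_duplicates(segments, pcc_min=0.95, dtw_max=0.1):
    """Return index pairs (i, j) of voiced segments judged to be copies.

    Assumed combination rule (not taken from the paper): PCC for pairs
    with identical shape, DTW otherwise. Both thresholds are placeholders.
    """
    pairs = []
    for i in range(len(segments)):
        for j in range(i + 1, len(segments)):
            a, b = segments[i], segments[j]
            if a.shape == b.shape:
                if pearson(a, b) >= pcc_min:
                    pairs.append((i, j))
            elif dtw_distance(a, b) <= dtw_max:
                pairs.append((i, j))
    return pairs
```

In practice each entry of `segments` would hold the features of one voiced segment found by pitch tracking, and the returned index pairs would be mapped back to sample positions to localize the copied and pasted regions.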
computer science, information systems, theory & methods; engineering, electrical & electronic, software engineering