Generalized Spoofing Detection Inspired from Audio Generation Artifacts

By Yang Gao, Tyler Vuong and Mahsa Elyasi

Published on ISCA

Summary

State-of-the-art methods for audio generation suffer from fingerprint artifacts and repeated inconsistencies across temporal and spectral domains. Such artifacts could be well captured by the frequency domain analysis over the spectrogram. Thus, we propose a novel use of long-range spectro-temporal modulation feature -- 2D DCT over log-Mel spectrogram for the audio deepfake detection. We show that this feature works better than log-Mel spectrogram, CQCC, MFCC, as a suitable candidate to capture such artifacts. We employ spectrum augmentation and feature normalization to decrease overfitting and bridge the gap between training and test dataset along with this novel feature introduction. We developed a CNN-based baseline that achieved a 0.0849 t-DCF and outperformed the previously top single systems reported in the ASVspoof 2019 challenge. Finally, by combining our baseline with our proposed 2D DCT spectro-temporal feature, we decrease the t-DCF score down by 14% to 0.0737, making it a state-of-the-art system for spoofing detection. Furthermore, we evaluate our model using two external datasets, showing the proposed feature's generalization ability. We also provide analysis and ablation studies for our proposed feature and results.

Research

All Solutions

Reality Defender Launches Free Access to Deepfake Detection API

Reality Defender Launches Free Access to Deepfake Detection API

Reality Defender Wins “Most Innovative Startup” at RSA Conference Innovation Sandbox

Generalized Spoofing Detection Inspired from Audio Generation Artifacts

Published on ISCA

Summary

Read More of Our Peer-Reviewed Research, Published in Top Journals

PolyJuice Makes It Real: Black-Box, Universal Red Teaming for Synthetic Image Detectors

A Data-Driven Diffusion-based Approach for Audio Deepfake Explanations

X-Edit: Detecting and Localizing Edits in Images Altered by Text-Guided Diffusion Models