Data Augmentation with Signal Companding for Detection of Logical Access Attacks
Authors: Rohan Kumar Das, Jichen Yang, Haizhou Li
Published: 2021-02-12 02:51:06+00:00
Comment: 5 pages, Accepted for publication in International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2021
AI Summary
This paper proposes a novel data augmentation technique utilizing a-law and mu-law based signal companding to improve the detection of logical access attacks against automatic speaker verification (ASV) systems. The method aims to enhance the robustness of spoofing countermeasures, particularly against unknown attack types derived from advanced voice conversion and text-to-speech technologies. Experiments show that this companding-based augmentation outperforms traditional data augmentation and state-of-the-art countermeasures in handling unseen logical access attacks.
Abstract
The recent advances in voice conversion (VC) and text-to-speech (TTS) make it possible to produce natural sounding speech that poses threat to automatic speaker verification (ASV) systems. To this end, research on spoofing countermeasures has gained attention to protect ASV systems from such attacks. While the advanced spoofing countermeasures are able to detect known nature of spoofing attacks, they are not that effective under unknown attacks. In this work, we propose a novel data augmentation technique using a-law and mu-law based signal companding. We believe that the proposed method has an edge over traditional data augmentation by adding small perturbation or quantization noise. The studies are conducted on ASVspoof 2019 logical access corpus using light convolutional neural network based system. We find that the proposed data augmentation technique based on signal companding outperforms the state-of-the-art spoofing countermeasures showing ability to handle unknown nature of attacks.