Data Augmentation with Signal Companding for Detection of Logical Access Attacks

View on arXiv ← Back to list

Authors: Rohan Kumar Das, Jichen Yang, Haizhou Li

Published: 2021-02-12 02:51:06+00:00

AI Summary

This paper introduces a novel data augmentation technique using a-law and mu-law signal companding to improve the detection of logical access attacks in automatic speaker verification (ASV). Experiments on the ASVspoof 2019 logical access corpus show that this method outperforms state-of-the-art spoofing countermeasures, particularly in handling unknown attacks.

Abstract

The recent advances in voice conversion (VC) and text-to-speech (TTS) make it possible to produce natural sounding speech that poses threat to automatic speaker verification (ASV) systems. To this end, research on spoofing countermeasures has gained attention to protect ASV systems from such attacks. While the advanced spoofing countermeasures are able to detect known nature of spoofing attacks, they are not that effective under unknown attacks. In this work, we propose a novel data augmentation technique using a-law and mu-law based signal companding. We believe that the proposed method has an edge over traditional data augmentation by adding small perturbation or quantization noise. The studies are conducted on ASVspoof 2019 logical access corpus using light convolutional neural network based system. We find that the proposed data augmentation technique based on signal companding outperforms the state-of-the-art spoofing countermeasures showing ability to handle unknown nature of attacks.

Key findings

The proposed data augmentation technique significantly improved the detection of unknown logical access attacks compared to baselines and other data augmentation methods. The approach achieved state-of-the-art performance on the ASVspoof 2019 evaluation set, outperforming existing single systems.

Approach

The authors propose data augmentation using a-law and mu-law signal companding. This technique compresses and expands audio signals to generate additional training data without requiring external datasets. A light convolutional neural network (LCNN) is trained using the augmented data to detect logical access attacks.

Datasets

ASVspoof 2019 logical access corpus, NoiseX-92 database (for comparison)

Model(s)

Light Convolutional Neural Network (LCNN)

Author countries

Singapore

← Previous