Malacopula: adversarial automatic speaker verification attacks using a neural-based generalised Hammerstein model

Todisco, Massimiliano; Panariello, Michele; Wang, Xin; Delgado, Hector; Lee, Kong Aik; Evans, Nicholas
ASVspoof Workshop 2024, Automatic Speaker Verification Spoofing and Countermeasures Challenge, 31 August 2024, Kos Island, Greece

We present Malacopula, a neural-based generalised Hammerstein model designed to introduce adversarial perturbations to spoofed speech utterances so that they better deceive automatic speaker verification (ASV) systems. Using non-linear processes to modify speech utterances, Malacopula enhances the effectiveness of spoofing attacks. The model comprises parallel branches of polynomial functions followed by linear timeinvariant filters. The adversarial optimisation procedure acts to minimise the cosine distance between speaker embeddings extracted from spoofed and bona fide utterances. Experiments, performed using three recent ASV systems and the ASVspoof 2019 dataset, show that Malacopula increases vulnerabilities by a substantial margin. However, speech quality is reduced and attacks can be detected effectively under controlled conditions. The findings emphasise the need to identify new vulnerabilities and design defences to protect ASV systems from adversarial attacks in the wild. 


DOI
HAL
Type:
Conference
City:
Kos island
Date:
2024-08-31
Department:
Digital Security
Eurecom Ref:
7828
Copyright:
© ISCA. Personal use of this material is permitted. The definitive version of this paper was published in ASVspoof Workshop 2024, Automatic Speaker Verification Spoofing and Countermeasures Challenge, 31 August 2024, Kos Island, Greece and is available at : http://dx.doi.org/10.21437/ASVspoof.2024-14

PERMALINK : https://www.eurecom.fr/publication/7828