Mel frequency spectral domain defenses against adversarial attacks on speech recognition systems.
Journal
JASA express letters
ISSN: 2691-1191
Titre abrégé: JASA Express Lett
Pays: United States
ID NLM: 101775177
Informations de publication
Date de publication:
03 2023
03 2023
Historique:
medline:
4
4
2023
entrez:
1
4
2023
pubmed:
2
4
2023
Statut:
ppublish
Résumé
Automatic speech recognition (ASR) systems are vulnerable to adversarial attacks due to their reliance on machine learning models. Many of the defenses explored for defending ASR systems simply adapt defense approaches developed for the image domain. This paper explores speech-specific defenses in the feature domain and introduces a defense method called mel domain noise flooding (MDNF). MDNF injects additive noise to the mel spectrogram speech representation prior to re-synthesizing the audio signal input to ASR. The defense is evaluated against strong white-box threat models and shows competitive robustness.
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM