Graph-based methods coupled with specific distributional distances for adversarial attack detection.

Artificial neural network Bio-inspired Deep learning Graph theory Machine learning Wasserstein

Journal

Neural networks : the official journal of the International Neural Network Society

ISSN: 1879-2782

Titre abrégé: Neural Netw

Pays: United States

ID NLM: 8805018

Informations de publication

Date de publication:
11 Oct 2023

Historique:

received: 31 05 2023

revised: 26 09 2023

accepted: 06 10 2023

medline: 19 10 2023

pubmed: 19 10 2023

entrez: 18 10 2023

Statut: aheadofprint

Résumé

Artificial neural networks are prone to being fooled by carefully perturbed inputs which cause an egregious misclassification. These adversarial attacks have been the focus of extensive research. Likewise, there has been an abundance of research in ways to detect and defend against them. We introduce a novel approach of detection and interpretation of adversarial attacks from a graph perspective. For an input image, we compute an associated sparse graph using the layer-wise relevance propagation algorithm (Bach et al., 2015). Specifically, we only keep edges of the neural network with the highest relevance values. Three quantities are then computed from the graph which are then compared against those computed from the training set. The result of the comparison is a classification of the image as benign or adversarial. To make the comparison, two classification methods are introduced: (1) an explicit formula based on Wasserstein distance applied to the degree of node and (2) a logistic regression. Both classification methods produce strong results which lead us to believe that a graph-based interpretation of adversarial attacks is valuable.

Identifiants

DOI: 10.1016/j.neunet.2023.10.007 PMID: 37852166

pubmed: 37852166

pii: S0893-6080(23)00561-0

doi: 10.1016/j.neunet.2023.10.007

pii:

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

11-19

Informations de copyright

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Graph-based methods coupled with specific distributional distances for adversarial attack detection.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Informations de copyright

Déclaration de conflit d'intérêts

Auteurs

Dwight Nwaigwe (D)

Lucrezia Carboni (L)

Martial Mermillod (M)

Sophie Achard (S)

Michel Dojat (M)

Classifications MeSH