Catastrophic Forgetting in Deep Graph Networks: A Graph Classification Benchmark.

benchmarks catastrophic-forgetting continual-learning deep-graph-networks lifelong-learning

Journal

Frontiers in artificial intelligence
ISSN: 2624-8212
Titre abrégé: Front Artif Intell
Pays: Switzerland
ID NLM: 101770551

Informations de publication

Date de publication:
2022
Historique:
received: 29 11 2021
accepted: 11 01 2022
entrez: 21 2 2022
pubmed: 22 2 2022
medline: 22 2 2022
Statut: epublish

Résumé

In this work, we study the phenomenon of catastrophic forgetting in the graph representation learning scenario. The primary objective of the analysis is to understand whether classical continual learning techniques for flat and sequential data have a tangible impact on performances when applied to graph data. To do so, we experiment with a structure-agnostic model and a deep graph network in a robust and controlled environment on three different datasets. The benchmark is complemented by an investigation on the effect of structure-preserving regularization techniques on catastrophic forgetting. We find that replay is the most effective strategy in so far, which also benefits the most from the use of regularization. Our findings suggest interesting future research at the intersection of the continual and graph representation learning fields. Finally, we provide researchers with a flexible software framework to reproduce our results and carry out further experiments.

Identifiants

pubmed: 35187476
doi: 10.3389/frai.2022.824655
pmc: PMC8855050
doi:

Types de publication

Journal Article

Langues

eng

Pagination

824655

Informations de copyright

Copyright © 2022 Carta, Cossu, Errica and Bacciu.

Déclaration de conflit d'intérêts

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Références

Curr Pharm Des. 2007;13(14):1469-95
pubmed: 17504168
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24
pubmed: 32217482
Neural Comput. 2021 Oct 12;33(11):2908-2950
pubmed: 34474476
Neural Netw. 2019 Aug;116:56-73
pubmed: 31005851
Nat Commun. 2020 Aug 13;11(1):4069
pubmed: 32792531
IEEE Trans Neural Netw. 2009 Mar;20(3):498-511
pubmed: 19193509
Trends Cogn Sci. 1999 Apr;3(4):128-135
pubmed: 10322466
Neural Netw. 2020 Sep;129:203-221
pubmed: 32559609
Psychol Rev. 1980 Jan;87(1):1-51
pubmed: 7375607
Proc Natl Acad Sci U S A. 2017 Mar 28;114(13):3521-3526
pubmed: 28292907
IEEE Trans Neural Netw. 1998;9(5):768-86
pubmed: 18255765
Neural Netw. 2021 Nov;143:607-627
pubmed: 34343775
Neural Netw. 2002 Oct-Nov;15(8-9):1041-58
pubmed: 12416693
IEEE Trans Neural Netw. 2009 Jan;20(1):61-80
pubmed: 19068426
Neural Netw. 2019 May;113:54-71
pubmed: 30780045
Neural Comput. 2020 Jan;32(1):1-35
pubmed: 31703175
IEEE Trans Neural Netw. 1997;8(3):714-35
pubmed: 18255672

Auteurs

Antonio Carta (A)

Computer Science Department, University of Pisa, Pisa, Italy.

Andrea Cossu (A)

Computer Science Department, University of Pisa, Pisa, Italy.
Scuola Normale Superiore, Pisa, Italy.

Federico Errica (F)

Computer Science Department, University of Pisa, Pisa, Italy.

Davide Bacciu (D)

Computer Science Department, University of Pisa, Pisa, Italy.

Classifications MeSH