Generalizability and robustness evaluation of attribute-based zero-shot learning.

Evaluation metrics Generalizability Robustness Zero-shot learning

Journal

Neural networks : the official journal of the International Neural Network Society
ISSN: 1879-2782
Titre abrégé: Neural Netw
Pays: United States
ID NLM: 8805018

Informations de publication

Date de publication:
28 Mar 2024
Historique:
received: 06 09 2023
revised: 15 02 2024
accepted: 26 03 2024
medline: 7 4 2024
pubmed: 7 4 2024
entrez: 6 4 2024
Statut: aheadofprint

Résumé

In the field of deep learning, large quantities of data are typically required to effectively train models. This challenge has given rise to techniques like zero-shot learning (ZSL), which trains models on a set of "seen" classes and evaluates them on a set of "unseen" classes. Although ZSL has shown considerable potential, particularly with the employment of generative methods, its generalizability to real-world scenarios remains uncertain. The hypothesis of this work is that the performance of ZSL models is systematically influenced by the chosen "splits"; in particular, the statistical properties of the classes and attributes used in training. In this paper, we test this hypothesis by introducing the concepts of generalizability and robustness in attribute-based ZSL and carry out a variety of experiments to stress-test ZSL models against different splits. Our aim is to lay the groundwork for future research on ZSL models' generalizability, robustness, and practical applications. We evaluate the accuracy of state-of-the-art models on benchmark datasets and identify consistent trends in generalizability and robustness. We analyze how these properties vary based on the dataset type, differentiating between coarse- and fine-grained datasets, and our findings indicate significant room for improvement in both generalizability and robustness. Furthermore, our results demonstrate the effectiveness of dimensionality reduction techniques in improving the performance of state-of-the-art models in fine-grained datasets.

Identifiants

pubmed: 38581809
pii: S0893-6080(24)00202-8
doi: 10.1016/j.neunet.2024.106278
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

106278

Informations de copyright

Copyright © 2024 The Author(s). Published by Elsevier Ltd.. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Luca Rossi (L)

Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: l.rossi@pm.univpm.it.

Maria Chiara Fiorentino (MC)

Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: m.c.fiorentino@pm.univpm.it.

Adriano Mancini (A)

Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: a.mancini@univpm.it.

Marina Paolanti (M)

Dipartimento di Scienze politiche, della Comunicazione e delle Relazioni Internazionali, Università di Macerata, 62100, Macerata, Italy. Electronic address: marina.paolanti@unimc.it.

Riccardo Rosati (R)

Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: r.rosati@pm.univpm.it.

Primo Zingaretti (P)

Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: p.zingaretti@univpm.it.

Classifications MeSH