Enhancing Explainability of Neural Networks Through Architecture Constraints.

Journal

IEEE transactions on neural networks and learning systems

ISSN: 2162-2388

Titre abrégé: IEEE Trans Neural Netw Learn Syst

Pays: United States

ID NLM: 101616214

Informations de publication

Date de publication:
06 2021

Historique:

pubmed: 28 7 2020

medline: 28 7 2020

entrez: 28 7 2020

Statut: ppublish

Résumé

Prediction accuracy and model explainability are the two most important objectives when developing machine learning algorithms to solve real-world problems. Neural networks are known to possess good prediction performance but suffer from a lack of model interpretability. In this article, we propose to enhance the explainability of neural networks through the following architecture constraints: 1) sparse additive subnetworks; 2) projection pursuit with orthogonality constraint; and 3) smooth function approximation. It leads to an enhanced explainable neural network (ExNN) with a superior balance between prediction performance and model interpretability. We derive sufficient identifiability conditions for the proposed ExNN model. The multiple parameters are simultaneously estimated by a modified minibatch gradient descent method based on the backpropagation algorithm for calculating the derivatives and the Cayley transform for preserving the projection orthogonality. Through simulation study under six different scenarios, we compare the proposed method to several benchmarks, including least absolute shrinkage and selection operator, support vector machine, random forest, extreme learning machine, and multilayer perceptron. It is shown that the proposed ExNN model keeps the flexibility of pursuing high prediction accuracy while attaining improved interpretability. Finally, a real data example is employed as a showcase application.

Identifiants

DOI: 10.1109/TNNLS.2020.3007259 PMID: 32716891

pubmed: 32716891

doi: 10.1109/TNNLS.2020.3007259

doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

Pagination

2610-2621

Enhancing Explainability of Neural Networks Through Architecture Constraints.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Auteurs

Zebin Yang (Z)

Aijun Zhang (A)

Agus Sudjianto (A)

Classifications MeSH