An interpretive constrained linear model for ResNet and MgNet.

Convolutional neural networks Data-feature mapping MgNet Multigrid iterative methods ResNet

Journal

Neural networks : the official journal of the International Neural Network Society
ISSN: 1879-2782
Titre abrégé: Neural Netw
Pays: United States
ID NLM: 8805018

Informations de publication

Date de publication:
May 2023
Historique:
received: 08 12 2021
revised: 11 02 2023
accepted: 07 03 2023
medline: 25 4 2023
pubmed: 23 3 2023
entrez: 22 3 2023
Statut: ppublish

Résumé

We propose a constrained linear data-feature-mapping model as an interpretable mathematical model for image classification using a convolutional neural network (CNN). From this viewpoint, we establish detailed connections between the traditional iterative schemes for linear systems and the architectures of the basic blocks of ResNet- and MgNet-type models. Using these connections, we present some modified ResNet models that, compared with the original models, have fewer parameters but can produce more accurate results, thereby demonstrating the validity of this constrained learning data-feature-mapping assumption. Based on this assumption, we further propose a general data-feature iterative scheme to demonstrate the rationality of MgNet. We also provide a systematic numerical study on MgNet to show its success and advantages in image classification problems, particularly in comparison with established networks.

Identifiants

pubmed: 36947909
pii: S0893-6080(23)00128-4
doi: 10.1016/j.neunet.2023.03.011
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

384-392

Informations de copyright

Copyright © 2023 Elsevier Ltd. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Juncai He (J)

Computer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia. Electronic address: juncai.he@kaust.edu.sa.

Jinchao Xu (J)

Computer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia; Department of Mathematics, The Pennsylvania State University, University Park, PA 16802, USA. Electronic address: xu@multigrid.org.

Lian Zhang (L)

Shenzhen International Center for Industrial and Applied Mathematics, Shenzhen Research Institute of Big Data, Shenzhen 518172, China. Electronic address: zhanglian@sribd.cn.

Jianqing Zhu (J)

Faculty of Science, Beijing University of Technology, Beijing 100124, China. Electronic address: jqzhu@emails.bjut.edu.cn.

Articles similaires

Vancomycin Polyesters Anti-Bacterial Agents Models, Theoretical Drug Liberation

Unsupervised learning for real-time and continuous gait phase detection.

Dollaporn Anopas, Yodchanan Wongsawat, Jetsada Arnin
1.00
Humans Gait Neural Networks, Computer Unsupervised Machine Learning Walking
Humans Shoulder Fractures Tomography, X-Ray Computed Neural Networks, Computer Female
Humans Artificial Intelligence Neoplasms Prognosis Image Processing, Computer-Assisted

Classifications MeSH