Recurrent neural network pruning using dynamical systems and iterative fine-tuning.

Linear dynamical systems Network pruning Recurrent neural networks Regularization

Journal

Neural networks : the official journal of the International Neural Network Society
ISSN: 1879-2782
Titre abrégé: Neural Netw
Pays: United States
ID NLM: 8805018

Informations de publication

Date de publication:
Nov 2021
Historique:
received: 08 02 2021
revised: 24 06 2021
accepted: 02 07 2021
pubmed: 20 7 2021
medline: 25 11 2021
entrez: 19 7 2021
Statut: ppublish

Résumé

Network pruning techniques are widely employed to reduce the memory requirements and increase the inference speed of neural networks. This work proposes a novel RNN pruning method that considers the RNN weight matrices as collections of time-evolving signals. Such signals that represent weight vectors can be modelled using Linear Dynamical Systems (LDSs). In this way, weight vectors with similar temporal dynamics can be pruned as they have limited effect on the performance of the model. Additionally, during the fine-tuning of the pruned model, a novel discrimination-aware variation of the L2 regularization is introduced to penalize network weights (i.e., reduce the magnitude), whose impact on the output of an RNN network is minimal. Finally, an iterative fine-tuning approach is proposed that employs a bigger model to guide an increasingly smaller pruned one, as a steep decrease of the network parameters can irreversibly harm the performance of the pruned model. Extensive experimentation with different network architectures demonstrates the potential of the proposed method to create pruned models with significantly improved perplexity by at least 0.62% on the PTB dataset and improved F1-score by 1.39% on the SQuAD dataset, contrary to other state-of-the-art approaches that slightly improve or even deteriorate models' performance.

Identifiants

pubmed: 34280607
pii: S0893-6080(21)00264-1
doi: 10.1016/j.neunet.2021.07.001
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

475-488

Informations de copyright

Copyright © 2021 Elsevier Ltd. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Christos Chatzikonstantinou (C)

Information Technologies Institute, Centre for Research and Technology Hellas, Greece. Electronic address: chatziko@iti.gr.

Dimitrios Konstantinidis (D)

Information Technologies Institute, Centre for Research and Technology Hellas, Greece. Electronic address: dikonsta@iti.gr.

Kosmas Dimitropoulos (K)

Information Technologies Institute, Centre for Research and Technology Hellas, Greece. Electronic address: dimitrop@iti.gr.

Petros Daras (P)

Information Technologies Institute, Centre for Research and Technology Hellas, Greece. Electronic address: daras@iti.gr.

Articles similaires

Unsupervised learning for real-time and continuous gait phase detection.

Dollaporn Anopas, Yodchanan Wongsawat, Jetsada Arnin
1.00
Humans Gait Neural Networks, Computer Unsupervised Machine Learning Walking
Humans Shoulder Fractures Tomography, X-Ray Computed Neural Networks, Computer Female
Humans Artificial Intelligence Neoplasms Prognosis Image Processing, Computer-Assisted
Humans Deep Learning Mouth Neoplasms Drug Resistance, Neoplasm Cell Line, Tumor

Classifications MeSH