Recurrent neural network pruning using dynamical systems and iterative fine-tuning.

Empirical Research Neural Networks, Computer

Linear dynamical systems Network pruning Recurrent neural networks Regularization

Journal

Neural networks : the official journal of the International Neural Network Society

ISSN: 1879-2782

Titre abrégé: Neural Netw

Pays: United States

ID NLM: 8805018

Informations de publication

Date de publication:
Nov 2021

Historique:

received: 08 02 2021

revised: 24 06 2021

accepted: 02 07 2021

pubmed: 20 7 2021

medline: 25 11 2021

entrez: 19 7 2021

Statut: ppublish

Résumé

Network pruning techniques are widely employed to reduce the memory requirements and increase the inference speed of neural networks. This work proposes a novel RNN pruning method that considers the RNN weight matrices as collections of time-evolving signals. Such signals that represent weight vectors can be modelled using Linear Dynamical Systems (LDSs). In this way, weight vectors with similar temporal dynamics can be pruned as they have limited effect on the performance of the model. Additionally, during the fine-tuning of the pruned model, a novel discrimination-aware variation of the L2 regularization is introduced to penalize network weights (i.e., reduce the magnitude), whose impact on the output of an RNN network is minimal. Finally, an iterative fine-tuning approach is proposed that employs a bigger model to guide an increasingly smaller pruned one, as a steep decrease of the network parameters can irreversibly harm the performance of the pruned model. Extensive experimentation with different network architectures demonstrates the potential of the proposed method to create pruned models with significantly improved perplexity by at least 0.62% on the PTB dataset and improved F1-score by 1.39% on the SQuAD dataset, contrary to other state-of-the-art approaches that slightly improve or even deteriorate models' performance.

Identifiants

DOI: 10.1016/j.neunet.2021.07.001 PMID: 34280607

pubmed: 34280607

pii: S0893-6080(21)00264-1

doi: 10.1016/j.neunet.2021.07.001

pii:

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

475-488

Informations de copyright

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Recurrent neural network pruning using dynamical systems and iterative fine-tuning.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Informations de copyright

Déclaration de conflit d'intérêts

Auteurs

Christos Chatzikonstantinou (C)

Dimitrios Konstantinidis (D)

Kosmas Dimitropoulos (K)

Petros Daras (P)

Articles similaires

Unsupervised learning for real-time and continuous gait phase detection.

Detection, classification, and characterization of proximal humerus fractures on plain radiographs.

Editorial: Artificial Intelligence (AI), Digital Image Analysis, and the Future of Cancer Diagnosis and Prognosis.

Deep learning-based automatic image classification of oral cancer cells acquiring chemoresistance in vitro.

Classifications MeSH