HMC: Hybrid model compression method based on layer sensitivity grouping.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2023
Historique:
received: 09 06 2023
accepted: 23 09 2023
medline: 1 11 2023
pubmed: 9 10 2023
entrez: 9 10 2023
Statut: epublish

Résumé

Previous studies have shown that deep models are often over-parameterized, and this parameter redundancy makes deep compression possible. The redundancy of model weight is often manifested as low rank and sparsity. Ignoring any part of the two or the different distributions of these two characteristics in the model will lead to low accuracy and a low compression rate of deep compression. To make full use of the difference between low-rank and sparsity, a unified framework combining low-rank tensor decomposition and structured pruning is proposed: a hybrid model compression method based on sensitivity grouping (HMC). This framework unifies the existing additive hybrid compression method (AHC) and the non-additive hybrid compression method (NaHC) proposed by us into one model. The latter group the network according to the sensitivity difference of the convolutional layer to different compression methods, which can better integrate the low rank and sparsity of the model compared with the former. Experiments show that our approach achieves a better trade-off between test accuracy and compression ratio when compressing the ResNet family of models than other recent compression methods using a single strategy or additive hybrid compression.

Identifiants

pubmed: 37812605
doi: 10.1371/journal.pone.0292517
pii: PONE-D-23-17911
pmc: PMC10561844
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0292517

Informations de copyright

Copyright: © 2023 Yang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

IEEE Trans Pattern Anal Mach Intell. 2019 Oct;41(10):2525-2538
pubmed: 30040622
Neural Netw. 2019 Feb;110:104-115
pubmed: 30508807

Auteurs

Guoliang Yang (G)

School of Electrical Engineering, Jiangxi University of Science and Technology, Ganzhou, Jiangxi, China.

Shuaiying Yu (S)

School of Electrical Engineering, Jiangxi University of Science and Technology, Ganzhou, Jiangxi, China.

Hao Yang (H)

School of Electrical Engineering, Jiangxi University of Science and Technology, Ganzhou, Jiangxi, China.

Ziling Nie (Z)

School of Electrical Engineering, Jiangxi University of Science and Technology, Ganzhou, Jiangxi, China.

Jixiang Wang (J)

School of Electrical Engineering, Jiangxi University of Science and Technology, Ganzhou, Jiangxi, China.

Articles similaires

Humans Magnetic Resonance Imaging Artificial Intelligence Female Male
Data Compression Algorithms Computers Computer Simulation Oryza
Algorithms Virtual Reality Video Recording Humans Data Compression
Computer Security Humans Algorithms Diagnostic Imaging Signal-To-Noise Ratio

Classifications MeSH