Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators.
Journal
Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555
Informations de publication
Date de publication:
30 Aug 2023
30 Aug 2023
Historique:
received:
16
02
2023
accepted:
08
08
2023
medline:
31
8
2023
pubmed:
31
8
2023
entrez:
30
8
2023
Statut:
epublish
Résumé
Analog in-memory computing-a promising approach for energy-efficient acceleration of deep learning workloads-computes matrix-vector multiplications but only approximately, due to nonidealities that often are non-deterministic or nonlinear. This can adversely impact the achievable inference accuracy. Here, we develop an hardware-aware retraining approach to systematically examine the accuracy of analog in-memory computing across multiple network topologies, and investigate sensitivity and robustness to a broad set of nonidealities. By introducing a realistic crossbar model, we improve significantly on earlier retraining approaches. We show that many larger-scale deep neural networks-including convnets, recurrent networks, and transformers-can in fact be successfully retrained to show iso-accuracy with the floating point implementation. Our results further suggest that nonidealities that add noise to the inputs or outputs, not the weights, have the largest impact on accuracy, and that recurrent networks are particularly robust to all nonidealities.
Identifiants
pubmed: 37648721
doi: 10.1038/s41467-023-40770-4
pii: 10.1038/s41467-023-40770-4
pmc: PMC10469175
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
5282Informations de copyright
© 2023. Springer Nature Limited.
Références
Science. 2019 May 10;364(6440):570-574
pubmed: 31023890
Nature. 2020 Jan;577(7792):641-646
pubmed: 31996818
Nature. 2022 Aug;608(7923):504-512
pubmed: 35978128
IEEE Trans Neural Netw Learn Syst. 2018 Oct;29(10):4782-4790
pubmed: 29990267
Front Neurosci. 2019 Jul 30;13:753
pubmed: 31417340
Front Comput Neurosci. 2021 Jul 05;15:675741
pubmed: 34290595
Front Neurosci. 2016 Jul 21;10:333
pubmed: 27493624
Nat Nanotechnol. 2020 Jul;15(7):529-544
pubmed: 32231270
Nature. 2022 Jan;601(7894):549-555
pubmed: 35082422
Nat Commun. 2022 Jun 30;13(1):3765
pubmed: 35773285
Nat Commun. 2020 May 18;11(1):2473
pubmed: 32424184
Nature. 2018 Jun;558(7708):60-67
pubmed: 29875487
Science. 2022 Jul 29;377(6605):539-543
pubmed: 35901152