Optimised weight programming for analogue memory-based deep neural networks.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
30 06 2022
Historique:
received: 28 10 2021
accepted: 09 06 2022
entrez: 30 6 2022
pubmed: 1 7 2022
medline: 6 7 2022
Statut: epublish

Résumé

Analogue memory-based deep neural networks provide energy-efficiency and per-area throughput gains relative to state-of-the-art digital counterparts such as graphics processing units. Recent advances focus largely on hardware-aware algorithmic training and improvements to circuits, architectures, and memory devices. Optimal translation of software-trained weights into analogue hardware weights-given the plethora of complex memory non-idealities-represents an equally important task. We report a generalised computational framework that automates the crafting of complex weight programming strategies to minimise accuracy degradations during inference, particularly over time. The framework is agnostic to network structure and generalises well across recurrent, convolutional, and transformer neural networks. As a highly flexible numerical heuristic, the approach accommodates arbitrary device-level complexity, making it potentially relevant for a variety of analogue memories. By quantifying the limit of achievable inference accuracy, it also enables analogue memory-based deep neural network accelerators to reach their full inference potential.

Identifiants

pubmed: 35773285
doi: 10.1038/s41467-022-31405-1
pii: 10.1038/s41467-022-31405-1
pmc: PMC9247051
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

3765

Informations de copyright

© 2022. The Author(s).

Références

Science. 2019 May 10;364(6440):570-574
pubmed: 31023890
Nature. 2015 May 28;521(7553):436-44
pubmed: 26017442
IEEE Trans Neural Netw Learn Syst. 2018 Oct;29(10):4782-4790
pubmed: 29990267
Nat Mater. 2019 Apr;18(4):309-323
pubmed: 30894760
Nature. 2016 Jan 28;529(7587):484-9
pubmed: 26819042
Front Comput Neurosci. 2021 Jul 05;15:675741
pubmed: 34290595
Nature. 2021 Mar;591(7848):22-25
pubmed: 33658699
Nature. 2018 Jun;558(7708):60-67
pubmed: 29875487
Nat Nanotechnol. 2020 Jul;15(7):529-544
pubmed: 32231270
Nat Commun. 2020 May 18;11(1):2473
pubmed: 32424184

Auteurs

Charles Mackin (C)

IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA. charles.mackin@ibm.com.

Malte J Rasch (MJ)

IBM Research-Yorktown Heights, 1101 Kitchawan Road, Yorktown Heights, NY, USA.

An Chen (A)

IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.

Jonathan Timcheck (J)

Stanford University, 450 Serra Mall, Stanford, CA, 94305, USA.

Robert L Bruce (RL)

IBM Research-Yorktown Heights, 1101 Kitchawan Road, Yorktown Heights, NY, USA.

Ning Li (N)

IBM Research-Yorktown Heights, 1101 Kitchawan Road, Yorktown Heights, NY, USA.

Pritish Narayanan (P)

IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.

Stefano Ambrogio (S)

IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.

Manuel Le Gallo (M)

IBM Research-Zurich, Säumerstrasse 4, 8803, Rüschlikon, Switzerland.

S R Nandakumar (SR)

IBM Research-Zurich, Säumerstrasse 4, 8803, Rüschlikon, Switzerland.

Andrea Fasoli (A)

IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.

Jose Luquin (J)

IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.

Alexander Friz (A)

IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.

Abu Sebastian (A)

IBM Research-Zurich, Säumerstrasse 4, 8803, Rüschlikon, Switzerland.

Hsinyu Tsai (H)

IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.

Geoffrey W Burr (GW)

IBM Research-Almaden, 650 Harry Road, San Jose, CA, USA.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH