Initializing photonic feed-forward neural networks using auxiliary tasks.

Neural network initialization Photonic activation functions Photonic deep learning

Journal

Neural networks : the official journal of the International Neural Network Society
ISSN: 1879-2782
Titre abrégé: Neural Netw
Pays: United States
ID NLM: 8805018

Informations de publication

Date de publication:
Sep 2020
Historique:
received: 19 11 2019
revised: 15 05 2020
accepted: 21 05 2020
pubmed: 7 6 2020
medline: 28 11 2020
entrez: 7 6 2020
Statut: ppublish

Résumé

Photonics is among the most promising emerging technologies for providing fast and energy-efficient Deep Learning (DL) implementations. Despite their advantages, these photonic DL accelerators also come with certain important limitations. For example, the majority of existing photonic accelerators do not currently support many of the activation functions that are commonly used in DL, such as the ReLU activation function. Instead, sinusoidal and sigmoidal nonlinearities are usually employed, rendering the training process unstable and difficult to tune, mainly due to vanishing gradient phenomena. Thus, photonic DL models usually require carefully fine-tuning all their training hyper-parameters in order to ensure that the training process will proceed smoothly. Despite the recent advances in initialization schemes, as well as in optimization algorithms, training photonic DL models is still especially challenging. To overcome these limitations, we propose a novel adaptive initialization method that employs auxiliary tasks to estimate the optimal initialization variance for each layer of a network. The effectiveness of the proposed approach is demonstrated using two different datasets, as well as two recently proposed photonic activation functions and three different initialization methods. Apart from significantly increasing the stability of the training process, the proposed method can be directly used with any photonic activation function, without further requiring any other kind of fine-tuning, as also demonstrated through the conducted experiments.

Identifiants

pubmed: 32504819
pii: S0893-6080(20)30194-5
doi: 10.1016/j.neunet.2020.05.024
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

103-108

Informations de copyright

Copyright © 2020 Elsevier Ltd. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Nikolaos Passalis (N)

Artificial Intelligence and Information Analysis Laboratory, Aristotle University of Thessaloniki, Greece. Electronic address: passalis@csd.auth.gr.

George Mourgias-Alexandris (G)

Photonic Systems and Networks Research Group, Department of Informatics, Aristotle University of Thessaloniki, Greece.

Nikos Pleros (N)

Photonic Systems and Networks Research Group, Department of Informatics, Aristotle University of Thessaloniki, Greece.

Anastasios Tefas (A)

Artificial Intelligence and Information Analysis Laboratory, Aristotle University of Thessaloniki, Greece.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Humans Breast Neoplasms Female Deep Learning Ultrasonography, Mammary
Humans Deep Learning Mouth Neoplasms Drug Resistance, Neoplasm Cell Line, Tumor
Humans Incidence Deep Learning China Hepatitis E

Classifications MeSH