Initializing photonic feed-forward neural networks using auxiliary tasks.

Deep Learning Photons

Neural network initialization Photonic activation functions Photonic deep learning

Journal

Neural networks : the official journal of the International Neural Network Society

ISSN: 1879-2782

Titre abrégé: Neural Netw

Pays: United States

ID NLM: 8805018

Informations de publication

Date de publication:
Sep 2020

Historique:

received: 19 11 2019

revised: 15 05 2020

accepted: 21 05 2020

pubmed: 7 6 2020

medline: 28 11 2020

entrez: 7 6 2020

Statut: ppublish

Résumé

Photonics is among the most promising emerging technologies for providing fast and energy-efficient Deep Learning (DL) implementations. Despite their advantages, these photonic DL accelerators also come with certain important limitations. For example, the majority of existing photonic accelerators do not currently support many of the activation functions that are commonly used in DL, such as the ReLU activation function. Instead, sinusoidal and sigmoidal nonlinearities are usually employed, rendering the training process unstable and difficult to tune, mainly due to vanishing gradient phenomena. Thus, photonic DL models usually require carefully fine-tuning all their training hyper-parameters in order to ensure that the training process will proceed smoothly. Despite the recent advances in initialization schemes, as well as in optimization algorithms, training photonic DL models is still especially challenging. To overcome these limitations, we propose a novel adaptive initialization method that employs auxiliary tasks to estimate the optimal initialization variance for each layer of a network. The effectiveness of the proposed approach is demonstrated using two different datasets, as well as two recently proposed photonic activation functions and three different initialization methods. Apart from significantly increasing the stability of the training process, the proposed method can be directly used with any photonic activation function, without further requiring any other kind of fine-tuning, as also demonstrated through the conducted experiments.

Identifiants

DOI: 10.1016/j.neunet.2020.05.024 PMID: 32504819

pubmed: 32504819

pii: S0893-6080(20)30194-5

doi: 10.1016/j.neunet.2020.05.024

pii:

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

103-108

Informations de copyright

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Initializing photonic feed-forward neural networks using auxiliary tasks.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Informations de copyright

Déclaration de conflit d'intérêts

Auteurs

Nikolaos Passalis (N)

George Mourgias-Alexandris (G)

Nikos Pleros (N)

Anastasios Tefas (A)

Articles similaires

Exploring structural diversity across the protein universe with The Encyclopedia of Domains.

ACL-DUNet: A tumor segmentation method based on multiple attention and densely connected breast ultrasound images.

Deep learning-based automatic image classification of oral cancer cells acquiring chemoresistance in vitro.

Deep learning models for hepatitis E incidence prediction leveraging Baidu index.

Classifications MeSH