Abstract

Artificial Neural Networks (ANNs) are typically trained via the back-propagation (BP) algorithm. This approach has been extremely successful: current models like GPT-3 have O(10^11) parameters, are trained on O(10^11) words, and produce awe-inspiring results. However, there are good reasons to look for alternative training methods: with current algorithms and hardware, sometimes only half of the available computing power is actually used. This is due to a complicated interplay between the size of the ANN, the available memory, the throughput limitations of interconnects, the architecture of the network of computers, and the training algorithm. Training a model like the aforementioned GPT-3 takes months and costs millions of dollars. A different training paradigm, one that could make clever use of specialized hardware, might train large ANNs more efficiently.

Details