Abstract

One of the most classical results in high-dimensional learning theory provides a closed-form expression for the generalisation error of binary classification with a single-layer teacher-student perceptron on i.i.d. Gaussian inputs. Both Bayes-optimal (BO) estimation and empirical risk minimisation (ERM) have been extensively analysed in this setting. At the same time, a considerable part of modern machine learning practice concerns multi-class classification, yet an analogous analysis for the multi-class teacher-student perceptron has been missing. In this manuscript we fill this gap by deriving and evaluating asymptotic expressions for the BO and ERM generalisation errors in the high-dimensional regime. For a Gaussian teacher, we investigate the performance of ERM with both cross-entropy and square losses, and explore the role of ridge regularisation in approaching Bayes-optimality. In particular, we observe that regularised cross-entropy minimisation yields close-to-optimal test accuracy. In contrast, for a Rademacher teacher we show that a first-order phase transition arises in the BO performance.
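
To make the setting concrete, the following Python sketch simulates the Gaussian-teacher multi-class perceptron and estimates the generalisation error of ridge-regularised cross-entropy ERM empirically. It is a minimal illustration, not the paper's asymptotic analysis: the dimensions, sample ratio, and regularisation strength below are illustrative choices.

```python
# Minimal simulation of the multi-class teacher-student perceptron on
# i.i.d. Gaussian inputs (all numerical choices are illustrative).
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

d, K = 200, 3            # input dimension, number of classes
alpha = 3.0              # sample ratio n / d
n_train = int(alpha * d)
n_test = 10_000

# Gaussian teacher: the label is the argmax of K linear scores of the input.
W_star = rng.standard_normal((K, d)) / np.sqrt(d)

def sample(n):
    X = rng.standard_normal((n, d))       # i.i.d. Gaussian inputs
    y = np.argmax(X @ W_star.T, axis=1)   # teacher's argmax rule
    return X, y

X_tr, y_tr = sample(n_train)
X_te, y_te = sample(n_test)

# Ridge-regularised cross-entropy minimisation (ERM); in scikit-learn's
# parametrisation, C is the inverse of the L2 regularisation strength.
clf = LogisticRegression(C=1.0, max_iter=2000).fit(X_tr, y_tr)

gen_error = np.mean(clf.predict(X_te) != y_te)
print(f"empirical generalisation error: {gen_error:.3f}")
```

Sweeping the regularisation strength (here via C) in such a simulation gives a finite-size counterpart of the role of ridge regularisation in approaching Bayes-optimality discussed above.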