Abstract

Recently there has been a surge of interest in understanding the implicit regularization properties of iterative gradient-based optimization algorithms. In this paper, we study the statistical guarantees on the excess risk achieved by early-stopped unconstrained mirror descent algorithms applied to the unregularized empirical risk. We consider the setting of learning linear models and kernel methods for strongly convex and Lipschitz loss functions, while imposing only boundedness conditions on the unknown data-generating mechanism. By completing an inequality that characterizes convexity for the squared loss, we identify an intrinsic link between offset Rademacher complexities and potential-based convergence analysis of mirror descent methods. Our observation immediately yields excess risk guarantees for the path traced by the iterates of mirror descent in terms of offset complexities of certain function classes depending only on the choice of the mirror map, initialization point, step size, and the number of iterations. We apply our theory to recover, in a clean and elegant manner via rather short proofs, some of the recent results in the implicit regularization literature, while also showing how to improve upon them in some settings.
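
To make the algorithmic setup concrete, the following is a minimal sketch (not the authors' code) of unconstrained mirror descent on the unregularized empirical risk with the squared loss for a linear model, stopped after a fixed number of iterations. The function name `mirror_descent_path`, the interface via `grad_psi` / `grad_psi_star`, and all hyperparameter values are illustrative assumptions rather than quantities fixed by the paper.

```python
import numpy as np

def mirror_descent_path(X, y, grad_psi, grad_psi_star, w0, step_size, num_iters):
    """Return the path (w_0, ..., w_T) of unconstrained mirror descent iterates
    on the empirical squared-loss risk R_hat(w) = (1/2n) * ||X w - y||^2.

    grad_psi      -- gradient of the mirror map psi
    grad_psi_star -- gradient of its convex conjugate psi*
    """
    n = X.shape[0]
    w = w0.copy()
    theta = grad_psi(w)                 # dual (mirror) representation of the iterate
    path = [w.copy()]
    for _ in range(num_iters):
        grad = X.T @ (X @ w - y) / n    # gradient of the unregularized empirical risk
        theta = theta - step_size * grad  # update in the dual space
        w = grad_psi_star(theta)        # map back to the primal space
        path.append(w.copy())
    return path

# Euclidean mirror map psi(w) = 0.5 * ||w||^2: mirror descent reduces to gradient
# descent; other mirror maps (e.g., entropic) plug into the same interface.
euclidean = (lambda w: w, lambda theta: theta)

# Illustrative usage on synthetic data. "Early stopping" here means selecting an
# iterate along the returned path instead of running the method to convergence.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 50))
w_star = np.zeros(50); w_star[:5] = 1.0
y = X @ w_star + 0.1 * rng.standard_normal(200)
path = mirror_descent_path(X, y, *euclidean, w0=np.zeros(50),
                           step_size=0.01, num_iters=300)
```

The sketch only illustrates how the path of iterates depends on the mirror map, initialization point, step size, and number of iterations, which are the quantities the offset-complexity bounds in the abstract are stated in terms of.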
