Arbitrary Decisions are a Hidden Cost of Differentially Private Training

Kulynych, Bogdan; Hsu, Hsiang; Troncoso, Carmela; Calmon, Flavio du Pin; ASSOC COMPUTING MACHINERY

doi:10.1145/3593013.3594103

Kulynych, Bogdan; Hsu, Hsiang; Troncoso, Carmela; Calmon, Flavio du Pin; ASSOC COMPUTING MACHINERY

2023

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

Mechanisms used in privacy-preserving machine learning often aim to guarantee differential privacy (DP) during model training. Practical DP-ensuring training methods use randomization when fitting model parameters to privacy-sensitive data (e.g., adding Gaussian noise to clipped gradients). We demonstrate that such randomization incurs predictive multiplicity: for a given input example, the output predicted by equally-private models depends on the randomness used in training. Thus, for a given input, the predicted output can vary drastically if a model is re-trained, even if the same training dataset is used. The predictive-multiplicity cost of DP training has not been studied, and is currently neither audited for nor communicated to model designers and stakeholders. We derive a bound on the number of re-trainings required to estimate predictive multiplicity reliably. We analyze-both theoretically and through extensive experiments-the predictive-multiplicity cost of three DP-ensuring algorithms: output perturbation, objective perturbation, and DP-SGD. We demonstrate that the degree of predictive multiplicity rises as the level of privacy increases, and is unevenly distributed across individuals and demographic groups in the data. Because randomness used to ensure DP during training explains predictions for some examples, our results highlight a fundamental challenge to the justifiability of decisions supported by differentially-private models in high-stakes settings. We conclude that practitioners should audit the predictive multiplicity of their DP-ensuring algorithms before deploying them in applications of individual-level consequence.

Détails

Titre Arbitrary Decisions are a Hidden Cost of Differentially Private Training

Auteur(s) Kulynych, Bogdan ; Hsu, Hsiang ; Troncoso, Carmela ; Calmon, Flavio du Pin ; ASSOC COMPUTING MACHINERY

Publié dans Proceedings Of The 6Th Acm Conference On Fairness, Accountability, And Transparency, Facct 2023

Pages 1609-1623

Présenté à 6th ACM Conference on Fairness, Accountability, and Transparency (FAccT), JUN 12-15, 2023, Chicago, IL

Date 2023-01-01

Editeur Assoc Computing Machinery, New York

ISBN 978-1-4503-7252-7

DOI https://doi.org/10.1145/3593013.3594103

Autres identifiant(s) Afficher la publication dans Web of Science

Laboratoires SPRING

Le document apparaît dans Production scientifique et compétences > I&C - Faculté Informatique & Communications > IINFCOM > SPRING - Laboratoire d'Ingéniérie de Sécurité et Privacy
Publications validées par des pairs
Papiers de conférence
Travail produit à l'EPFL
Publié

Grant Swiss National Science Foundation: 200021-188824
US National Science Foundation: CAREER 1845852
Swiss National Science Foundation (SNF): 200021_188824

Date de création de la notice 2024-02-14