Mani Malek, Ilya Mironov, Karthik Prasad, Igor Shilov and Florian Tramèr
Conference on Neural Information Processing Systems (NeurIPS) 2021
We consider the privacy-preserving machine learning (ML) setting where the trained model must satisfy differential privacy (DP) with respect to the labels of the training examples. We propose two novel approaches based on, respectively, the Laplace mechanism and the PATE framework, and demonstrate their effectiveness on standard benchmarks.
While recent work by Ghazi et al. proposed Label DP schemes based on a randomized response mechanism, we argue that additive Laplace noise coupled with Bayesian inference (ALIBI) is a better fit for typical ML tasks. Moreover, we show how to achieve very strong privacy levels in some regimes, with our adaptation of the PATE framework that builds on recent advances in semi-supervised learning.
We complement theoretical analysis of our algorithms’ privacy guarantees with empirical evaluation of their memorization properties. Our evaluation suggests that comparing different algorithms according to their provable DP guarantees can be misleading and favor a less private algorithm with a tighter analysis.
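As a rough illustration of what "additive Laplace noise coupled with Bayesian inference" can look like, the sketch below perturbs a one-hot label with Laplace noise and then computes a posterior over the true label from the noisy vector. It is a minimal sketch and not the authors' implementation. The function names, the uniform default prior, and the noise scale of 2/epsilon (chosen because two distinct one-hot vectors differ by 2 in L1 norm) are assumptions made for this example.

```python
import math
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # is distributed as Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def randomize_label(label, num_classes, epsilon, rng):
    # Hypothetical helper: add Laplace noise to each coordinate of the
    # one-hot encoding. Scale 2/epsilon is an assumption based on the
    # L1 distance of 2 between any two distinct one-hot vectors.
    scale = 2.0 / epsilon
    return [
        (1.0 if c == label else 0.0) + laplace_noise(scale, rng)
        for c in range(num_classes)
    ]


def label_posterior(noisy, epsilon, prior=None):
    # Hypothetical helper: Bayes' rule with a Laplace likelihood,
    # p(c | noisy) proportional to prior[c] * exp(-||noisy - onehot(c)||_1 / scale).
    # The prior must be strictly positive; it defaults to uniform.
    k = len(noisy)
    if prior is None:
        prior = [1.0 / k] * k
    scale = 2.0 / epsilon
    log_post = []
    for c in range(k):
        l1 = sum(abs(noisy[j] - (1.0 if j == c else 0.0)) for j in range(k))
        log_post.append(math.log(prior[c]) - l1 / scale)
    # Normalize in log space for numerical stability.
    top = max(log_post)
    weights = [math.exp(v - top) for v in log_post]
    total = sum(weights)
    return [w / total for w in weights]


if __name__ == "__main__":
    rng = random.Random(0)
    noisy = randomize_label(label=2, num_classes=5, epsilon=1.0, rng=rng)
    print(label_posterior(noisy, epsilon=1.0))
```

The posterior can serve as a soft training target in place of the raw noisy vector. Because it is computed only from the already-privatized output, this step is post-processing and does not weaken the differential privacy guarantee.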
@inproceedings{MMPS+21,
  author       = {Malek, Mani and Mironov, Ilya and Prasad, Karthik and Shilov, Igor and Tram{\`e}r, Florian},
  title        = {Antipodes of Label Differential Privacy: {PATE} and {ALIBI}},
  booktitle    = {Conference on Neural Information Processing Systems (NeurIPS)},
  year         = {2021},
  howpublished = {arXiv preprint arXiv:2106.03408},
  url          = {https://arxiv.org/abs/2106.03408}
}