Correlated differential privacy for non-IID datasets
Version 2 2024-06-04, 01:52Version 2 2024-06-04, 01:52
Version 1 2017-09-04, 21:02Version 1 2017-09-04, 21:02
chapter
posted on 2024-06-04, 01:52authored byT Zhu, Gang LiGang Li, W Zhou, PS Yu
Most previous work on differential privacy mainly focused on independent datasets, assuming that all records were sampled from a universe independently. However, in a real-world, many datasets contain strong coupling relations where some records are often correlated with each other. When such datasets are released, the definition of differential privacy will be violated as an adversary has a higher chance to obtain sensitive information. Hence, it is critical to find effective solutions to preserve rigorous differential privacy with correlated datasets. This chapter first formally defines the correlated differential privacy problem and outlines the research issues and challenges in providing privacy guarantees for correlated datasets. Then it presents an innovative solution to solve the correlated differential privacy problem and shows that the solution is robust and effective.