Differentially private query learning: from data publishing to model publishing
Version 2 2024-06-04, 01:53
Version 1 2017-10-25, 19:56
conference contribution
posted on 2024-06-04, 01:53, authored by T Zhu, P Xiong, Gang Li, W Zhou, PS Yu
As one of the most influential privacy definitions, differential privacy provides a rigorous and provable privacy guarantee for data publishing. In the Big Data era, however, the curator has to release a large number of queries in a batch or a synthetic dataset. Two challenges need to be tackled: one is how to decrease the correlation between large sets of queries, and the other is how to predict results for fresh queries. This paper transforms the data publishing problem into a machine learning problem, in which queries are considered as training samples and a prediction model is released rather than query results or synthetic datasets. Once the model is published, it can be used both to answer the currently submitted queries and to predict results for fresh queries from the public. Compared with traditional methods, the proposed prediction model enhances the accuracy of query results for non-interactive publishing. We prove that the learning model can retain the utility of published queries while preserving privacy.
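As a rough illustration of the idea described above (not the paper's actual algorithm), the sketch below treats a batch of range-counting queries as training samples: each submitted query is answered once under the Laplace mechanism, a simple least-squares model is fit on the (query, noisy answer) pairs, and only that model is published to answer both the submitted queries and fresh, unseen ones. The toy dataset, the query encoding, the budget split, and the linear model are all assumptions made purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy dataset: ages of 1,000 individuals (not from the paper).
data = rng.integers(18, 90, size=1000)

def range_count_query(lo, hi):
    """Counting query: how many records fall in [lo, hi)."""
    return np.sum((data >= lo) & (data < hi))

def laplace_mechanism(true_answer, sensitivity, epsilon):
    """Standard Laplace mechanism for epsilon-differential privacy."""
    return true_answer + rng.laplace(scale=sensitivity / epsilon)

# Batch of submitted queries, encoded by their (lo, hi) endpoints.
# These play the role of the "training samples" in the abstract.
queries = [(lo, lo + 10) for lo in range(18, 80, 2)]
epsilon_total = 1.0
epsilon_per_query = epsilon_total / len(queries)  # naive sequential composition

X = np.array(queries, dtype=float)                # query features
y = np.array([laplace_mechanism(range_count_query(lo, hi), 1.0, epsilon_per_query)
              for lo, hi in queries])             # noisy answers; a count has sensitivity 1

# Fit a simple linear model on (query, noisy answer) pairs.
# The model coefficients, not the noisy answers, are what gets published.
X_design = np.hstack([X, np.ones((len(X), 1))])
coef, *_ = np.linalg.lstsq(X_design, y, rcond=None)

def predict(lo, hi):
    """Answer a query (including a fresh one) from the published model."""
    return np.array([lo, hi, 1.0]) @ coef

# A fresh query that was never submitted to the curator:
print("model answer for [40, 55):", predict(40, 55))
print("true answer for [40, 55): ", range_count_query(40, 55))
```

Because the model is trained only on answers that are already differentially private, publishing it consumes no additional privacy budget (post-processing); the paper's contribution lies in showing that such a released model can retain the utility of the published queries and generalize to fresh ones.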