A Data- and Workload-aware Algorithm for Range Queries Under Differential Privacy

  • Chao Li ,
  • Michael Hay ,
  • Gerome Miklau ,

International Conference on Very Large Data Bases (PVLDB) | , Vol 7: pp. 341-352

Publication | Publication

We describe a new algorithm for answering a given set of range queries under ε-differential privacy which often achieves substantially lower error than competing methods. Our algorithm satisfies differential privacy by adding noise that is adapted to the input data and to the given query set. We first privately learn a partitioning of the domain into buckets that suit the input data well. Then we privately estimate counts for each bucket, doing so in a manner well-suited for the given query set. Since the performance of the algorithm depends on the input database, we evaluate it on a wide range of real datasets, showing that we can achieve the benefits of data-dependence on both “easy” and “hard” databases.