Part of Advances in Neural Information Processing Systems 37 (NeurIPS 2024) Main Conference Track
Hao WU, Hanwen Zhang
We study the differentially private top-k selection problem, aiming to identify a sequence of k items with approximately the highest scores from d items. Recent work by Gillenwater et al. (2022) employs a direct sampling approach from the vast collection of O(dk) possible length-k sequences, showing superior empirical accuracy compared to previous pure or approximate differentially private methods. Their algorithm has a time and space complexity of ˜O(dk). In this paper, we present an improved algorithm that achieves time and space complexity of ˜O(d+k2).Experimental results show that our algorithm runs orders of magnitude faster than their approach, while achieving similar empirical accuracy.