Privacy-preserving indexing of documents on the network |
| |
Authors: | Mayank Bawa Roberto J. Bayardo Jr Rakesh Agrawal Jaideep Vaidya |
| |
Affiliation: | (1) Aster Data Systems, Redwood City, CA 94065, USA;(2) Google, Inc., Mountain View, CA 94043, USA;(3) Microsoft Search Labs, Mountain View, CA 94043, USA;(4) Rutgers University, Newark, NJ 07102, USA |
| |
Abstract: | With the ubiquitous collection of data and creation of large distributed repositories, enabling search over this data while respecting access control is critical. A related problem is that of ensuring privacy of the content owners while still maintaining an efficient index of distributed content. We address the problem of providing privacy-preserving search over distributed access-controlled content. Indexed documents can be easily reconstructed from conventional (inverted) indexes used in search. Currently, the need to avoid breaches of access-control through the index requires the index hosting site to be fully secured and trusted by all participating content providers. This level of trust is impractical in the increasingly common case where multiple competing organizations or individuals wish to selectively share content. We propose a solution that eliminates the need of such a trusted authority. The solution builds a centralized privacy-preserving index in conjunction with a distributed access-control enforcing search protocol. Two alternative methods to build the centralized index are proposed, allowing trade offs of efficiency and security. The new index provides strong and quantifiable privacy guarantees that hold even if the entire index is made public. Experiments on a real-life dataset validate performance of the scheme. The appeal of our solution is twofold: (a) content providers maintain complete control in defining access groups and ensuring its compliance, and (b) system implementors retain tunable knobs to balance privacy and efficiency concerns for their particular domains. Dr. Vaidya’s work was supported by the National Science Foundation under grant CNS-0746943 and by a research resources grant from Rutgers Business School, Newark and New Brunswick. |
| |
Keywords: | Privacy Indexing Distributed search |
本文献已被 SpringerLink 等数据库收录! |
|