Large-scale k-means clustering with user-centric privacy-preservation |
| |
Authors: | Jun Sakuma Shigenobu Kobayashi |
| |
Affiliation: | 1. Department of Computer Science, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8577, Japan 2. Department of Computational Intelligence and Systems Science, Tokyo Institute of Technology, Yokohama, Japan
|
| |
Abstract: | A k-means clustering with a new privacy-preserving concept, user-centric privacy preservation, is presented. In this framework, users can conduct data mining using their private information by storing them in their local storage. After the computation, they obtain only the mining result without disclosing private information to others. In most cases, the number of parties that can join conventional privacy-preserving data mining has been assumed to be only two. In our framework, we assume large numbers of parties join the protocol; therefore, not only scalability but also asynchronism and fault-tolerance is important. Considering this, we propose a k-mean algorithm combined with a decentralized cryptographic protocol and a gossip-based protocol. The computational complexity is O(log n) with respect to the number of parties n, and experimental results show that our protocol is scalable even with one million parties. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|