On the efficiency of user identification: a system-based approach |
| |
Authors: | Apostolos Malatras Dimitris Geneiatakis Ioannis Vakalis |
| |
Affiliation: | 1.European Commission, Joint Research Centre (JRC),Institute for the Protection and Security of the Citizen,Ispra,Italy;2.Electrical and Computer Engineering Department,Aristotle University of Thessaloniki,Thessaloniki,Greece |
| |
Abstract: | In the Internet era, users’ fundamental privacy and anonymity rights have received significant research and regulatory attention. This is not only a result of the exponential growth of data that users generate when accomplishing their daily task by means of computing devices with advanced capabilities, but also because of inherent data properties that allow them to be linked with a real or soft identity. Service providers exploit these facts for user monitoring and identification, albeit impacting users’ anonymity, based mainly on personal identifiable information or on sensors that generate unique data to provide personalized services. In this paper, we report on the feasibility of user identification using general system features like memory, CPU and network data, as provided by the underlying operating system. We provide a general framework based on supervised machine learning algorithms both for distinguishing users and informing them about their anonymity exposure. We conduct a series of experiments to collect trial datasets for users’ engagement on a shared computing platform. We evaluate various well-known classifiers in terms of their effectiveness in distinguishing users, and we perform a sensitivity analysis of their configuration setup to discover optimal settings under diverse conditions. Furthermore, we examine the bounds of sampling data to eliminate the chances of user identification and thus promote anonymity. Overall results show that under certain configurations users’ anonymity can be preserved, while in other cases users’ identification can be inferred with high accuracy, without relying on personal identifiable information. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|