首页 | 本学科首页   官方微博 | 高级检索  
     


PicShark: mitigating metadata scarcity through large-scale P2P collaboration
Authors:Philippe Cudré-Mauroux  Adriana Budura  Manfred Hauswirth  Karl Aberer
Affiliation:(1) School of Computer and Communication Sciences EPFL, 1010 Lausanne, Switzerland;(2) Digital Enterprise Research Institute, National University of Ireland, Galway, Ireland
Abstract:With the commoditization of digital devices, personal information and media sharing is becoming a key application on the pervasive Web. In such a context, data annotation rather than data production is the main bottleneck. Metadata scarcity represents a major obstacle preventing efficient information processing in large and heterogeneous communities. However, social communities also open the door to new possibilities for addressing local metadata scarcity by taking advantage of global collections of resources. We propose to tackle the lack of metadata in large-scale distributed systems through a collaborative process leveraging on both content and metadata. We develop a community-based and self-organizing system called PicShark in which information entropy—in terms of missing metadata—is gradually alleviated through decentralized instance and schema matching. Our approach focuses on semi-structured metadata and confines computationally expensive operations to the edge of the network, while keeping distributed operations as simple as possible to ensure scalability. PicShark builds on structured Peer-to-Peer networks for distributed look-up operations, but extends the application of self-organization principles to the propagation of metadata and the creation of schema mappings. We demonstrate the practical applicability of our method in an image sharing scenario and provide experimental evidences illustrating the validity of our approach. The work presented in this article was supported by the Swiss NSF National Competence Center in Research on Mobile Information and Communication Systems (NCCR MICS, grant number 5005-67322), by the EPFL Center for Global Computing as part of the European project NEPOMUK No FP6-027705, and by the Líon project supported by Science Foundation Ireland under Grant No. SFI/02/CE1/I131.
Keywords:Metadata scarcity  Metadata heterogeneity  Metadata entropy  Peer-to-Peer collaboration  Peer data management
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号