
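Research on a Mapreduce-based method for extracting large-scale social networks*
Cite this article: SHI Quan, XIAO Yang-hua, WEN Wen-hao, ZHU Qian-qian, WANG Heng-shan. Research on a Mapreduce-based method for extracting large-scale social networks*[J]. Application Research of Computers, 2011, 28(1): 145-148.
Authors: SHI Quan  XIAO Yang-hua  WEN Wen-hao  ZHU Qian-qian  WANG Heng-shan
Affiliations: 1. School of Management, University of Shanghai for Science and Technology, Shanghai 200093; School of Computer Science and Technology, Nantong University, Nantong, Jiangsu 226019
2. School of Computer Science, Fudan University, Shanghai 200433
3. School of Management, University of Shanghai for Science and Technology, Shanghai 200093
Funding: National Natural Science Foundation of China (61003001, 71071098); Natural Science Foundation of Jiangsu Province (BK2009153, BK2010280); Nantong Science and Technology Program (K2008018, K2008031)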
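Abstract: Extracting large-scale, high-quality social networks from massive, non-standardized Web data sources has broad application prospects and considerable academic value, but it also faces the major challenge posed by massive computation. To address this, the paper takes the Digg news commenting site as its information source and the extraction of the common-interest network among the site's users as its main goal. It proposes a framework for a social network extraction system built on a cloud platform and implements a Mapreduce-based method for extracting large-scale social networks. Experimental results show that the proposed method has good scalability and elasticity and can handle the computational task of extracting high-quality, large-scale social networks from heterogeneous Web data sources.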
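The abstract does not spell out the algorithm, so the following is only a minimal sketch of how a common-interest network could be built in the MapReduce style, not the authors' method. It assumes that two users share an interest when they commented on the same story, and that the input is a list of hypothetical `(user, story)` comment records. The `run_job` helper is an in-memory stand-in for a real MapReduce runtime, and every function name here is illustrative.

```python
from collections import defaultdict
from itertools import combinations


def run_job(records, mapper, reducer):
    """In-memory stand-in for a MapReduce runtime: map, group by key, reduce."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    output = []
    for key in sorted(groups):
        output.extend(reducer(key, groups[key]))
    return output


def map_comment(record):
    """Job 1 map: key each comment by story so co-commenters reach one reducer."""
    user, story = record
    yield story, user


def reduce_story(story, users):
    """Job 1 reduce: emit (user pair, 1) for each pair of distinct users on a story."""
    for a, b in combinations(sorted(set(users)), 2):
        yield (a, b), 1


def map_pair(record):
    """Job 2 map: pass each (user pair, 1) record through unchanged."""
    yield record


def reduce_pair(pair, counts):
    """Job 2 reduce: the edge weight is the number of stories the pair shares."""
    yield pair, sum(counts)


def extract_network(comments, min_shared=1):
    """Chain the two jobs and keep edges with at least min_shared common stories."""
    pair_counts = run_job(comments, map_comment, reduce_story)
    edges = run_job(pair_counts, map_pair, reduce_pair)
    return {pair: weight for pair, weight in edges if weight >= min_shared}


if __name__ == "__main__":
    # Hypothetical comment log: (user, story) pairs.
    comments = [
        ("alice", "s1"), ("bob", "s1"), ("carol", "s1"),
        ("alice", "s2"), ("bob", "s2"), ("bob", "s2"),
    ]
    print(extract_network(comments, min_shared=2))
```

Splitting the work into two jobs keeps each reducer's state small: the first groups by story, the second groups by user pair. On a real cluster the pair-generation step is the costly one, since a story with n commenters yields n(n-1)/2 pairs, so a production version would likely need to cap or sample very popular stories.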

Keywords: social network extraction  relation extraction  cloud computing  Mapreduce  social network

Research on method for extracting large-scale social network based on Mapreduce
SHI Quan, XIAO Yang-hua, WEN Wen-hao, ZHU Qian-qian, WANG Heng-shan. Research on method for extracting large-scale social network based on Mapreduce[J]. Application Research of Computers, 2011, 28(1): 145-148.
Authors: SHI Quan  XIAO Yang-hua  WEN Wen-hao  ZHU Qian-qian  WANG Heng-shan
Affiliation: SHI Quan1,2, XIAO Yang-hua3, WEN Wen-hao3, ZHU Qian-qian3, WANG Heng-shan1 (1. School of Management, University of Shanghai for Science & Technology, Shanghai 200093, China; 2. School of Computer Science & Technology, Nantong University, Nantong Jiangsu 226019; 3. School of Computer Science, Fudan University, Shanghai 200433, China)
Abstract: Extracting large-scale social networks from massive heterogeneous Web data is of both theoretical and practical significance. However, a defining feature of this task is its large-scale computation, which remains a major challenge. Cloud computing platforms offer a new opportunity to overcome it. Hence, this work investigates methods for extracting large social networks from Web data using cloud computing techniques. Specifically, proposed a Mapredu...
Keywords: social network extraction  relation extraction  cloud computing  Mapreduce  social network
This article is indexed in CNKI, Wanfang Data, and other databases.