首页 | 本学科首页   官方微博 | 高级检索  
     

面向服务的云数据挖掘引擎的研究
引用本文:余永红,向晓军,高阳,商琳,杨育彬.面向服务的云数据挖掘引擎的研究[J].计算机科学与探索,2012,6(1):46-57.
作者姓名:余永红  向晓军  高阳  商琳  杨育彬
作者单位:1. 南京大学计算机软件新技术国家重点实验室,南京210093;南京邮电大学通达学院,南京210003
2. 南京大学计算机软件新技术国家重点实验室,南京,210093
基金项目:国家自然科学基金),科技部国际科技合作项目),江苏省自然科学基金)
摘    要:数据挖掘算法处理海量数据时,扩展性受到制约。在商业和科学研究的各个领域,知识发现的过程和需求差异较大,需要有效的机制来设计和运行各种类型的分布式数据挖掘应用。提出了一种面向服务的云数据挖掘引擎的框架CloudDM。不同于基于网格的分布式数据挖掘框架,CloudDM利用开源云计算平台Hadoop处理海量数据的能力,以面向服务的形式支持分布式数据挖掘应用的设计和运行,并描述面向服务的云数据挖掘引擎系统的关键部件和实现技术。依据面向服务的软件体系结构和基于云平台的数据挖掘引擎,可以有效解决海量数据挖掘中的海量数据存储、数据处理和数据挖掘算法互操作性等问题。

关 键 词:云计算  Hadoop  数据挖掘  面向服务的体系结构(SOA)
修稿时间: 

Research on Service-Oriented Data Mining Engine Based on Cloud Computing
YU Yonghong , XIANG Xiaojun , GAO Yang , SHANG Lin , YANG Yubin.Research on Service-Oriented Data Mining Engine Based on Cloud Computing[J].Journal of Frontier of Computer Science and Technology,2012,6(1):46-57.
Authors:YU Yonghong  XIANG Xiaojun  GAO Yang  SHANG Lin  YANG Yubin
Affiliation:1. State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China 2. College of Tongda, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
Abstract:The scalability of data mining algorithms is restricted when dealing with large-scale data. There are sig-nificant differences in a wide range of application areas and requirements for knowledge discovery process. It is fundamental to provide effective formalisms to design distributed data mining application and support their efficient execution. This paper proposes a novel service-oriented data minging engine based on cloud computing framework,which is named as CloudDM. Differentiating from grid-based distributed data mining framework,CloudDM ex-ploits the capacity of open source cloud computing platform-Hadoop for large-scale data analysis,supports the design and execution of distributed data mining applications according to SOA(service-oriented architecture) . Moreover,it discusses and reports the key component functions and implementation technologies. According to the design principles of SOA and data mining engine based on cloud computing,the paper can solve the problems in massive data mining systems,such as big data storage,data processing and interactive operation of algorithms,etc.
Keywords:cloud computing  Hadoop  data mining  service-oriented architecture (SOA)
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机科学与探索》浏览原始摘要信息
点击此处可从《计算机科学与探索》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号