首页 | 本学科首页   官方微博 | 高级检索  
     

一种面向主题的领域服务聚类方法
引用本文:李 征, 王 健, 张 能, 李 昭, 何成万, 何克清. 一种面向主题的领域服务聚类方法[J]. 计算机研究与发展, 2014, 51(2): 408-419.
作者姓名:李征  王健  张能  李昭  何成万  何克清
作者单位:河南大学计算机与信息工程学院;软件工程国家重点实验室(武汉大学);武汉工程大学计算机科学与工程学院;
基金项目:国家自然科学基金项目(61202031,61100017,61100018,61272115);国家科技支撑计划基金项目(2012BAH07B01);国家云计算示范工程项目(中小企业管理云应用研发与产业化);中央高校基本科研业务费专项资金项目(201121102020004,2012211020201)
摘    要:随着互联网上服务资源规模的快速增长,如何高效、准确地发现服务成为一个亟待解决的关键问题.服务聚类是促进服务发现的一种重要技术.但是,现有服务聚类方法只对单一类型的服务文档进行聚类,并且没有考虑服务的领域特性.针对该问题,在对服务进行领域分类的基础上,提出了一种基于概率、融合领域特性的服务聚类模型——领域服务聚类模型(domain service clustering model, DSCM),然后基于该模型提出了一种面向主题的服务聚类方法.最后通过ProgrammableWeb网站提供的真实服务集对提出的方法进行了验证.实验结果表明,该方法可以准确地对不同类型的服务文档进行聚类.与经典的潜在狄利克雷分配(latent Dirichlet allocation, LDA),K-means等方法相比,该方法在聚类纯度和F-measure指标上均具有更好的效果,从而为按需服务发现与服务组合提供更好的支持.

关 键 词:服务聚类  潜在狄利克雷分配  主题  概率  特征降维

A Topic-Oriented Clustering Approach for Domain Services
Li Zheng, Wang Jian, Zhang Neng, Li Zhao, He Chengwan, He Keqing. A Topic-Oriented Clustering Approach for Domain Services[J]. Journal of Computer Research and Development, 2014, 51(2): 408-419.
Authors:Li Zheng  Wang Jian  Zhang Neng  Li Zhao  He Chengwan  He Keqing
Abstract:With the development of SOA and SaaS technologies, the scale of services on the Internet shows a trend of rapid growth. Faced with the abundant and heterogeneous services, how to efficiently and accurately discover user desired services becomes a key issue in service-oriented software engineering. Services clustering is an important technology to facilitate services discovery. However, the existing clustering approaches are only for a single type of service documents, and they do not consider the domain characteristic of services. To avoid these limitations, on the basis of domain-oriented services classification, this paper proposes a services clustering model named as DSCM based on probability and domain characteristic, and then proposes a topic-oriented clustering approach for domain services based on the DSCM model. The proposed clustering approach can cluster services described in WSDL, OWL-S, and text, which can effectively solve the problem of single service document type. Finally, experiments are conducted on real services from ProgrammableWeb to demonstrate the effectiveness of the proposed approach. Experimental results show that the proposed approach can cluster services more accurately. Compared with the approaches of classical latent Dirichlet allocation (LDA) and K-means, the proposed approach can achieve better in the purity of cluster and F-measure, which can greatly promote on demand services discovery and composition.
Keywords:services clustering  latent Dirichlet allocation  topic  probability  feature dimension reduction
本文献已被 CNKI 等数据库收录!
点击此处可从《计算机研究与发展》浏览原始摘要信息
点击此处可从《计算机研究与发展》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号