
Mining of API Usage Pattern Based on Clustering and Partial Order Sequences
Citation: WANG Shu-yi, DONG Dong. Mining of API Usage Pattern Based on Clustering and Partial Order Sequences[J]. Computer Science (计算机科学), 2017, 44(Z6): 486-490
Authors: WANG Shu-yi (王树怡), DONG Dong (董东)
Affiliation: College of Mathematics and Information Science, Hebei Normal University, Shijiazhuang 050024, China (both authors)
Funding: Supported by the Natural Science Foundation of Hebei Province (F2013205192)
Abstract: During software development, developers often need to follow specific API usage patterns, yet few of these patterns are documented. To mine them, this paper proposes an approach based on clustering and frequent closed partial order sequence mining. Source code is first parsed into abstract syntax trees, the extracted API method call sequences are then hierarchically clustered, and finally API usage patterns are mined with the depth-first frequent closed partial order algorithm (DFP). Experiments show that, on the same dataset, the approach yields a more succinct set of candidate API usage patterns than the SPADE and BIDE algorithms.

Keywords: API usage pattern; Sequential pattern mining; Hierarchical clustering; Partial order
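
The first two stages named in the abstract (AST-based extraction of call sequences, then hierarchical clustering) can be illustrated with a minimal sketch. This is not the authors' implementation: the use of Python's `ast` module, the Jaccard distance, single-linkage merging, the `threshold` value, and the helper names `call_sequence`, `jaccard_distance`, and `cluster` are all assumptions made for illustration. The final DFP mining stage is not shown.

```python
import ast
from itertools import combinations


def call_sequence(source):
    """Parse source into an AST and return called names in evaluation order."""
    calls = [n for n in ast.walk(ast.parse(source)) if isinstance(n, ast.Call)]
    # A call that ends earlier in the text is evaluated earlier
    # (inner and chained calls end before the call that encloses them).
    calls.sort(key=lambda n: (n.end_lineno, n.end_col_offset))
    names = []
    for node in calls:
        if isinstance(node.func, ast.Attribute):
            names.append(node.func.attr)   # method call: obj.method()
        elif isinstance(node.func, ast.Name):
            names.append(node.func.id)     # plain call: func()
    return names


def jaccard_distance(a, b):
    """Assumed distance: 1 - |A ∩ B| / |A ∪ B| over the sets of called names."""
    sa, sb = set(a), set(b)
    union = sa | sb
    return 0.0 if not union else 1 - len(sa & sb) / len(union)


def cluster(seqs, threshold=0.6):
    """Single-linkage agglomerative clustering; stops merging above threshold."""
    clusters = [[i] for i in range(len(seqs))]
    while len(clusters) > 1:
        best = None
        for x, y in combinations(range(len(clusters)), 2):
            d = min(jaccard_distance(seqs[i], seqs[j])
                    for i in clusters[x] for j in clusters[y])
            if best is None or d < best[0]:
                best = (d, x, y)
        if best[0] > threshold:
            break
        _, x, y = best
        clusters[x] += clusters[y]   # x < y, so deleting y keeps x valid
        del clusters[y]
    return clusters


# Hypothetical client snippets; they are only parsed, never executed.
snippets = [
    "f = open(p)\ndata = f.read()\nf.close()",
    "f = open(p)\nf.write(s)\nf.close()",
    "conn = connect(db)\ncur = conn.cursor()\ncur.execute(q)\nconn.close()",
]
sequences = [call_sequence(s) for s in snippets]
clusters = cluster(sequences)
print(clusters)
```

In this sketch the two file-handling snippets fall into one cluster and the database snippet into another; a sequence miner would then be run on each cluster separately, which is the motivation for clustering before mining.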