首页 | 本学科首页   官方微博 | 高级检索  
     

一种面向中医药领域的二维最大频繁项集挖掘算法
引用本文:肖文,鞠时光,朱金伟,辛燕,刘志锋.一种面向中医药领域的二维最大频繁项集挖掘算法[J].小型微型计算机系统,2007,28(12):2193-2198.
作者姓名:肖文  鞠时光  朱金伟  辛燕  刘志锋
作者单位:江苏大学,计算机学院,江苏,镇江,212013
基金项目:江苏省自然科学基金;江苏大学校科研和教改项目
摘    要:在中医药领域挖掘药组频繁项集时发现,尽管有些项集的支持度比人们需要的频繁项集的支持度高很多,但这些项集并不是人们感兴趣的,即过分频繁反而变得平凡.本文引入支持度区间的概念,提出了适合中药数据挖掘的二维TCM-FP森林结构及其建树算法.在针对疾病症状的中药药组挖掘过程中,采用优化的搜索策略开发了基于支持度区间的TCMA维间最大频繁项集挖掘算法.这种算法既缩小了挖掘的范围又提高了规则的意义,并且具有较高的执行效率.

关 键 词:数据挖掘  中药  维间关联规则  最大频繁项集
文章编号:1000-1220(2007)12-2193-06
收稿时间:2006-08-08
修稿时间:2006年8月8日

Two-dimensional Maximal Frequent Itemset Mining Algorithm in Traditional Chinese Medicine Field
XIAO Wen,JU Shi-guang,ZHU Jin-wei,XIN Yan,LIU Zhi-feng.Two-dimensional Maximal Frequent Itemset Mining Algorithm in Traditional Chinese Medicine Field[J].Mini-micro Systems,2007,28(12):2193-2198.
Authors:XIAO Wen  JU Shi-guang  ZHU Jin-wei  XIN Yan  LIU Zhi-feng
Abstract:When mining medicine group in TCM(traditional Chinese medicine) field,it shows that although some itemsets' support rating is higher than those frequent itemsets need,these itemsets are not interesting,namely,too frequent makes them ordinary.In this paper,the concept of support rating interval is brought in and the structure of two-dimensional TCM-FP forest and the algorithm of tree building are proposed for the TCM data mining.While mining TCM medicine groups for disease symptoms,optimized search strategy is adopted and the TCMA mining algorithm of intra-dimensional maximal frequent itemsets based on support rating interval is developed.This algorithm both reduces the mining scope and improves the significance of the rules,and has higher execute efficiency than FP-growth.
Keywords:data mining  traditional Chinese medicine  interdimension association rule  maximal frequent itemset
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号