首页 | 官方网站   微博 | 高级检索  
     

基于改进BUS算法的焦虑抑郁障碍因素挖掘
引用本文:刘峰斌,袁志勇,肖 玲,王惠玲,王高华.基于改进BUS算法的焦虑抑郁障碍因素挖掘[J].计算机工程与应用,2015,51(18):126-130.
作者姓名:刘峰斌  袁志勇  肖 玲  王惠玲  王高华
作者单位:1.武汉大学 计算机学院,武汉 430072 2.武汉大学人民医院,武汉 430060
摘    要:针对焦虑抑郁患者的早期预防和诊断需求,将关联规则挖掘和压缩方法应用于焦虑抑郁障碍因素的研究,在病人数据中挖掘出与焦虑抑郁障碍相关性较高的因素集合。单独使用频繁项集挖掘算法会产生过多的频繁项集和关联规则,导致其实用性大为降低。对收集的病人数据进行预处理,采用FP-growth算法,挖掘出预处理后数据中的频繁项集,采用最新改进Bottom-Up Summarization(BUS)算法,对挖掘出的频繁项集进行压缩。同时将最后得到的关联规则与未压缩得到的关联规则、原始BUS算法及Top-K算法压缩后得到的关联规则进行对比。实验结果表明,使用改进BUS算法得到的规则数量适中、信息冗余较少而且覆盖的人群具有更高的患病风险。

关 键 词:数据挖掘  关联规则  关联规则压缩  频繁项集  焦虑  抑郁  

Anxiety and depression factors mining based on improved BUS algorithm
LIU Fengbin,YUAN Zhiyong,XIAO Ling,WANG Huiling,WANG Gaohua.Anxiety and depression factors mining based on improved BUS algorithm[J].Computer Engineering and Applications,2015,51(18):126-130.
Authors:LIU Fengbin  YUAN Zhiyong  XIAO Ling  WANG Huiling  WANG Gaohua
Affiliation:1.Computer School, Wuhan University, Wuhan 430072, China 2.Renmin Hospital of Wuhan University, Wuhan 430060, China
Abstract:For early prevention and diagnosis of patients with anxiety and depression, this paper applies association rule mining and summarization methods to medical records to discover sets of risk factors associated with anxiety and depression. Separate use of frequent itemsets mining algorithm would produce too many frequent itemsets and association rules, causing its practicability greatly reduced. It preprocesses the medical records. Then it uses the FP-growth algorithm to find frequent itemsets in the data after pretreatment. At last, it uses the latest improvement Bottom-Up Summarization(BUS) algorithm to summarize the discovered frequent itemsets. At the same time, it compares the association rules obtained at last with the association rules uncompressed and the association rules obtained by the original BUS algorithm and Top-K. Experimental results show that the rules obtained by improved BUS algorithm have moderate number, less redundant information and the people covered by these rules are at high risk of anxiety or depression.
Keywords:data mining  association rules  association rule summarization  frequent itemsets  anxiety  depression  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号