首页 | 本学科首页   官方微博 | 高级检索  
     

挖掘所关注规则的多策略方法研究
引用本文:程继华,郭建生,施鹏飞.挖掘所关注规则的多策略方法研究[J].计算机学报,2000,23(1):47-51.
作者姓名:程继华  郭建生  施鹏飞
作者单位:上海交通大学图像处理与模式识别研究所,上海,200030
基金项目:国家自然科学基金!( 695 75 0 12 )
摘    要:通过数据挖掘,从大型数据库中发现了大量规则,如何选取所关注的规则,是知识发现的重要研究内容。该文研究了利用领域知识对规则的主观关注程度进行度量的方法,给出了一个能够度量规则的简洁性和新奇性的客观关注程度的计算函数,提出了选取用户关注的规则的多策略方法。

关 键 词:知识发现  数据挖掘  规则  数据库  多策略方法

Multi-Strategy Approach to Mining Interesting Rules
CHENG Ji-Hua,GUO Jian-Sheng,SHI Peng-Fei.Multi-Strategy Approach to Mining Interesting Rules[J].Chinese Journal of Computers,2000,23(1):47-51.
Authors:CHENG Ji-Hua  GUO Jian-Sheng  SHI Peng-Fei
Abstract:A large set of rules can be discovered from large database by using the data mining technologies, but most of them are of no interesting to the user. How to filter the interesting rules is the crucial step within the knowledge discovery in database. The methods to calculate the subjective interestingness and to measure the importance of rules by using of the domain knowledge are studied. A new objective interestingness function, which can measure the novelty and simplicity of rules, is given. A multi strategy approach, which combines with background knowledge, for selecting interesting rules is proposed in the paper. The process of filtering interesting rules includes the several sub processes: deleting the redundant rules; grouping the rules into sub groups; clustering each sub group into classes, and selecting the most interesting rule from each class; finally combining them into the set of interesting rules. Some concepts, such as the importance of attribute, the interestingness of rules, the distance between rules etc, are proposed in the paper. There is an example in the paper, which applying the multi strategy approach, for illustrating the process to select the interesting rules that discovered from CPICDB (chinese parasite infection census database). The example demonstrated the algorithm represented in the paper is practicable and effective.
Keywords:knowledge discovery  subjective measures interestingness  objective measures interestingness  similarity  domain knowledge
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号