首页 | 本学科首页   官方微博 | 高级检索  
     


A discretization algorithm based on Class-Attribute Contingency Coefficient
Authors:Cheng-Jung Tsai  Chien-I. Lee
Affiliation:a Department of Computer Science, National Chiao Tung University, Hsinchu, Taiwan, ROC
b Department of Information and Learning Technology, National University of Tainan, Tainan, Taiwan, ROC
c Department of Information Management, National DongHwa University, Hualien, Taiwan, ROC
Abstract:Discretization algorithms have played an important role in data mining and knowledge discovery. They not only produce a concise summarization of continuous attributes to help the experts understand the data more easily, but also make learning more accurate and faster. In this paper, we propose a static, global, incremental, supervised and top-down discretization algorithm based on Class-Attribute Contingency Coefficient. Empirical evaluation of seven discretization algorithms on 13 real datasets and four artificial datasets showed that the proposed algorithm could generate a better discretization scheme that improved the accuracy of classification. As to the execution time of discretization, the number of generated rules, and the training time of C5.0, our approach also achieved promising results.
Keywords:Data mining   Classification   Decision tree   Discretization   Contingency coefficient
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号