首页 | 本学科首页   官方微博 | 高级检索  
     

A New Classification Method to Overcome Over—Branching
引用本文:周傲英,钱卫宁,钱海蕾,金文.A New Classification Method to Overcome Over—Branching[J].计算机科学技术学报,2002,17(1):18-27.
作者姓名:周傲英  钱卫宁  钱海蕾  金文
作者单位:[1]DepartmentofComputerScience,FudanUniversity,Shanghai200433,P.R.China [2]SimonFraserUniversity,8888UniversityDrive,Burnaby,B.C.,V5A1S6,Canada
基金项目:the NKBRSF of China,国家自然科学基金 
摘    要:Classification is an important technique in data mining.The decision trees builty by most of the existing classification algorithms commonly feature over-branching,which will lead to poor efficiency in the subsequent classification period.In this paper,we present a new value-oriented classification method,which aims at building accurately proper-sized decision trees while reducing over-branching as much as possible,based on the concepts of frequent-pattern-node and exceptive-child-node.The experiments show that while using relevant anal-ysis as pre-processing ,our classification method,without loss of accuracy,can eliminate the over-branching greatly in decision trees more effectively and efficiently than other algorithms do.

关 键 词:数据挖掘  决策树  分类方法  算法

A new classification method to overcome over-branching
Aoying Zhou,Weining Qian,Hailei Qian,Wen Jin.A new classification method to overcome over-branching[J].Journal of Computer Science and Technology,2002,17(1):18-27.
Authors:Aoying Zhou  Weining Qian  Hailei Qian  Wen Jin
Affiliation:(1) Department of Computer Science, Fudan University, 200433 Shanghai, P.R. China;(2) Simon Fraser University, 8888 University Drive, V5A 1S6 Burnaby, B.C., Canada
Abstract:Classification is an important technique in data mining. The decision trees built by most of the existing classification algorithms commonly feature over-branching, which will lead to poor efficiency in the subsequent classification period. In this paper, we present a new value-oriented classification method, which aims at building accurately proper-sized decision trees while reducing over-branching as much as possible, based on the concepts of frequent-pattern-node and exceptive-child-node. The experiments show that while using relevant analysis as pre-processing, our classification method, without loss of accuracy, can eliminate the over-branching greatly in decision trees more effectively and efficiently than other algorithms do. This work is partially supported by the NKBRSF of China (G1998030414) and the National Natural Science Foundation of China (No.60003016). ZHOU Aoying received his M.S. degree in computer science from Sichuan. University in 1988, and his Ph.D. degree in computer software from Fudan University in 1993. He is currently a professor in the Department of Computer Science, Fudan University. His main research interests include object-oriented data model for multimedia information, Web data management, data mining and data warehousing, the novel database technologies and their applications to digital library and electronic commerce. QIAN Weining is a Ph.D. candidate in the Departemnt of Computer Science, Fudan University. His speciality is database and knowledge base. He is supported by Microsoft Research Fellowship. His research interests include clustering, data mining and Web mining. QIAN Hailei is a graduate student in the Department of Computer Science Fudan University. Her speciality is database and knowledge base. Her research interests include clustering and data mining. JIN Wen is a Ph.D. candidate in the School of Computing, Simon Fraser University, Canada, supervised by Dr. Jiawei Han. His current research interests are database and data warehousing, data mining, Web mining and XML.
Keywords:data mining  classification  over branching  decision tree  frequent pattern
本文献已被 维普 万方数据 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号