On data mining, compression, and Kolmogorov complexity
Authors: Christos Faloutsos, Vasileios Megalooikonomou
Affiliation: (1) School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213-3891, USA; (2) Department of Computer and Information Sciences, Temple University, 314 Wachman Hall, 1805 N. Broad Street, Philadelphia, PA 19122, USA
Abstract: Will we ever have a theory of data mining analogous to the relational algebra in databases? Why do we have so many clearly different clustering algorithms? Could data mining be automated? We show that the answer to all these questions is negative, because data mining is closely related to compression and Kolmogorov complexity, and the latter is undecidable. Therefore, data mining will always be an art, where our goal will be to find better models (patterns) that fit our datasets as well as possible.
Keywords: Data mining, Compression, Kolmogorov complexity, Clustering, Classification, Forecasting, Outliers
This article is indexed in SpringerLink and other databases.