Clustering categorical data: an approach based on dynamical systems
Authors: David Gibson, Jon Kleinberg, Prabhakar Raghavan
Affiliation: (1) Department of Computer Science, UC Berkeley, Berkeley, CA 94720, USA; e-mail: dag@cs.berkeley.edu; (2) Department of Computer Science, Cornell University, Ithaca, NY 14853, USA; e-mail: kleinber@cs.cornell.edu; (3) IBM Almaden Research Center, San Jose, CA 95120, USA; e-mail: pragh@almaden.ibm.com
Abstract: We describe a novel approach for clustering collections of sets, and its application to the analysis and mining of categorical data. By “categorical data,” we mean tables with fields that cannot be naturally ordered by a metric – e.g., the names of producers of automobiles, or the names of products offered by a manufacturer. Our approach is based on an iterative method for assigning and propagating weights on the categorical values in a table; this facilitates a type of similarity measure arising from the co-occurrence of values in the dataset. Our techniques can be studied analytically in terms of certain types of non-linear dynamical systems.
Received February 15, 1999 / Accepted August 15, 1999
Keywords: Clustering – Data mining – Categorical data – Dynamical systems – Hypergraphs
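
The abstract mentions an iterative method that assigns and propagates weights on the categorical values in a table. The following is a minimal sketch of one way such an iteration could look, not the authors' implementation. The abstract does not give the update rule, so everything specific here is an assumption: the function names `propagate_weights` and `_normalize`, the combining rule (sum of the weights of the other values in the same row), the global L2 normalization, the fixed iteration count, and the toy rows.

```python
import math
from collections import defaultdict


def _normalize(weights):
    """Scale weights in place to unit L2 norm (assumed normalization)."""
    norm = math.sqrt(sum(w * w for w in weights.values()))
    if norm > 0:
        for node in weights:
            weights[node] /= norm


def propagate_weights(rows, iterations=20):
    """Propagate weights between categorical values that co-occur in rows.

    Each value is keyed as (column index, value) so that identical strings
    in different columns stay distinct.
    """
    weights = {(c, v): 1.0 for row in rows for c, v in enumerate(row)}
    _normalize(weights)
    for _ in range(iterations):
        updated = defaultdict(float)
        for row in rows:
            row_nodes = list(enumerate(row))
            total = sum(weights[n] for n in row_nodes)
            for n in row_nodes:
                # Assumed combiner: add the weights of the *other* values
                # that appear in the same row as this value.
                updated[n] += total - weights[n]
        weights = {node: updated[node] for node in weights}
        _normalize(weights)
    return weights


# Hypothetical toy table: (producer, product) pairs.
toy_rows = [("Honda", "Civic"), ("Honda", "Accord"), ("Toyota", "Camry")]
toy_weights = propagate_weights(toy_rows)
```

In this sketch, values that co-occur with many other well-weighted values accumulate weight over the iterations, which is one way a co-occurrence-based similarity signal could emerge.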
This article is indexed in SpringerLink and other databases.