首页 | 本学科首页   官方微博 | 高级检索  
     


Selecting good views of high-dimensional data using class consistency
Authors:Mike Sips   Boris Neubert   John P. Lewis   Pat Hanrahan
Affiliation:Max Planck Center for Visual Computing Stanford/Saarbruecken;University of Konstanz;Massey University;Stanford University
Abstract:Many visualization techniques involve mapping high-dimensional data spaces to lower-dimensional views. Unfortunately, mapping a high-dimensional data space into a scatterplot involves a loss of information; or, even worse, it can give a misleading picture of valuable structure in higher dimensions. In this paper, we propose class consistency as a measure of the quality of the mapping. Class consistency enforces the constraint that classes of n–D data are shown clearly in 2–D scatterplots. We propose two quantitative measures of class consistency, one based on the distance to the class's center of gravity, and another based on the entropies of the spatial distributions of classes. We performed an experiment where users choose good views, and show that class consistency has good precision and recall. We also evaluate both consistency measures over a range of data sets and show that these measures are efficient and robust.
Keywords:Data Mining [I.5.3]: Clustering    User Interfaces [H.5.2]: Evaluation/methodology
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号