首页 | 本学科首页   官方微博 | 高级检索  
     


Using regression makes extraction of shared variation in multiple datasets easy
Authors:Jussi Korpela  Andreas Henelius  Lauri Ahonen  Arto Klami  Kai Puolamäki
Affiliation:1.Finnish Institute of Occupational Health,Helsinki,Finland;2.Department of Computer Science, Helsinki Institute for Information Technology HIIT,University of Helsinki,Helsinki,Finland
Abstract:In many data analysis tasks it is important to understand the relationships between different datasets. Several methods exist for this task but many of them are limited to two datasets and linear relationships. In this paper, we propose a new efficient algorithm, termed cocoreg, for the extraction of variation common to all datasets in a given collection of arbitrary size. cocoreg extends redundancy analysis to more than two datasets, utilizing chains of regression functions to extract the shared variation in the original data space. The algorithm can be used with any linear or non-linear regression function, which makes it robust, straightforward, fast, and easy to implement and use. We empirically demonstrate the efficacy of shared variation extraction using the cocoreg algorithm on five artificial and three real datasets.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号