首页 | 本学科首页   官方微博 | 高级检索  
     


Dimensionality reduction in data mining: A Copula approach
Affiliation:1. LIMED Laboratory, University of Bejaia, Faculty of exact sciences, Computing Department, 06000, Bejaia, Algeria;2. Lab-STICC Laboratory, University of Brest, 20 Avenue Victor Le Gorgeu, 29238 Brest, France;3. Lab-CASL Laboratory,University College Dublin, Belfield, Dublin 4, Ireland;1. Dept. of Electronics, Information & Communications Engineering, Daejeon University, Korea South;2. Visual Intelligence SW Research Section, Electronics and Telecommunications Research Institute, Korea South;3. School of Information Technologies, University of Sydney, Australia;1. Electronic Engineering Department/Graduate School at Shenzhen, Tsinghua University, Beijing 100084, China;2. Biometrics Research Centre and the Department of Computing, The Hong Kong Polytechnic University, Kowloon, Hong Kong;3. Biocomputing Research Center, Shenzhen Graduate School, Harbin Institute of Technology, Shenzhen 518055, China
Abstract:The recent trends in collecting huge and diverse datasets have created a great challenge in data analysis. One of the characteristics of these gigantic datasets is that they often have significant amounts of redundancies. The use of very large multi-dimensional data will result in more noise, redundant data, and the possibility of unconnected data entities. To efficiently manipulate data represented in a high-dimensional space and to address the impact of redundant dimensions on the final results, we propose a new technique for the dimensionality reduction using Copulas and the LU-decomposition (Forward Substitution) method. The proposed method is compared favorably with existing approaches on real-world datasets: Diabetes, Waveform, two versions of Human Activity Recognition based on Smartphone, and Thyroid Datasets taken from machine learning repository in terms of dimensionality reduction and efficiency of the method, which are performed on statistical and classification measures.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号