
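Nonlinear Dimensionality Reduction Based on Dictionary Learning
Cite this article:ZHENG Si-Long, LI Yuan-Xiang, WEI Xian, PENG Xi-Shuai. Nonlinear Dimensionality Reduction Based on Dictionary Learning. Acta Automatica Sinica, 2016, 42(7): 1065-1076. doi: 10.16383/j.aas.2016.c150557
Authors:ZHENG Si-Long  LI Yuan-Xiang  WEI Xian  PENG Xi-Shuai
Affiliation:1. School of Aeronautics and Astronautics, Shanghai Jiao Tong University, Shanghai 200240, China; 2. Department of Electrical and Computer Engineering, Technical University of Munich, Munich D-80333, Germany
Funding:Supported by the National Natural Science Foundation of China (U1406404, 61331015, 41174164)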
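Abstract:Many dimensionality reduction (DR) methods, such as the classical principal component analysis (PCA) and isometric mapping (ISOMAP), preserve the important features of the original signal in the reduced data, but recovering the original signal well from the reduced data remains a challenge. In recent years, sparse representation (SR) has received wide attention in signal reconstruction research: a signal can be described by a linear combination of a few atoms from an overcomplete dictionary. This paper proposes a nonlinear dimensionality reduction method based on dictionary learning. In reducing a high-dimensional input signal to low-dimensional features, we expect some important geometric features (inner product, distance and angle) to be preserved while the original signal can still be recovered from the low-dimensional data. To this end, we use the concentrated dictionary learning (CDL) algorithm to train a dictionary pair (a high-dimensional dictionary D and a low-dimensional dictionary P) so that the energy of the high-dimensional original signal is concentrated in a low-dimensional subspace. Dictionary D is used to obtain the sparse representation coefficients, and dictionary P is a direct down-sampling of D; the CDL algorithm ensures that P gathers most of the energy in D. The problem of signal reduction and recovery thus becomes the problem of training the dictionary pair, and reducing the signal amounts to retaining energy from D to P. Experiments show that CDL retains a certain signal reconstruction ability beyond the limits of the restricted isometry property (RIP) condition, can recover images at lower dimensions, and outperforms conventional compressed sensing methods. In addition, under heavy noise, CDL achieves better image compression than JPEG2000.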
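
For reference, the restricted isometry property named in the abstract is usually stated as below. This is the standard textbook form, not a formula taken from the paper; the symbols \Phi (an m-by-n measurement matrix), s (the sparsity level) and \delta_s (the isometry constant) are notation assumed here for illustration.

```latex
% \Phi satisfies the RIP of order s with constant \delta_s \in (0,1) if,
% for every vector x with at most s nonzero entries,
(1 - \delta_s)\,\lVert x \rVert_2^2
  \;\le\; \lVert \Phi x \rVert_2^2
  \;\le\; (1 + \delta_s)\,\lVert x \rVert_2^2 .
```

Random Gaussian projections satisfy this condition with high probability only when m is large enough relative to s, which is the regime the abstract contrasts CDL against.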

Keywords:Dimensionality reduction   sparse representation   compressed sensing   dictionary learning
Received:2015-09-02

Nonlinear Dimensionality Reduction Based on Dictionary Learning
ZHENG Si-Long, LI Yuan-Xiang, WEI Xian, PENG Xi-Shuai. Nonlinear Dimensionality Reduction Based on Dictionary Learning. ACTA AUTOMATICA SINICA, 2016, 42(7): 1065-1076. doi: 10.16383/j.aas.2016.c150557
Authors:ZHENG Si-Long  LI Yuan-Xiang  WEI Xian  PENG Xi-Shuai
Affiliation:1. School of Aeronautics and Astronautics, Shanghai Jiao Tong University, Shanghai 200240, China; 2. Department of Electrical and Computer Engineering, Technische Universitaet München, Munich D-80333, Germany
Abstract:Most classic dimensionality reduction (DR) algorithms, such as principal component analysis (PCA) and isometric mapping (ISOMAP), focus on finding a low-dimensional embedding of the original data, and this embedding is often not reversible. Making DR processes reversible remains challenging in many applications. Sparse representation (SR) has shown its power in signal reconstruction and denoising. To tackle the problem of large-scale dataset processing, this work develops a differentiable model for invertible DR based on SR. In mapping a high-dimensional input signal to a low-dimensional feature, we aim to preserve important geometric features (such as inner product, distance and angle) so that reliable reconstruction from the low-dimensional space back to the original high-dimensional space is possible. We employ an algorithm called concentrated dictionary learning (CDL) to train the high-dimensional dictionary so that its energy is concentrated in a low-dimensional subspace. We then design a pair of dictionaries, D and P, where D is used to obtain the sparse representation and P is a direct down-sampling of D. CDL ensures that P captures most of the energy of D. The signal reconstruction problem thus becomes the problem of training dictionaries D and P, and the mapping from input signal X to feature Y becomes a process of energy retention from D to P. Experimental results show that, without the restrictions that the restricted isometry property (RIP) places on linear projections, CDL can reconstruct images from a lower-dimensional space and outperforms state-of-the-art DR methods (such as Gaussian random compressive sensing). In addition, for noise-corrupted images, CDL obtains better compression performance than JPEG2000.
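
The paired-dictionary pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' CDL algorithm: the dictionary D is not trained but built by a hypothetical helper (`make_concentrated_dictionary`) that scales row i by `decay**i` so most energy sits in the top rows, "direct down-sampling" is assumed to mean keeping the first m rows of D, sparse coding uses a plain orthogonal matching pursuit written here, and all sizes (`n`, `m`, `K`, `k`) are arbitrary.

```python
import numpy as np

def omp(A, y, k):
    """Greedy orthogonal matching pursuit: fit y with at most k columns of A."""
    norms = np.maximum(np.linalg.norm(A, axis=0), 1e-12)
    residual, support, coef = y.copy(), [], np.zeros(0)
    for _ in range(k):
        # Score each atom by its normalised correlation with the residual.
        scores = np.abs(A.T @ residual) / norms
        if support:
            scores[support] = -np.inf  # never pick the same atom twice
        support.append(int(np.argmax(scores)))
        # Re-fit all selected atoms jointly by least squares.
        coef, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        residual = y - A[:, support] @ coef
    alpha = np.zeros(A.shape[1])
    alpha[support] = coef
    return alpha

def make_concentrated_dictionary(n, K, decay, rng):
    """Hypothetical stand-in for a CDL-trained D (NOT the paper's training).

    Row i is scaled by decay**i, then columns are normalised to unit length.
    """
    D = rng.standard_normal((n, K)) * (decay ** np.arange(n))[:, None]
    return D / np.linalg.norm(D, axis=0)

def energy_ratio(D, m):
    """Fraction of the squared Frobenius norm of D held by its first m rows."""
    return np.sum(D[:m] ** 2) / np.sum(D ** 2)

rng = np.random.default_rng(0)
n, m, K, k = 64, 16, 128, 3          # assumed sizes, chosen for illustration
D = make_concentrated_dictionary(n, K, 0.9, rng)
P = D[:m]                            # assumed meaning of "down-sampling of D"

# A noisy signal that is k-sparse in D.
alpha_true = np.zeros(K)
alpha_true[rng.choice(K, size=k, replace=False)] = rng.standard_normal(k)
x = D @ alpha_true + 0.01 * rng.standard_normal(n)

alpha = omp(D, x, k)                 # sparse code of x under D
y = P @ alpha                        # low-dimensional feature
alpha_hat = omp(P, y, k)             # sparse code recovered from y under P
x_hat = D @ alpha_hat                # reconstruction in the original space

print("energy of D kept by P:", energy_ratio(D, m))
print("relative reconstruction error:",
      np.linalg.norm(x - x_hat) / np.linalg.norm(x))
```

Because the feature y is computed from a sparse code rather than from a fixed projection of x, the map from x to y is nonlinear, which is the sense in which the abstract calls the method a nonlinear DR. How well `x_hat` matches `x` in this sketch depends on the random draw and on how much energy P retains, so the printed error should be read as illustrative only.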
Keywords:Dimensionality reduction (DR)  sparse representation (SR)  compressed sensing (CS)  dictionary learning