期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Tucker decomposition-based tensor learning for human action recognition

Jianguang Zhang Yahong Han Jianmin Jiang 《Multimedia Systems》2016,22(3):343-353

相似文献

2.

基于加权多尺度张量子空间的人脸图像特征提取方法 总被引：1，自引：1，他引：0

王仕民程柏良叶继华王明文《数据采集与处理》2016,31(4):791-798

为了不破坏原始数据固有的高阶结构和数据之间的相关性,减少光照对图像特征的影响,并优化多尺度特征的权重,提出了基于加权多尺度张量子空间的图像特征提取方法。采用多尺度小波变换表征图像各个部位特征,使用不确定度权衡每个尺度对图像分类的作用,并组建成多尺度张量子空间,结合多线性主成分分析与线性判别分析算法,降低了图像在处理过程中的成本,保存了高维数据固有结构和相关性,完成对图像特征提取。使用CAS-PEAL-R1东方人脸库进行评测,实验结果表明,该图像特征提取算法用于图像识别过程中具有较好的效果,具有一定的可行性。相似文献

3.

Individual Kernel Tensor-subspaces for robust face recognition: a computationally efficient tensor framework without requiring mode factorization.

Sung Won Park Marios Savvides 《IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics》2007,37(5):1156-1166

Facial images change appearance due to multiple factors such as different poses, lighting variations, and facial expressions. Tensors are higher order extensions of vectors and matrices, which make it possible to analyze different appearance factors of facial variation. Using higher order tensors, we can construct a multilinear structure and model the multiple factors of face variation. In particular, among the appearance factors, the factor of a person's identity modeled by a tensor structure can be used for face recognition. However, this tensor-based face recognition creates difficulty in factorizing the unknown parameters of a new test image and solving for the person-identity parameter. In this paper, to break this limitation of applying the tensor-based methods to face recognition, we propose a novel tensor approach based on an individual-modeling method and nonlinear mappings. The proposed method does not require the problematic tensor factorization and is more efficient than the traditional TensorFaces method with respect to computation and memory. We set up the problem of solving for the unknown factors as a least squares problem with a quadratic equality constraint and solve it using numerical optimization techniques. We show that an individual-multilinear approach reduces the order of the tensor so that it makes face-recognition tasks computationally efficient as well as analytically simpler. We also show that nonlinear kernel mappings can be applied to this optimization problem and provide more accuracy to face-recognition systems than linear mappings. In this paper, we show that the proposed method, Individual Kernel TensorFaces, produces the better discrimination power for classification. The novelty in our approach as compared to previous work is that the Individual Kernel TensorFaces method does not require estimating any factor of a new test image for face recognition. In addition, we do not need to have any a priori knowledge of or assumption about the factors of a test image when using the proposed method. We can apply Individual Kernel TensorFaces even if the factors of a test image are absent from the training set. Based on various experiments on the Carnegie Mellon University Pose, Illumination, and Expression database, we demonstrate that the proposed method produces reliable results for face recognition. 相似文献

4.

基于张量投票的摄像机自标定方法研究

王君竹陈丽芳刘渊《计算机工程与应用》2016,52(15):190-197

针对传统的基于Kruppa方程摄像机自标定算法的欠鲁棒性,首次提出将鲁棒的张量投票算法用于摄像机自标定方法中。利用基于尺度不变的SIFT算法查找并匹配出每对图像的特征点,其中待匹配图像由摄像机对同一场景从三个不同角度位置拍摄,对图像张量投票后按棒张量特征值降序排序,由此筛选得到具有鲁棒性边缘特征的前八对特征点,利用八点算法求解相应的基础矩阵和极点,根据Kruppa方程和三维重建（SFM）算法求得摄像机参数矩阵。实验结果证明,该方法具有较高标定精度,并通过加入高斯噪声的仿真实验证明该算法是一种鲁棒的摄像机自标定方法。相似文献

5.

Semisupervised Sparse Multilinear Discriminant Analysis

下载免费PDF全文

黄锴张丽清《计算机科学技术学报》2014,29(6):1058-1071

Various problems are encountered when adopting ordinary vector space algorithms for high-order tensor data input. Namely, one must overcome the Small Sample Size （SSS） and overfitting problems. In addition, the structural information of the original tensor signal is lost during the vectorization process. Therefore, comparable methods using a direct tensor input are more appropriate. In the case of electrocardiograms （ECGs）, another problem must be overcome; the manual diagnosis of ECG data is expensive and time consuming, rendering it difficult to acquire data with diagnosis labels. However, when effective features for classification in the original data are very sparse, we propose a semisupervised sparse multilinear discriminant analysis （SSSMDA） method. This method uses the distribution of both the labeled and the unlabeled data together with labels discovered through a label propagation Mgorithm. In practice, we use 12-lead ECGs collected from a remote diagnosis system and apply a short-time-fourier transformation （STFT） to obtain third-order tensors. The experimental results highlight the sparsity of the ECG data and the ability of our method to extract sparse and effective features that can be used for classification. 相似文献

6.

Image classification by multimodal subspace learning

Jun Yu Feng Lin Hock-Soon Seah Cuihua Li Ziyu Lin 《Pattern recognition letters》2012,33(9):1196-1204

In recent years we witnessed a surge of interest in subspace learning for image classification. However, the previous methods lack of high accuracy since they do not consider multiple features of the images. For instance, we can represent a color image by finding a set of visual features to represent the information of its color, texture and shape. According to the “Patch Alignment” Framework, we developed a new subspace learning method, termed Semi-Supervised Multimodal Subspace Learning (SS-MMSL), in which we can encode different features from different modalities to build a meaningful subspace. In particular, the new method adopts the discriminative information from the labeled data to construct local patches and aligns these patches to get the optimal low dimensional subspace for each modality. For local patch construction, the data distribution revealed by unlabeled data is utilized to enhance the subspace learning. In order to find a low dimensional subspace wherein the distribution of each modality is sufficiently smooth, SS-MMSL adopts an alternating and iterative optimization algorithm to explore the complementary characteristics of different modalities. The iterative procedure reaches the global minimum of the criterion due to the strong convexity of the criterion. Our experiments of image classification and cartoon retrieval demonstrate the validity of the proposed method. 相似文献

7.

基于张量表示的直推式多模态视频语义概念检测 总被引：4，自引：0，他引：4

吴飞刘亚楠庄越挺《软件学报》2008,19(11):2853-2868

提出了一种基于高阶张量表示的视频语义分析与理解框架.在此框架中,视频镜头首先被表示成由视频中所包含的文本、视觉和听觉等多模态数据构成的三阶张量;其次,基于此三阶张量表达及视频的时序关联共生特性设计了一种子空间嵌入降维方法,称为张量镜头;由于直推式学习从已知样本出发能对特定的未知样本进行学习和识别.最后在这个框架中提出了一种基于张量镜头的直推式支持张量机算法,它不仅保持了张量镜头所在的流形空间的本征结构,而且能够将训练集合外数据直接映射到流形子空间,同时充分利用未标记样本改善分类器的学习性能.实验结果表明,该方法能够有效地进行视频镜头的语义概念检测. 相似文献

8.

Tensor learning and automated rank selection for regression-based video classification

Jianguang Zhang Yanbin Liu Jianmin Jiang 《Multimedia Tools and Applications》2018,77(22):29213-29230

相似文献

9.

Text classification method based on self-training and LDA topic models

《Expert systems with applications》2017

Supervised text classification methods are efficient when they can learn with reasonably sized labeled sets. On the other hand, when only a small set of labeled documents is available, semi-supervised methods become more appropriate. These methods are based on comparing distributions between labeled and unlabeled instances, therefore it is important to focus on the representation and its discrimination abilities. In this paper we present the ST LDA method for text classification in a semi-supervised manner with representations based on topic models. The proposed method comprises a semi-supervised text classification algorithm based on self-training and a model, which determines parameter settings for any new document collection. Self-training is used to enlarge the small initial labeled set with the help of information from unlabeled data. We investigate how topic-based representation affects prediction accuracy by performing NBMN and SVM classification algorithms on an enlarged labeled set and then compare the results with the same method on a typical TF-IDF representation. We also compare ST LDA with supervised classification methods and other well-known semi-supervised methods. Experiments were conducted on 11 very small initial labeled sets sampled from six publicly available document collections. The results show that our ST LDA method, when used in combination with NBMN, performed significantly better in terms of classification accuracy than other comparable methods and variations. In this manner, the ST LDA method proved to be a competitive classification method for different text collections when only a small set of labeled instances is available. As such, the proposed ST LDA method may well help to improve text classification tasks, which are essential in many advanced expert and intelligent systems, especially in the case of a scarcity of labeled texts. 相似文献

10.

Tensor-Based Transductive Learning for Multimodality Video Semantic Concept Detection

Fei Wu Yanan Liu Yueting Zhuang 《Multimedia, IEEE Transactions on》2009,11(5):868-878

Interaction and integration of multimodality media types such as visual, audio, and textual data in video are the essence of video semantic analysis. Contextual information propagation is useful for both intra- and inter-shot correlations. However, the traditional concatenated vector representation of videos weakens the power of the propagation and compensation among the multiple modalities. In this paper, we introduce a higher-order tensor framework for video analysis. We represent image frame, audio, and text in video shots as data points by the 3rd-order tensor. Then we propose a novel dimension reduction algorithm which explicitly considers the manifold structure of the tensor space from contextual temporal associated cooccurring multimodal media data. Our algorithm inherently preserves the intrinsic structure of the sub- manifold where tensorshots are sampled and is also able to map out-of-sample data points directly. We propose a new transductive support tensor machines algorithm to train effective classifier using large amount of unlabeled data together with the labeled data. Experiment results on TREVID 2005 data set show that our method improves the performance of video semantic concept detection. 相似文献

11.

一种基于Bernoulli混合模型的不完整数据文本分类方法

蔡崇超王士同《计算机应用》2007,27(5):1235-1237

在Bernoulli混合模型和期望最大化(EM)算法的基础上给出了一种基于不完整数据的改进方法。首先在已标记数据的基础上通过Bernoulli混合模型和朴素贝叶斯算法得到似然函数参数估计初始值，然后利用含有权值的EM算法对分类器的先验概率模型进行参数估计，得到最终的分类器。实验结果表明，该方法在准确率和查全率方面要优于朴素贝叶斯文本分类。相似文献

12.

基于主动深度学习的高光谱影像分类

程圆娥周绍光袁春琦陈蒙蒙《计算机工程与应用》2017,53(17):192-196

针对当前高光谱遥感影像分类人工标注样本费时费力,大量未标注样本未得到有效利用以及主要利用光谱信息而忽视空间信息等问题,提出了一种空-谱信息与主动深度学习相结合的高光谱影像分类方法。首先利用主成分分析对原始影像进行降维,在此基础上提取像素的一正方形小邻域作为该像素的空间信息并结合其原始光谱信息得到空谱特征。然后,通过稀疏自编码器得到原始数据的稀疏特征表达,并通过逐层无监督学习稀疏自编码器构建深度神经网络,输出原始数据的深度特征,将其连接到softmax分类器,利用少量标记样本以监督学习的方式完成模型的精调。最后,利用主动学习算法选择最不确定性样本对其进行标注,并加入至训练样本以提高分类器的分类效果。分别对PaviaU影像和PaviaC影像进行分类实验的结果表明,该方法在少量标记样本情况下,相对于传统方法能有效地提高分类精度。相似文献

13.

应用分形维数的自适应张量投票算法

下载免费PDF全文

王莉戴芳郭文艳韩伟《计算机工程与应用》2013,49(12):168-171

张量投票算法是感知聚类方法中一种比较常用的计算方法,可以应用到图像处理等各个方面,具有较强的鲁棒性,非迭代等特性。张量投票算法中尺度参数的自适应选取对于投票域的建立起着至关重要的作用。通过分形维数来选取尺度参数,建立了尺度参数与分形维数的关系,提出了基于分形维数的自适应张量投票算法,并将该方法应用于图像的线特征提取和边缘修复。与传统的张量投票算法进行比较,该方法在图像线特征提取和边缘修复方面获得了较好的实验结果。相似文献

14.

结合均值漂移的基于图的半监督图像分类

白艺娜汪西莉《计算机应用》2013,33(9):2606-2609

针对基于图的半监督流形正则化图像分类算法需要大量无标记样本训练分类器,空间和时间复杂度高,甚至不能处理大规模图像,且对背景或目标复杂的图像分类错误率较高的问题,提出了结合均值漂移（mean shift）的基于图的半监督流形正则化图像分类算法。该方法对基于图的半监督流形正则化分类算法的改进主要体现在两方面,首先是通过mean shift算法对图像进行了平滑,以平滑后的图像作为分类对象;其次不是利用所有无标记样本,而是只采用少量无标记样本。实验结果表明:图像的平滑使得目标和背景区域的特征更为一致,从而利用较少的样本就可以提高分类器的正确率;同时大大降低了算法的复杂度,使得基于图的半监督分类算法用于分类大规模图像成为可能。相似文献

15.

一种鲁棒性的少样本学习方法

代磊超冯林杨玉亭尚兴林苏菡《小型微型计算机系统》2021,(2):340-347

少样本学习是目前机器学习研究领域的一个热点,它能在少量的标记样本中学习到较好的分类模型.但是,在噪声的不确定环境中,传统的少样本学习模型泛化能力弱.针对这一问题,提出一种鲁棒性的少样本学习方法RFSL(Robust Few-Shot Learning).首先,使用核密度估计(Kernel Density Estimation,KDE)和图像滤波(Image Filtering)方法在训练集中加入不同的随机噪声,形成多个不同噪声下的训练集,并分别生成支持集和查询集.其次,利用关系网络的关系模块通过训练集端到端地学习多个基分类器.最后,采用投票的方式对各基分类器的最末Sigmoid层非线性分类结果进行融合.实验结果表明,RFSL模型可促进小样本学习快速收敛,同时,与R-Net以及其他主流少样本学习方法相比,RFSL具有更高的分类准确率,更强的鲁棒性. 相似文献

16.

Fast facial shape recovery from a single image with general, unknown lighting by using tensor representation

Minsik Lee^{Author Vitae} Chong-Ho Choi Author Vitae 《Pattern recognition》2011,44(7):1487-1496

In this paper, we propose a fast 3-D facial shape recovery algorithm from a single image with general, unknown lighting. In order to derive the algorithm, we formulate a nonlinear least-square problem with two parameter vectors which are related to personal identity and light conditions. We then combine the spherical harmonics for the surface normals of a human face with tensor algebra and show that in a certain condition, the dimensionality of the least-square problem can be further reduced to one-tenth of the regular subspace-based model by using tensor decomposition (N-mode SVD), which greatly speeds up the computations. In order to enhance the shape recovery performance, we have incorporated prior information in updating the parameters. In the experiment, the proposed algorithm takes less than 0.4 s to reconstruct a face and shows a significant performance improvement over other reported schemes. 相似文献

17.

基于图像处理参数反馈自适应方法设计与应用

下载免费PDF全文

孙静杨森林师超陈伟张晓丽《计算机测量与控制》2021,29(6):164-168

自动图像处理程序的参数优化是一个耗时过程;对于具有高噪音和阴影复杂图像分析任务,手动调整参数无法产生较好的效果;为了多参数同时调优,提出了基于反馈的参数自适应模型来改进标准图像分割方法;通过前馈方式实现算法,对其调整参数来比较算法的性能,根据基准数据集对算法进行评估、来比较讨论图像阴影和噪声对分割和分类精度的影响;研究结果表明,在不同阴影等级的背景下,采用反馈参数自适应的图像分割算法比采用前馈算法的图像分割、分类效果更好;当仅存在抽象基准数据时,该方法执行自动图像分析得有效,同时,使用分级的数据集评估不同图像处理的鲁棒性,有利于最终用户进行图像处理;为自动化图像处理理论研究与实践提供借鉴. 相似文献

18.

Neighborhood preserving regression for image retrieval

Ke Lu^{Author Vitae} Jidong ZhaoAuthor Vitae 《Neurocomputing》2011,74(9):1467-1473

In content-based image retrieval (CBIR), relevance feedback has been proven to be a powerful tool for bridging the gap between low level visual features and high level semantic concepts. Traditionally, relevance feedback driven CBIR is often considered as a supervised learning problem where the user provided feedbacks are used to learn a distance metric or classification function. However, CBIR is intrinsically a semi-supervised learning problem in which the testing samples (images in the database) are present during the learning process. Moreover, when there are no sufficient feedbacks, these methods may suffer from the overfitting problem. In this paper, we propose a novel neighborhood preserving regression algorithm which makes efficient use of both labeled and unlabeled images. By using the unlabeled images, the geometrical structure of the image space can be incorporated into the learning system through a regularizer. Specifically, from all the functions which minimize the empirical loss on the labeled images, we select the one which best preserves the local neighborhood structure of the image space. In this way, our method can obtain a regression function which respects both semantic and geometrical structures of the image database. We present experimental evidence suggesting that our algorithm is able to use unlabeled data effectively for image retrieval. 相似文献

19.

基于半监督学习的增量图像分类方法

梁鹏黎绍发覃姜维罗剑高《模式识别与人工智能》2012,25(1):111-117

为有效使用大量未标注的图像进行分类,提出一种基于半监督学习的图像分类方法。通过共同的隐含话题桥接少量已标注的图像和大量未标注的图像,利用已标注图像的Must-link约束和Cannot-link约束提高未标注图像分类的精度。实验结果表明,该方法有效提高Caltech-101数据集和7类图像集约10%的分类精度。此外,针对目前绝大部分半监督图像分类方法不具备增量学习能力这一缺点,提出该方法的增量学习模型。实验结果表明,增量学习模型相比无增量学习模型提高近90%的计算效率。关键词半监督学习,图像分类,增量学习中图法分类号TP391。41IncrementalImageClassificationMethodBasedonSemi-SupervisedLearningLIANGPeng^1,2,LIShao-Fa²,QINJiang-Wei²,LUOJian-Gao³¹(SchoolofComputerScienceandEngineering,GuangdongPolytechnicNormalUniversity,Guangzhou510665)²(SchoolofComputerScienceandEngineering,SouthChinaUniversityofTechnology,Guangzhou510006)³(DepartmentofComputer,GuangdongAIBPolytechnicCollege,Guangzhou510507)ABSTRACTInordertouselargenumbersofunlabeledimageseffectively,animageclassificationmethodisproposedbasedonsemi-supervisedlearning。Theproposedmethodbridgesalargeamountofunlabeledimagesandlimitednumbersoflabeledimagesbyexploitingthecommontopics。Theclassificationaccuracyisimprovedbyusingthemust-linkconstraintandcannot-linkconstraintoflabeledimages。TheexperimentalresultsonCaltech-101and7-classesimagedatasetdemonstratethattheclassificationaccuracyimprovesabout10%bytheproposedmethod。Furthermore,duetothepresentsemi-supervisedimageclassificationmethodslackingofincrementallearningability,anincrementalimplementationofourmethodisproposed。Comparingwithnon-incrementallearningmodelinliterature,theincrementallearningmethodimprovesthecomputationefficiencyofnearly90%。相似文献

20.

基于反馈稀疏约束的非负张量分解算法

刘亚楠涂铮铮罗斌《计算机应用》2013,33(10):2871-2873

为了充分利用图像本身的结构信息并充分压缩图像数据,把得到的子空间中数据(反馈)的稀疏性作为约束项加入非负张量分解目标函数中,即采用基于反馈稀疏约束的非负张量分解算法对图像集合进行降维。最后,将该算法应用于手写数字图像库中,实验结果表明所提出的方法能有效改善图像分类的准确性相似文献