Similar literature
Found 20 similar documents; search took 31 ms
1.
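Face recognition is an important research topic in pattern recognition. It has broad applications and is receiving growing attention from academia, industry, government, and the military. Face recognition research has two main goals: improving recognition accuracy and reducing training and recognition time. This paper improves the traditional LDB method: it selects wavelet packet decomposition subbands based on a difference separability measure, takes the first- and second-order raw moments of selected coefficients within the selected subbands as face features, and defines a corresponding classification distance. On this basis it proposes a new face recognition method that reduces computational complexity and training and recognition time, preserving real-time performance, while better describing the face features useful for classification and improving recognition accuracy.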
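
The moment features mentioned in this abstract can be illustrated with a minimal sketch. It assumes the selected wavelet packet coefficients are already available as plain lists of numbers; the function names `raw_moments` and `moment_features` and the sample values are hypothetical, and subband selection and the classification distance are not shown.

```python
def raw_moments(coeffs):
    """Return the first- and second-order raw moments of a coefficient list.

    The first raw moment is the mean of x; the second is the mean of x**2.
    """
    values = [float(c) for c in coeffs]
    if not values:
        raise ValueError("need at least one coefficient")
    n = len(values)
    m1 = sum(values) / n
    m2 = sum(v * v for v in values) / n
    return m1, m2


def moment_features(subbands):
    """Concatenate (m1, m2) of every selected subband into one feature vector."""
    features = []
    for coeffs in subbands:
        features.extend(raw_moments(coeffs))
    return features


# Hypothetical coefficients for two selected subbands.
features = moment_features([[1.0, 2.0, 3.0], [0.5, -0.5]])
```

Each subband contributes only two numbers to the feature vector, which is consistent with the abstract's emphasis on low computational cost.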

2.
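A Survey of Skeleton Extraction and Recognition Techniques for Scanned Engineering Images   Total citations: 3 (self-citations: 0, citations by others: 3)
Many algorithms exist for processing and recognizing engineering drawings, such as thinning-based recognition, structure-based recognition, Orthogonal Zig-Zag (orthogonal direction search) recognition, and contour-matching recognition. Drawing on the authors' own work, this paper analyzes and discusses typical contemporary recognition algorithms and puts forward new viewpoints and methods.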

3.
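Emotion recognition is a popular area of human-computer interaction, and its techniques have been applied in medicine, education, safe driving, e-commerce, and other fields. Emotion is expressed mainly through facial expressions, voice, and speech, and features such as facial muscle movement, tone, and intonation differ across emotions. Emotions determined from single-modality features tend to be inaccurate. Since emotional expression is perceived mainly through vision and hearing, this paper proposes a multimodal expression recognition algorithm based on the audio-visual perception system. It extracts emotional features from the speech and image modalities, designs multiple classifiers to run emotion classification experiments on single features, and obtains multiple single-feature expression recognition models. In the multimodal speech and image experiments, a late fusion strategy is used for fusion; given the weak dependence between the models, weighted voting is used to fuse them, yielding a fused expression recognition model built from multiple single-feature models. Experiments on the AFEW dataset compare the fused model with the single-feature models and verify that multimodal emotion recognition based on the audio-visual perception system outperforms single-modality recognition.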
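
Weighted voting over single-feature models, as named in this abstract, can be sketched as follows. This is an illustration only: the function name `weighted_vote`, the labels, and the weights are hypothetical, and the abstract does not say how the paper chooses its weights or breaks ties.

```python
from collections import defaultdict


def weighted_vote(predictions, weights):
    """Fuse one predicted label per model into a single label.

    Each model adds its weight to the score of the label it predicted;
    the label with the highest total wins. Ties go to the label that
    sorts first, so the result is deterministic (an assumption here).
    """
    if len(predictions) != len(weights):
        raise ValueError("need exactly one weight per prediction")
    if not predictions:
        raise ValueError("need at least one prediction")
    scores = defaultdict(float)
    for label, weight in zip(predictions, weights):
        scores[label] += weight
    return max(sorted(scores), key=lambda label: scores[label])


# Hypothetical outputs of one image model and two speech models.
fused = weighted_vote(["happy", "sad", "sad"], [0.6, 0.25, 0.25])
```

With these weights the single strong model outvotes the two weaker ones, which is the behavior that separates weighted voting from a plain majority vote.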

4.
5.
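A Multi-Level Cognitive System Based on the Half-Edge Graph Model   Total citations: 3 (self-citations: 1, citations by others: 2)
For cognitive problems that are multi-level and complex, this paper proposes a dynamically proliferating, multi-level, self-organizing cognitive system. Each level uses a formally consistent knowledge representation, and four knowledge-processing models at each level (self-organizing association, self-organizing aggregation, reduction, and sample expression) are the core models that realize the system's self-organizing level proliferation. The concepts of information granule and container are introduced: the information-granule evolution process simulates the static reduction of cognition, while the container evolution process corresponds to the dynamic simulation of cognition. At every level of the system, these two processes carry out a complete simulated cognition and reduction expression for each input sample. With self-organizing graph calculus as the theoretical model, detailed design specifications are given for information processing and transfer within each level and between adjacent levels.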

6.
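Deep Learning and Its New Progress in Object and Behavior Recognition   Total citations: 12 (self-citations: 7, citations by others: 5)
Deep learning is a new research area in machine learning. Building deep networks to extract features is a research direction currently attracting attention in object and behavior recognition. To encourage more computer vision researchers to explore and discuss deep learning, and to advance research on object and behavior recognition, this paper surveys deep learning and its recent progress in object and behavior recognition. It first introduces the basic state of research, main concepts, and principles of deep learning; then presents recent progress in applying deep learning to object and behavior recognition; and finally discusses the relationship between deep learning and neural networks, the strengths and weaknesses of deep learning, and the main open problems in deep learning theory. It should be helpful to researchers who intend to apply deep learning to object and behavior recognition.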

7.
The evolution of robust speech recognition systems that maintain a high level of recognition accuracy in difficult and dynamically-varying acoustical environments is becoming increasingly important as speech recognition technology becomes a more integral part of mobile applications. In distributed speech recognition (DSR) architecture the recogniser's front-end is located in the terminal and is connected over a data network to a remote back-end recognition server. The terminal performs the feature parameter extraction, or the front-end of the speech recognition system. These features are transmitted over a data channel to the remote back-end recogniser. DSR provides particular benefits for the applications of mobile devices such as improved recognition performance compared to using the voice channel and ubiquitous access from different networks with a guaranteed level of recognition performance. A feature extraction algorithm integrated into the DSR system is required to operate in real-time as well as with the lowest possible computational costs. In this paper, two innovative front-end processing techniques for noise robust speech recognition are presented and compared, time-domain based frame-attenuation (TD-FrAtt) and frequency-domain based frame-attenuation (FD-FrAtt). These techniques include different forms of frame-attenuation, improvement of spectral subtraction based on minimum statistics, as well as a mel-cepstrum feature extraction procedure. Tests are performed using the Slovenian SpeechDat II fixed telephone database and the Aurora 2 database together with the HTK speech recognition toolkit. The results obtained are especially encouraging for mobile DSR systems with limited sizes of available memory and processing power.

8.
9.
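Different subregions of a face image contribute differently to expression recognition, and the same subregion contributes differently for people of different age groups (e.g. middle-aged and elderly, young adults, children). A single fixed subregion weighting scheme therefore cannot achieve the best recognition rate. To improve the recognition rate, an expression recognition method with variable weights is proposed. Separate expression databases are built for middle-aged and elderly people, young adults, and children, and the pure face region, eye region, and mouth region are segmented. Features extracted from these regions are fused by weighting, and different weight settings are used to study their effect on expression recognition across age groups. Experimental results show that variable weights give a clearly higher recognition rate than fixed weights: the expression recognition rate improves by 8.6% for middle-aged and elderly people, 4.8% for young adults, and 1.4% for children.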
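
The idea of age-dependent region weights can be sketched as below. Everything concrete here is an assumption made for illustration: the weight values, the age-group keys, the function name `fuse_regions`, and the choice of fusing by scaling each region's feature vector and concatenating the results. The abstract does not give the paper's actual weights or fusion formula.

```python
# Hypothetical weights for the (face, eyes, mouth) regions per age group.
REGION_WEIGHTS = {
    "older_adult": (0.5, 0.2, 0.3),
    "young_adult": (0.4, 0.3, 0.3),
    "child": (0.3, 0.3, 0.4),
}


def fuse_regions(face, eyes, mouth, age_group):
    """Scale each region's features by its age-specific weight and concatenate."""
    w_face, w_eyes, w_mouth = REGION_WEIGHTS[age_group]
    fused = [w_face * x for x in face]
    fused += [w_eyes * x for x in eyes]
    fused += [w_mouth * x for x in mouth]
    return fused


# Hypothetical one-dimensional features for each region.
fused = fuse_regions([2.0], [4.0], [10.0], "older_adult")
```

Switching `age_group` changes only the weights, so the same extracted features can be re-fused for each age group when comparing weight settings.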

10.
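Object recognition is a major challenge in computer vision. With the development of deep learning, object recognition algorithms are widely used to recognize and monitor objects in video data. This paper summarizes existing object recognition algorithms and divides the mainstream algorithms into two classes, Anchor-Based and Anchor-Free, according to whether they use an anchor mechanism. For Anchor-Based algorithms such as R-CNN, SPP-Net, SSD, and YOLOv2, it analyzes the differences and respective strengths of region-based and regression-based algorithms in terms of candidate box generation, feature extraction, and result generation. For Anchor-Free algorithms such as CornerNet, ExtremeNet, CenterNet, and FCOS, it analyzes the differences and respective strengths of keypoint-based and feature-pyramid-based algorithms in terms of feature extraction, keypoint selection/hierarchical structure, and result generation. On this basis, eight representative algorithms, including Faster R-CNN, Mask R-CNN, and SSD, are compared and summarized using recognition efficiency and recognition accuracy as evaluation metrics. Finally, addressing problems such as time-consuming data preprocessing, low accuracy in simultaneous multi-scale feature recognition, and overly complex structures, the paper analyzes the shortcomings of current research and future research directions.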

11.
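This paper first presents an angle-based dynamics model and its feature values, and proposes a static feature recognition algorithm based on an SCG neural network and a dynamic feature recognition algorithm based on template matching. A dynamic gesture recognition system built with the dynamic time warping algorithm and gesture segmentation algorithm proposed in the paper is shown in practice to have good real-time performance and recognition rate.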
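
For readers unfamiliar with dynamic time warping, a minimal sketch of the textbook algorithm combined with nearest-template matching follows. This is not the paper's own variant, which the abstract does not describe. The names `dtw_distance` and `classify`, the one-dimensional sequences, and the gesture labels are hypothetical.

```python
def dtw_distance(a, b):
    """Classic dynamic time warping distance between two numeric sequences."""
    if not a or not b:
        raise ValueError("both sequences must be non-empty")
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = best alignment cost of a[:i] against b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def classify(sample, templates):
    """Return the label of the template closest to the sample under DTW."""
    return min(templates, key=lambda label: dtw_distance(sample, templates[label]))


# Hypothetical angle sequences for two gesture templates.
templates = {"wave": [0, 1, 0, 1, 0], "raise": [0, 1, 2, 3, 4]}
label = classify([0, 0, 1, 2, 2, 3, 4], templates)
```

Because DTW lets one sequence repeat samples while the other advances, a gesture performed more slowly than its template can still match it closely.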

12.
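A Survey of Face Recognition from a Single Training Sample   Total citations: 1 (self-citations: 0, citations by others: 1)
This paper briefly introduces and systematically classifies the single-sample face recognition techniques and methods that have appeared in China and abroad in recent years, and analyzes the strengths and weaknesses of each. It sets out the challenges facing single-sample face recognition and looks ahead to its future development.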

13.
We propose an approach to achieving early recognition of gesture patterns. Early recognition is a method for recognizing sequential patterns at their earliest stage. Therefore, in the case of gesture recognition, we can get a recognition result for human gestures before the gestures are finished. The most difficult problem in early recognition is knowing when the system has determined the result. Most traditional approaches suffer from this problem, since gestures are often ambiguous. At the start of a gesture, in particular, it is very difficult to determine the recognition result since insufficient input data have been observed. Therefore, we have improved on the traditional approach by using a self-organizing map.

14.
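Human action recognition is a closely watched topic in computer vision research. Most existing action recognition methods are supervised and need large amounts of labeled data to train the recognition model. In real applications, however, labeled data is expensive, whereas unlabeled data is easy to obtain. This paper proposes Co-KNN-SVM, a new human action recognition algorithm based on hybrid co-training. The algorithm builds base classifiers from different types of methods in the action recognition field and trains them on each other iteratively to improve generalization, which lowers labeling cost and lets the different recognition methods complement each other. It also improves the selection of pseudo-labeled data and the iterative training strategy in co-training, effectively controlling the noise from pseudo-labeled data and improving recognition. Experimental results show that the proposed algorithm can effectively recognize human actions in video.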

15.
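Handwritten digit recognition has been a research hotspot for many years and is a special problem within character recognition. Because handwritten digits vary widely in style, traditional recognition methods struggle to reach a high recognition rate. To address the complexity and limitations of traditional digit recognition methods, a handwritten digit recognition method based on a BP neural network is proposed. It extracts point features and stroke density features from handwritten digits and uses an improved BP neural network for training and recognition. In experiments the recognition rate reached 94%. The results show that the method works well for handwritten digit recognition: it simplifies the traditional recognition process and improves recognition accuracy.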

16.
Gait recognition is one measure of biometrics, which also includes facial, fingerprint, and retina recognition. Although most biometric methods require direct contact between a device and a subject, gait recognition has unique characteristics whereby interaction with the subjects is not required and can be performed from a distance. Cameras are commonly used for gait recognition, and a number of researchers have used depth information obtained using an RGB-D camera, such as the Microsoft Kinect. Although depth-based gait recognition has advantages, such as robustness against light conditions or appearance variations, there are also limitations. For instance, the RGB-D camera cannot be used outdoors and the measurement distance is limited to approximately 10 meters. The present paper describes a long short-term memory-based method for gait recognition using a real-time multi-line LiDAR. Very few studies have dealt with LiDAR-based gait recognition, and the present study is the first attempt to combine LiDAR data and long short-term memory for gait recognition and focuses on dealing with different appearances. We collect the first gait recognition dataset that consists of time-series range data for 30 people with clothing variations and show the effectiveness of the proposed approach.

17.
Tone information is very important to speech recognition in a tonal language such as Thai. In this article, we present a method for isolated Thai tone recognition. First, we define three sets of tone features to capture the characteristics of Thai tones and employ a feedforward neural network to classify tones based on these features. Next, we describe several experiments using the proposed features. The experiments are designed to study the effect of initial consonants, vowels, and final consonants on tone recognition. We find that there are some correlations between tones and other phonemes, and the recognition performances are satisfactory. A human perception test is then conducted to judge the recognition rate. The recognition rate of a human is much lower than that of a machine. Finally, we explore various combination schemes to enhance the recognition rate. Further improvements are found in most experiments.

18.
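In face recognition, extracting face features and reducing dimensionality are key. Traditional wavelet-based face recognition algorithms extract image features for classification only from the low-frequency component of the wavelet decomposition, which loses some information in the high-frequency components that is useful for recognition. To extract face image features more effectively, a face recognition algorithm based on the wavelet transform and weighted feature fusion is proposed. The face image is first reduced in dimension by a wavelet transform; principal component analysis (PCA) is then applied to each of the four wavelet subimages to extract features, and the four sets of features are fused by weighting; finally a support vector machine (SVM) performs classification. In experiments on the ORL face database the recognition accuracy reaches 97.5%. The results show that the algorithm effectively improves face recognition, with higher recognition accuracy and speed than traditional recognition algorithms.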
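
The four wavelet subimages mentioned in this abstract can be illustrated with a one-level 2-D transform. The sketch below assumes the Haar wavelet, which the abstract does not specify, and an image with even dimensions given as a list of rows. The function name `haar_dwt2` and the subband names are hypothetical, and the PCA, weighted fusion, and SVM stages are not shown.

```python
def haar_dwt2(image):
    """One-level 2-D Haar transform of an even-sized grayscale image.

    Returns four subimages, each half the size of the input per axis:
    approx (low-frequency), row_diff, col_diff, and diag (high-frequency).
    """
    rows, cols = len(image), len(image[0])
    if rows % 2 or cols % 2:
        raise ValueError("image dimensions must be even")
    approx, row_diff, col_diff, diag = [], [], [], []
    for r in range(0, rows, 2):
        a_row, r_row, c_row, d_row = [], [], [], []
        for c in range(0, cols, 2):
            # The 2x2 block of pixels handled in this step.
            p, q = image[r][c], image[r][c + 1]
            s, t = image[r + 1][c], image[r + 1][c + 1]
            a_row.append((p + q + s + t) / 2)  # local average
            r_row.append((p + q - s - t) / 2)  # top row minus bottom row
            c_row.append((p - q + s - t) / 2)  # left column minus right column
            d_row.append((p - q - s + t) / 2)  # diagonal difference
        approx.append(a_row)
        row_diff.append(r_row)
        col_diff.append(c_row)
        diag.append(d_row)
    return approx, row_diff, col_diff, diag


approx, row_diff, col_diff, diag = haar_dwt2([[1, 2], [3, 4]])
```

Each subimage has a quarter of the original pixels, which is the dimensionality reduction step; the three difference subimages carry the high-frequency detail that methods using only the low-frequency component discard.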

19.
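Intelligent speech technology covers speech recognition, natural language processing, and speech synthesis. Speech recognition is the key technology for human-computer interaction, and a recognition system usually requires an acoustic model and a language model. The rise of neural networks has sharply increased the number of acoustic models, and combining neural-network acoustic models with traditional recognition models has greatly advanced speech recognition. As the front end of human-computer interaction, speech recognition has many research directions. This paper summarizes the state of acoustic model research in three of them (text recognition, speaker recognition, and emotion recognition), describes the evolution of speech recognition technology in as much detail as possible, and aims to provide a useful reference for later research. It also compares the current mainstream speech recognition methods, introduces the advantages of end-to-end speech recognition models, analyzes development trends, and finally sets out the challenges currently facing speech recognition tasks.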

20.
Feature Extraction Using Independent Components of Each Category   Total citations: 1 (self-citations: 0, citations by others: 1)
We describe an application of independent component analysis (ICA) to pattern recognition in order to evaluate the effectiveness of features extracted by ICA. We propose a recognition method suitable for independent components that consists of modules for each category. A module has two parts: feature extraction and classification. Features are independent components estimated by ICA and outputs of modules are candidates for categories. These candidates are combined and categories are decided with a majority rule. This recognition method is applied to two tasks: hand-written digits in the MNIST database and acoustic diagnosis for a compressor as real-world tasks. A FastICA algorithm is applied to extracting independent features in the proposed method. Through recognition experiments, we demonstrate that the ICA of each category extracts useful features for these tasks and the independent components are superior to the principal components in recognition accuracy. Manabu Kotani - Deceased
