Similar literature
20 similar documents found
1.
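To improve speech recognition rates in noisy environments, independent component analysis (ICA) is applied to speech feature extraction, and a genetic algorithm (GA) is used to select among the extracted high-dimensional features. The resulting speech features are used in a speech recognition application based on Gaussian mixture models and compared with conventional Mel-frequency cepstral coefficient (MFCC) features. Experimental results show that the ICA- and GA-based speech features outperform conventional MFCC features.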

2.
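Research on speech anti-interference in a general-purpose programmable multi-standard transceiver digital simulation platform   Cited by: 1 (self: 0, others: 1)
This paper discusses speech anti-interference in a general-purpose programmable multi-standard transceiver digital simulation platform implemented purely in software on a general-purpose computer. For a Gaussian amplitude-modulated noise environment, short-time parameters of the speech signal are used to identify the silent and voiced periods and thereby provide a noise reference signal; single-channel noisy speech is then processed with an adaptive-filter-based noise cancellation method. Experimental results show that the speech enhancement is satisfactory.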
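
The adaptive noise cancellation step in this abstract can be illustrated with a minimal sketch. It assumes a least-mean-squares (LMS) filter, which the abstract does not name, and it assumes the noise reference is already available as a separate sequence (the abstract derives it from the silent periods). The function name `lms_noise_cancel`, the tap count, the step size `mu`, and the synthetic signals are all illustrative choices, not taken from the paper.

```python
import math
import random


def lms_noise_cancel(primary, reference, num_taps=8, mu=0.01):
    """Subtract an adaptively filtered noise reference from the primary signal.

    Returns the error signal, which serves as the enhanced output.
    """
    weights = [0.0] * num_taps
    output = []
    for n in range(len(primary)):
        # Most recent num_taps reference samples, zero-padded at the start.
        window = [reference[n - k] if n - k >= 0 else 0.0 for k in range(num_taps)]
        noise_estimate = sum(w * x for w, x in zip(weights, window))
        error = primary[n] - noise_estimate
        # LMS update: move the weights along the instantaneous gradient.
        weights = [w + 2.0 * mu * error * x for w, x in zip(weights, window)]
        output.append(error)
    return output


def mean_square_error(a, b, start):
    """Mean squared difference between two sequences from index start onward."""
    diffs = [(x - y) ** 2 for x, y in zip(a[start:], b[start:])]
    return sum(diffs) / len(diffs)


# Synthetic demo: a sine stands in for speech, white noise for the interference.
random.seed(0)
n_samples = 4000
clean = [math.sin(2.0 * math.pi * 0.01 * n) for n in range(n_samples)]
noise = [random.gauss(0.0, 1.0) for _ in range(n_samples)]
# The noise reaches the primary channel through a hypothetical 2-tap path.
primary = [
    clean[n] + 0.8 * noise[n] + (0.3 * noise[n - 1] if n > 0 else 0.0)
    for n in range(n_samples)
]
enhanced = lms_noise_cancel(primary, noise)

# Compare the error against the clean signal after the filter has adapted.
half = n_samples // 2
noisy_power = mean_square_error(primary, clean, half)
residual_power = mean_square_error(enhanced, clean, half)
```

In this sketch the step size trades convergence speed against residual error; a real system would also need the voiced/silent decision that supplies the reference, which is not shown here.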

3.
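The authors' previously proposed two-factor Gaussian process latent variable model (TF-GPLVM) does not account for the dynamic characteristics of speech when used for voice conversion, and it requires many parameters to be estimated during training. To address these problems, a hidden Markov model (HMM) is introduced to model the dynamic characteristics of speech, and the HMM hidden states are used to perform a probabilistic soft classification of each speech frame with respect to semantic content. This yields a two-factor Gaussian process dynamic model (TF-GPDM) with higher separation accuracy and lower computational load. Based on this model, a new vocal-tract spectrum conversion scheme based on speaker-feature replacement is designed. Subjective and objective experiments show that, compared both with conventional statistical mapping and frequency warping conversion methods and with the TF-GPLVM method, the proposed method improves speech quality and conversion similarity and achieves a better balance between the two.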

4.
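The generalized Gamma model is a recently proposed speech distribution model that is more general and flexible than conventional Gaussian or super-Gaussian models. A speech enhancement algorithm based on the generalized Gamma speech model and speech presence probability correction is proposed. Assuming that the magnitude spectral coefficients of speech and noise follow generalized Gamma and Gaussian distributions respectively, a minimum mean-square error (MMSE) estimator of the speech log-spectrum is derived; the speech presence probability is then derived under the same model and used to correct the MMSE estimate. Simulation results show that, compared with conventional short-time spectral estimation algorithms, the algorithm further improves the signal-to-noise ratio of the enhanced speech, effectively reduces its distortion, and improves its subjective perceptual quality.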

5.
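Speech enhancement based on Gaussian process models   Cited by: 2 (self: 2, others: 0)
沈赟  张丽清 《计算机工程》2010,36(5):162-164
For the voice activity detection problem in signal processing, a probabilistic method based on a Gaussian process prior assumption is proposed for speech enhancement. The posterior of the Gaussian process model is used to estimate the clean speech, and the Gaussian process model parameters obtained during learning are used to detect voice activity. Experimental results show that the method enhances speech well in both white-noise and colored-noise environments.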

6.
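Speech corpora for Mongolian speech recognition are currently relatively scarce, but a large amount of Mongolian audio, such as TV dramas and radio broadcasts, exists together with corresponding text. This paper proposes a speech-recognition-based method for automatically aligning long Mongolian audio with text, which enables automatic annotation of Mongolian TV drama speech and expands the Mongolian speech corpus. In the front-end processing stage, voice endpoint detection based on Gaussian mixture models is used to identify and remove noise segments; in the speech recognition stage, a Mongolian acoustic model based on a feedforward sequential memory network is built; finally, based on a vector space model, the hypothesis sequence produced by speech recognition is matched against the reference phoneme sequence at the sentence level using the dynamic time warping algorithm. Experimental results show that, compared with speech alignment based on the Needleman-Wunsch algorithm, the proposed method improves alignment accuracy by 31.09%.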
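
The sentence-level matching step can be sketched with a textbook dynamic time warping (DTW) routine. This is a minimal illustration only: it assumes a 0/1 symbol mismatch cost in place of the vector-space-model similarity the abstract mentions, and the names `dtw_distance` and `best_match` are hypothetical.

```python
def dtw_distance(hyp, ref):
    """Classic DTW cost between two symbol sequences with a 0/1 mismatch cost."""
    inf = float("inf")
    n, m = len(hyp), len(ref)
    # table[i][j] is the best cost of aligning hyp[:i] with ref[:j].
    table = [[inf] * (m + 1) for _ in range(n + 1)]
    table[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0.0 if hyp[i - 1] == ref[j - 1] else 1.0
            table[i][j] = cost + min(
                table[i - 1][j],      # hyp symbol repeats against the same ref symbol
                table[i][j - 1],      # ref symbol repeats against the same hyp symbol
                table[i - 1][j - 1],  # both sequences advance
            )
    return table[n][m]


def best_match(hyp, candidates):
    """Index of the candidate reference sequence with the lowest DTW cost."""
    costs = [dtw_distance(hyp, ref) for ref in candidates]
    return costs.index(min(costs))
```

For example, `best_match(["a", "b", "b", "c"], [["b", "a"], ["a", "b", "c"]])` selects the second candidate, because the repeated `"b"` is absorbed by the warping path at no cost.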

7.
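Research on a text-independent speaker verification system based on conversational speech   Cited by: 1 (self: 2, others: 1)
This paper builds a text-independent speaker verification system based on conversational speech. The key difference from conventional text-independent speaker verification systems is that the training and test speech no longer contain a single speaker but are conversations, so the speech segments belonging to different speakers must be separated in order to build speaker models and reach a final decision. The paper describes in detail the framework of the Gaussian mixture model-universal background model (GMM-UBM) speaker verification system and focuses on an unsupervised speech segmentation algorithm based on the GLR (Generalized Likelihood Ratio) distance measure. Finally, it presents the output score normalization methods, namely ZNORM (Zero Normalization) and duration correction, which improve verification performance by nearly 10%.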
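
A minimal sketch of ZNORM as it is commonly defined: a raw verification score is shifted and scaled by the mean and standard deviation of the scores that impostor utterances obtain against the same speaker model. The function name `znorm` and the use of the population standard deviation are assumptions; the abstract gives neither the exact formula nor the details of the duration correction, which is not shown.

```python
import statistics


def znorm(raw_score, impostor_scores):
    """Normalize a score using impostor score statistics for one speaker model."""
    mu = statistics.mean(impostor_scores)
    sigma = statistics.pstdev(impostor_scores)
    if sigma == 0:
        raise ValueError("impostor scores have zero spread; cannot normalize")
    return (raw_score - mu) / sigma
```

After normalization, scores from different speaker models share a common scale, so a single decision threshold can be applied across speakers.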

8.
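Because existing speech intelligibility evaluation methods do not closely reflect how the human ear perceives speech, a bispectral-feature speech intelligibility prediction algorithm based on human auditory characteristics is proposed (Gammatone-bspectral speech intelligibility metric, GBSIM). The algorithm exploits the ability of the bispectrum to detect nonlinear phase coupling in speech signals and to suppress Gaussian noise in non-Gaussian signals. A Gammatone filter bank, which can simulate an artificial cochlea model, divides the input speech signal into 32 auditory sub-bands; third-order statistics are used to estimate the bispectrum of the speech signal in each sub-band, and a single feature value is extracted to compute speech intelligibility. Validation on examples shows that the method is sensitive to changes in signal distortion, that its results correlate highly with subjective evaluation, and that it performs better than conventional speech intelligibility evaluation algorithms.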

9.
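In human-machine speech interaction systems, a machine should be able not only to understand human speech but also to recognize the speaker's emotion. This paper proposes an improved method for sequence classification and recognition based on the Gaussian mixture model (GMM) and applies it to speech emotion recognition. It also proposes a method for balancing the order of observations. Experimental results show that the new method effectively improves the accuracy of speech emotion recognition.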

10.
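To take full account of inter-frame correlation in voice conversion, a voice conversion method based on convolutive non-negative matrix factorization is proposed. The time-frequency bases obtained by convolutive non-negative matrix factorization preserve well the speaker-specific information and inter-frame correlation in the speech signal. Exploiting this property, in the training stage matched time-frequency bases of the source and target speakers are extracted from the training data by convolutive non-negative matrix factorization; in the conversion stage, the source speaker's speech is converted by replacing the time-frequency bases. Compared with conventional methods, this method better preserves and converts inter-frame correlation. Simulations and subjective and objective evaluations show that, compared with voice conversion methods based on Gaussian mixture models and state-space models, the method achieves better converted speech quality and conversion similarity.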

11.
The crossover operator has been considered "the centre of the storm" in genetic programming (GP). However, many existing GP approaches to object recognition suggest that the standard GP crossover is not sufficiently powerful in producing good child programs due to the totally random choice of the crossover points. To deal with this problem, this paper introduces an approach with a new crossover operator in GP for object recognition, particularly object classification. In this approach, a local hill-climbing search is used in constructing good building blocks, a weight called looseness is introduced to identify the good building blocks in individual programs, and the looseness values are used as heuristics in choosing appropriate crossover points to preserve good building blocks. This approach is examined and compared with the standard crossover operator and the headless chicken crossover (HCC) method on a sequence of object classification problems. The results suggest that this approach outperforms the HCC, the standard crossover, and the standard crossover operator with hill climbing on all of these problems in terms of the classification accuracy. Although this approach takes slightly longer than the standard crossover operator, it significantly improves the system efficiency over the HCC method.

12.
高飞 《计算机工程》2007,33(14):130-132
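To obtain binary sequences with good randomness and sensitivity to initial values, a method for generating generalized chaotic sequences constructed by interpolation is presented, building on existing chaotic systems and using the Bernstein function. Experimental results show that the generated generalized chaotic sequences have very good pseudo-random properties. Nonlinear discretization of one- and two-dimensional generalized chaotic sequences effectively improves the security of the discretization process. An application of the generated binary sequences in an image encryption algorithm is given; the algorithm encrypts images well.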

13.
A variety of metaheuristic approaches have emerged in recent years for solving the resource-constrained project scheduling problem (RCPSP), a well-known NP-hard problem in scheduling. In this paper, we propose a Neurogenetic approach which is a hybrid of genetic algorithms (GA) and neural-network (NN) approaches. In this hybrid approach the search process relies on GA iterations for global search and on NN iterations for local search. The GA and NN search iterations are interleaved in a manner that allows NN to pick the best solution thus far from the GA pool and perform an intensification search in the solution's local neighborhood. Similarly, good solutions obtained by NN search are included in the GA population for further search using the GA iterations. Although both the GA and NN approaches independently give good solutions, we found that the hybrid approach gives better solutions than either approach independently for the same number of shared iterations. We demonstrate the effectiveness of this approach empirically on the standard benchmark problems of size J30, J60, J90 and J120 from PSPLIB.

14.
Synthesizing networks that satisfy multiple requirements, such as high reliability, low diameter, good embeddability, etc., is a difficult problem to which there has been no completely satisfactory solution. We present a simple, yet very effective, approach to this problem. The crux of our approach is a filtration process that takes as input a large set of randomly generated graphs and filters out those that do not meet the specified requirements. Our experimental results show that this approach is both practical and powerful. The use of random regular networks as the raw material for the filtration process was motivated by their surprisingly good performance with regard to almost all properties that characterize a good interconnection network. We provide results related to the generation of networks that have low diameter, high fault tolerance, and good embeddability. Through this, we show that the generated networks are serious competitors to several traditional well-known networks. We also explore how random networks can be used in a packaging hierarchy and comment on the scope of application of these networks.
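
The filtration idea in this abstract can be sketched as "generate many random regular graphs, keep those that pass a test". The sketch below makes several assumptions that are not from the paper: graphs are sampled with the pairing model and rejection of non-simple results, the only requirement checked is a diameter bound, and all function names are hypothetical.

```python
import random
from collections import deque


def random_regular_graph(n, d, rng):
    """Sample a simple d-regular graph on n nodes as an adjacency dict."""
    if (n * d) % 2 != 0 or d >= n:
        raise ValueError("need n * d even and d < n")
    while True:
        # Each node contributes d stubs; a random pairing of stubs gives the edges.
        stubs = [v for v in range(n) for _ in range(d)]
        rng.shuffle(stubs)
        edges = set()
        simple = True
        for i in range(0, len(stubs), 2):
            u, v = stubs[i], stubs[i + 1]
            edge = (min(u, v), max(u, v))
            if u == v or edge in edges:
                simple = False  # self-loop or repeated edge: reject and retry
                break
            edges.add(edge)
        if simple:
            adj = {v: set() for v in range(n)}
            for u, v in edges:
                adj[u].add(v)
                adj[v].add(u)
            return adj


def diameter(adj):
    """Largest shortest-path length, or None if the graph is disconnected."""
    best = 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
        if len(dist) < len(adj):
            return None
        best = max(best, max(dist.values()))
    return best


def filter_graphs(candidates, max_diameter):
    """Keep only connected graphs whose diameter does not exceed the bound."""
    kept = []
    for graph in candidates:
        dia = diameter(graph)
        if dia is not None and dia <= max_diameter:
            kept.append(graph)
    return kept
```

Further requirements such as fault tolerance or embeddability would be added as extra predicates in `filter_graphs`; they are omitted here to keep the sketch short.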

15.
Cyber–physical systems are becoming increasingly complex. In these advanced systems, the different engineering domains involved in the design process become more and more intertwined. Therefore, a traditional (sequential) design process becomes inefficient in finding good design options. Instead, an integrated approach is needed where parameters in multiple different engineering domains can be chosen, evaluated, and optimized to achieve a good overall solution. However, in such an approach, the combined design space becomes vast. As such, methods are needed to mitigate this problem. In this paper, we show a method for systematically capturing and updating domain knowledge in the context of a co-design process involving different engineering domains, i.e. control and embedded. We rely on ontologies to reason about the relationships between parameters in the different domains. This allows us to derive a stepwise design space exploration workflow where this domain knowledge is used to quickly reduce the design space to a subset of likely good candidates. We illustrate our approach by applying it to the design space exploration process for an advanced electric motor control system and its deployment on embedded hardware.

16.
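Particle segmentation in binary images and its application   Cited by: 2 (self: 0, others: 2)
A method for segmenting particles in binary images is proposed. It uses the shape features of touching particles to separate them. For particles with convex edges and no holes, satisfactory results are obtained even when several particles touch, and the separation is fast. The method is applied to particle separation in quantitative metallographic image processing, with good results.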

17.
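A hybrid genetic algorithm for automatic multi-threshold image segmentation   Cited by: 8 (self: 1, others: 7)
The fuzzy C-means algorithm is an effective data clustering method, and the genetic algorithm is a highly parallel, randomized, adaptive search algorithm modeled on natural selection and natural genetic mechanisms in biology. By combining the genetic algorithm with the fuzzy C-means algorithm and taking the two-dimensional gray-level information of the image into account, this paper proposes a new scheme for automatic multi-threshold image segmentation. The scheme segments quickly and correctly without requiring the number of classes to be fixed in advance, and the experimental results are satisfactory.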
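
For orientation, the fuzzy C-means component alone can be sketched on one-dimensional gray levels. This is not the hybrid scheme of the abstract: the genetic algorithm and the two-dimensional gray-level information are left out, the number of clusters is fixed by the initial centers (which the paper's scheme avoids), and taking thresholds as midpoints between neighboring centers is an assumption made here for illustration. All names are hypothetical.

```python
def fuzzy_c_means(values, initial_centers, m=2.0, iterations=50):
    """Plain fuzzy C-means on scalars; returns (centers, memberships).

    The returned memberships are those computed in the final iteration,
    just before the last center update.
    """
    centers = [float(c) for c in initial_centers]
    memberships = []
    exponent = 2.0 / (m - 1.0)
    for _ in range(iterations):
        memberships = []
        for x in values:
            dists = [abs(x - c) for c in centers]
            if any(d == 0 for d in dists):
                # A point sitting exactly on a center belongs fully to it.
                hits = [1.0 if d == 0 else 0.0 for d in dists]
                row = [h / sum(hits) for h in hits]
            else:
                row = [
                    1.0 / sum((di / dj) ** exponent for dj in dists)
                    for di in dists
                ]
            memberships.append(row)
        for i in range(len(centers)):
            # Each center becomes the mean of the data weighted by membership^m.
            weights = [row[i] ** m for row in memberships]
            total = sum(weights)
            if total > 0:
                centers[i] = sum(w * x for w, x in zip(weights, values)) / total
    return centers, memberships


def thresholds_from_centers(centers):
    """Midpoints between neighboring cluster centers (an illustrative choice)."""
    ordered = sorted(centers)
    return [(a + b) / 2.0 for a, b in zip(ordered, ordered[1:])]


# Toy data: two groups of gray levels, dark and bright.
gray_levels = [10, 12, 11, 200, 205, 198]
centers, memberships = fuzzy_c_means(gray_levels, [0, 255])
thresholds = thresholds_from_centers(centers)
```

With `k` initial centers the sketch yields `k - 1` thresholds, so multi-threshold segmentation corresponds to running it with more than two centers.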

18.
袁姮  王志宏  姜文涛 《控制与决策》2017,32(10):1739-1748
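A new three-dimensional face recognition method based on the neighborhood structure of singular points is proposed. First, the target region is segmented on the face texture image and divided into feature sub-regions, and two-dimensional singular points and their neighborhood structures are extracted. Then the three-dimensional singular points and their neighborhood structures are marked on the spatial geometric information of the face, and the three-dimensional information of the singular points and their neighborhood structures is used to represent facial features. Finally, a closest-point method on the singular-point neighborhood structures is used to identify the face. Experimental results show that the proposed method achieves high recognition accuracy and good robustness in three-dimensional face recognition.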

19.
This paper proposes a new approach for designing stable adaptive fuzzy controllers, which employs a hybridization of a conventional Lyapunov-theory-based approach and a particle swarm optimization (PSO) based stochastic optimization approach. The objective is to design a self-adaptive fuzzy controller, optimizing both its structures and free parameters, such that the designed controller can guarantee desired stability and can simultaneously provide satisfactory performance. The design methodology for the controller simultaneously utilizes the good features of PSO (capable of providing good global search capability, required to provide a high degree of automation) and Lyapunov-based tuning (providing fast adaptation utilizing a local search method). Three different variants of the hybrid controller are proposed in this paper. These variants are implemented for benchmark simulation case studies and real-life experimentation, and their results demonstrate the usefulness of the proposed approach.

20.
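Spoken language understanding plays a very important role in automatic speech translation and human-machine dialogue systems. For automatic speech translation, this paper proposes a Chinese spoken language understanding method that combines statistics and rules. The method uses statistical techniques to acquire semantic rules automatically from a training corpus and generate a semantic classification tree, then uses the tree to parse the words in the Chinese sentence that are closely related to the sentence's shallow semantics, and finally uses a statistical understanding model to combine the parsing results of the individual words to obtain the shallow-semantic domain action of the whole sentence. Experimental results show that the method is accurate and robust and is suitable for shallow semantic understanding of spoken Chinese in restricted domains.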

