共查询到19条相似文献,搜索用时 156 毫秒
1.
详细介绍一种基于神经网络的自学习非特定人语音识别方法,首次介绍一种语音识别知识的自动检验方法——LVV法,给出系统原理图和知识库的自动完善原理;介绍一种LEA判别法,实现梯度牛顿有效结合神经网络快速学习方法,并给出了实验结果。 相似文献
2.
3.
给出了一种应用于电话语音自动拨号的实时语音识别方法。该系统对特定人的语音进行识别,并将识别结果映射成相应的电话号码。实验结果表明该方法具有很高的识别精度和实时的识别速度,并且只需很小的内存空间就可以实现,是一种有效的应用于电话语音自动拨号等方面的语音识别方法。 相似文献
4.
5.
阐述在线语音识别技术,采用Python语言设计实现一个自动成绩录入系统,包含录音、语音识别、向Excel自动成绩填写,以及语音播放功能。 相似文献
6.
针对智能机器人语音校准结果不精准的问题,研究基于深度学习的智能机器人语音自动校准系统。设计语音自动校准引擎A/D电路,通过模拟信号发射范围采集与控制电路原始音频信息,利用紧凑型嵌入式音频接收器接收音频信息。整理与识别音频信息内容,获取语句文本样本集。使用深度学习的正弦和余弦函数编码处理方式构建校正模型的输入部分,通过深度学习的前馈神经网络训练输入样本,完成校正模型输出部分的构建。将训练后的样本输入到校正模型中,得到校正后的文本,实现智能机器人语音自动校准。由实验结果可知,该系统两种指令下的振幅波动范围分别为9~22 dB和7~21 dB,与实际振幅波动情况一致,具有精准校准结果。 相似文献
7.
8.
运用TMS320C5416实现了语音自动识别装置。该装置利用一种新的语音信号r阶的倒谱线性回归系数等参数构成识别的特征矢量集,运用模糊矢量量化技术实现了特定人的语音识别。实验结果表明该系统具有识别精度高、识别速度快等特点.是一种语音自动识别装置的有效的硬件实现方案。 相似文献
9.
10.
语音静噪实现的方法很多,本文主要研究"载波+音频静噪法"法,该方法结合了抗干扰算法中基于Goertzel算法的载频估计方法以及常规基带话音静噪的不同特点来实现话音静噪。利用基于Goertzel算法的载频估计静噪方法实现有无信号或话音质量的好坏的识别和处理,而结合基带话音静噪可有效将有载波而无调制的信号识别出来,该方法不需对音频信号作复杂的识别处理。通过话音静噪处理,能够实现解调输出话音自动通断,提高语音质量等级。 相似文献
11.
Neural networks for statistical recognition of continuous speech 总被引:4,自引:0,他引:4
Morgan N. Bourlard H.A. 《Proceedings of the IEEE. Institute of Electrical and Electronics Engineers》1995,83(5):742-772
In recent years there has been a significant body of work, both theoretical and experimental, that has established the viability of artificial neural networks (ANN's) as a useful technology for speech recognition. It has been shown that neural networks can be used to augment speech recognizers whose underlying structure is essentially that of hidden Markov models (HMM's). In particular, we have demonstrated that fairly simple layered structures, which we lately have termed big dumb neural networks (BDNN's), can be discriminatively trained to estimate emission probabilities for an HMM. Recently simple speech recognition systems (using context-independent phone models) based on this approach have been proved on controlled tests, to be both effective in terms of accuracy (i.e., comparable or better than equivalent state-of-the-art systems) and efficient in terms of CPU and memory run-time requirements. Research is continuing on extending these results to somewhat more complex systems. In this paper, we first give a brief overview of automatic speech recognition (ASR) and statistical pattern recognition in general. We also include a very brief review of HMM's, and then describe the use of ANN's as statistical estimators. We then review the basic principles of our hybrid HMM/ANN approach and describe some experiments. We discuss some current research topics, including new theoretical developments in training ANN's to maximize the posterior probabilities of the correct models for speech utterances. We also discuss some issues of system resources required for training and recognition. Finally, we conclude with some perspectives about fundamental limitations in the current technology and some speculations about where we can go from here 相似文献
12.
The problem concerning recognition of single pulses under the action of interferences is discussed by the example of classification of neuron action potentials. Joint applications of wavelets and artificial neural networks in solving the the given problem are analyzed. The recognition method, which is based on wavelet neural networks and ensures adjustment of the synapses of a supplementary (??wavelet??) layer, has been proposed. It is demonstrated that experimental data can efficiently be analyzed via the proposed method. 相似文献
13.
14.
15.
语音识别与理解的研究进展 总被引:1,自引:0,他引:1
本文综述了当前语音识别理解的发展趋势和最新进展。指出美国在不依说话人的大词汇表的连续语音隐马尔柯夫模型识别方面起主导地位,日本在大词汇表的连续语音神经网络识别、模拟人工智能进行语音后处理方面起主导地位,并介绍了国际上最优秀的语音识别理解系统。 相似文献
16.
民航陆空通话对民航飞行安全十分重要,但因其通话模式有特殊的语法结构与发音方式,日常语音识别声学模型无法有效应用于民航陆空通话的语音处理问题。针对民航陆空通话的特殊语境,本文提出了基于双向长短时记忆网络(BiLSTM)的民航陆空通话语音识别方法。首先,提取民航陆空通话语音的FBANK特征作为输入,以时序链式连接(CTC)为目标函数,训练BiLSTM网络得到BiLSTM/CTC模型。然后,利用声学模型,语言模型与陆空通话词典实现民航陆空通话的语音识别,并结合数据增强与数据迁移对模型进行增强训练提高语音识别性能。实验结果表明本文提出的方法适用于民航陆空通话语音识别,并且数据增强模型可有效降低民航陆空通话语音识别的词错误率。 相似文献
17.
Text-to-speech conversion has traditionally been performed either by concatenating short samples of speech or by using rule-based
systems to convert a phonetic representation of speech into an acoustic representation, which is then converted into speech.
This paper describes a text-to-speech synthesis system for modern standard Arabic based on artificial neural networks and
residual excited LPC coder. The networks offer a storage-efficient means of synthesis without the need for explicit rule enumeration.
These neural networks require large prosodically labeled continuous speech databases in their training stage. As such databases
are not available for the Arabic language, we have developed one for this purpose. Thus, we discuss various stages undertaken
for this development process. In addition to interpolation capabilities of neural networks, a linear interpolation of the
coder parameters is performed to create smooth transitions at segment boundaries. A residual-excited all pole vocal tract
model and a prosodic-information synthesizer based on neural networks are also described in this paper. 相似文献
18.
深度学习是当前人工神经网络领域的研究热点,广泛应用于字符识别、图像识别和语音识别等应用中。雷达通信目标识别是通信对抗的前提和关键。文中分析了模板匹配法、DS证据理论等传统通信目标识别方法的在特征提取、模型表达方面的不足,对深度学习神经网络在通信目标识别中的应用进行了初步探讨,并提出了一种基于深度学习的通信目标识别框架。该框架和思路同样适用于雷达对抗目标识别等问题,可为深度学习在雷达目标识别领域的应用提供支撑。 相似文献
19.
Two novel chaotic coding and decoding methods based on artificial neural networks (ANNs) are reported which employ the unimodal logistic map (LM) as an example. Coding is carried out by either modulating the LM or by generating the chaotic sequence with ANNs. In simulations speech has been coded and the resulting SNRsig for the decoded speech has been evaluated. The results demonstrate that the two proposed methods offer a SNRsig improvement of 4 and 20 dB over the SNRsig obtained by using the LMS for decoding 相似文献