Similar literature
20 similar documents found
1.
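A considerable number of people today have, for various reasons, difficulty reading or watching television. How to provide television and video content services to them is a problem that governments, service providers, and public-interest organizations hope to solve as quickly and as well as possible. This paper discusses how voice-assisted playback control of television programs can help these users. It focuses on two solutions, AD (audio description) and a text-to-speech (TTS) based approach, and discusses the implementation requirements of each. It also lists some of the difficulties and challenges that service providers face when implementing and supporting voice-assisted access. The paper closes with a brief description of the two solutions.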

2.
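Based on Component Object Model (COM) technology and the automation technology of the Microsoft Office 2000 suite, a client-side system that reads text e-mail aloud in Outlook 2000 was developed using embedded Visual Basic (VB) application programming. The paper describes in detail the architecture of the Microsoft Speech SDK v5.1 and the process of developing with embedded VB in Outlook 2000. Functionally, the system reads message bodies and attachments aloud automatically; in its user interface, the speech module is seamlessly integrated with Outlook.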

3.
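Wang Weigang (王渭刚). Information Technology (《信息技术》), 2023, (3): 117-121+127
This paper proposes the design of an intelligent automatic English translation system based on TTS technology. A text-to-sound converter and a speech processor are selected and configured. On this basis, TTS technology (text analysis, prosody control, and speech synthesis) is introduced and, in line with the requirements of English translation, the system's software modules are designed: a continuous-speech automatic segmentation and labeling module, a speech prosody control module, a speech synthesis module, and a speech-corpus pruning module. Together, these hardware units and software modules implement the intelligent automatic English translation system. Experimental data show that, compared with the reference systems, the designed system yields smaller deviations in the speech prosody control parameters and a larger speech naturalness factor, which indicates that its translated English speech is more accurate.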

4.
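Based on an analysis of the current state of traffic guidance systems in China, and to address the limited effectiveness of visual guidance facilities, this paper proposes a voice-based traffic guidance method built on TTS technology and gives the schematic diagram and implementation steps of the system design. Experiments show that the text-to-speech conversion and speech synthesis of the voice guidance system perform well, and that the system can play an important role in traffic guidance.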

5.
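Speech echo hiding: technique and analysis   Total citations: 3, self-citations: 0, citations by others: 3
Information hiding is applied to secure speech communication. The paper analyzes and studies a new method for secure speech communication, speech echo hiding, gives a system design that implements the technique, and reports simulation and analysis. The implementation details of the system are discussed in terms of the specific requirements imposed by the characteristics of human hearing.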
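The abstract does not give the system design, so the following is only a minimal sketch of the general echo-hiding idea: each bit is embedded by adding a faint delayed copy of the host signal, with the delay selecting 0 or 1. Everything here is an assumption for illustration: the function names, the delays `d0` and `d1`, the echo gain `alpha`, the segment length, and the use of white noise as a stand-in for speech. The decoder compares plain autocorrelation at the two candidate delays, which is a simplification swapped in for the cepstral analysis usually used on real speech.

```python
import random

def embed_echo(host, bits, seg_len, d0, d1, alpha):
    """Hide one bit per segment by adding an echo with delay d0 (bit 0) or d1 (bit 1)."""
    out = list(host)
    for k, bit in enumerate(bits):
        delay = d1 if bit else d0
        start = k * seg_len
        for n in range(start, start + seg_len):
            if n - delay >= 0:
                out[n] = host[n] + alpha * host[n - delay]
    return out

def autocorr(seg, lag):
    """Unnormalized autocorrelation of one segment at a single lag."""
    return sum(seg[n] * seg[n - lag] for n in range(lag, len(seg)))

def extract_bits(signal, n_bits, seg_len, d0, d1):
    """Decide each bit by which candidate delay shows the stronger echo."""
    bits = []
    for k in range(n_bits):
        seg = signal[k * seg_len:(k + 1) * seg_len]
        bits.append(1 if autocorr(seg, d1) > autocorr(seg, d0) else 0)
    return bits

# Hypothetical parameters; alpha = 0.5 is far too audible for real use
# and is chosen only so that the toy decoder has a wide margin.
SEG_LEN, D0, D1, ALPHA = 2000, 50, 80, 0.5

rng = random.Random(0)
payload = [1, 0, 1, 1, 0, 0, 1, 0]
host = [rng.uniform(-1.0, 1.0) for _ in range(len(payload) * SEG_LEN)]  # noise stand-in for speech
stego = embed_echo(host, payload, SEG_LEN, D0, D1, ALPHA)
recovered = extract_bits(stego, len(payload), SEG_LEN, D0, D1)
```

In a perceptually tuned system the delays and gain would be chosen so that the echo is masked by the ear, which is the kind of auditory constraint the abstract refers to.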

6.
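Gong Xiangqi (宫湘琦). Information & Communications (《信息通信》), 2013, (6): 123-124
The system uses automatic speech recognition (ASR), text-to-speech (TTS), and the Internet Protocol (IP). The voter's telephone is connected to the traditional public switched telephone network (PSTN) or to a mobile network. The system uses an ASR engine to capture the voter's spoken input and convert it to text, then converts that text to Election Markup Language (EML) format and passes it to the electronic voting system. Feedback generated by the voting system as EML text is conveyed to the voter through a TTS engine and the telephone handset.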

7.
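With the development of text-to-speech (TTS) technology, synthesized speech can now match the quality of a human announcer. On this basis, the paper proposes applying TTS to on-board passenger information systems, replacing the traditional approach of announcing stops with pre-recorded audio files and greatly improving the flexibility and maintainability of voice announcements.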

8.
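Speech signal acquisition and playback are implemented through hardware circuit and software design based on the TMS320VC5402 and TLC320AD50 chips. The paper briefly describes the uses of the speech acquisition and playback system and explains in detail the steps and basic principles of the hardware circuit design, as well as the operating mechanism of the TLC320AD50 A/D chip. Depending on the peripheral circuits and speech processing algorithms used, the system offers good extensibility and adaptability.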

9.
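A voice editor based on the ISD1420 chip and controlled by an 89C51 microcontroller is presented, and the system's composition, hardware scheme, user interface, and software scheme are analyzed. The paper first gives the overall structure and block diagram of the system, then a hardware design and software flow, and a brief method for implementing the user interface. The voice editor provides basic editing functions such as segmentation, recording, playback, and combined playback, and supports two recording modes, microphone and line-in. A voice editor designed this way can be used on its own to edit speech for audio equipment, or added to other devices as a module.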

10.
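TTS technology, also called text-to-speech conversion processing technology, is a major core technology built on a large corpus of real recorded speech, supplemented by corpus compression algorithms and timbre conversion algorithms. TTS technology has the following characteristics: (1) it can convert arbitrary text directly into speech output, that is, it synthesizes speech dynamically and achieves real-time voice playback in the true sense; (2) it achieves natural-intonation synthesis, characters and words …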

11.
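Pronunciation learning technology based on speech recognition   Total citations: 7, self-citations: 0, citations by others: 7
In learning the pronunciation of a language, effective feedback helps learners considerably. Computer-assisted pronunciation learning systems can give learners effective guidance. This paper introduces current pronunciation learning technology based on speech recognition, gives a block diagram of the system principle, discusses several key techniques and problems, and looks ahead to future developments.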

12.
We have examined various aspects of how to produce synthetic speech. There are numerous applications for such synthetic speech, mostly when starting from a textual input, i.e., TTS. Given the large amount of text in databases and the public's need to access information efficiently, synthetic speech is a natural way to obtain information. A major application of the future will be speech-to-speech translation, in which a person speaking in one language will be able to converse automatically with someone using another language: ASR would transcribe the original speech to a textual form in language A, then an automatic text translator would map that text to language B, and finally a TTS system for this second language would generate the output speech.

13.
Voice search is the technology underlying many spoken dialog systems (SDSs) that provide users with the information they request with a spoken query. The information normally exists in a large database, and the query has to be compared with a field in the database to obtain the relevant information. The contents of the field, such as business or product names, are often unstructured text. This article categorizes spoken dialog technology into form filling, call routing, and voice search, and reviews the voice search technology. The categorization is made from the technological perspective; a single SDS may apply technology from multiple categories. Robustness is the central issue in voice search. The technology in acoustic modeling aims at improved robustness to environmental noise, different channel conditions, and speaker variance; the pronunciation research addresses the problem of unseen word pronunciation and pronunciation variance; the language model research focuses on linguistic variance; the studies in search give rise to improved robustness to linguistic variance and ASR errors; the dialog management research enables graceful recovery from confusions and understanding errors; and the learning in the feedback loop speeds up system tuning for more robust performance. While tremendous progress has been made on voice search in the past decade, large challenges remain. Many voice search dialog systems have automation rates around or below 50% in field trials.

14.
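With the development of computer science and technology, more and more English learning software is being developed and used. In English learning, intelligent pronunciation training is an important part of practicing spoken English, and speech recognition currently receives close attention in research on English pronunciation training. With the development of mobile Internet technology, portable mobile devices based on the Android platform have become widely used as the main vehicle for English pronunciation learning aids. This paper analyzes and studies Android applications and English pronunciation training in English teaching, and proposes a design for an intelligent English pronunciation training system built on the Android platform.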

15.
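Liu Wei (刘伟), Xie Jianzhi (谢建志). Signal Processing (《信号处理》), 2017, 33(2): 229-235
The quality of the speech corpus is an important factor in the performance of speech synthesis (Text to Speech, TTS). Producing a TTS speech corpus takes about six months, during which the speaker's recording state must remain consistent: neither timbre nor energy may vary much, which is difficult for the speaker. This paper therefore presents a speech energy equalization method, comprising a time-domain envelope fluctuation detection algorithm and a frame energy averaging algorithm, to address inconsistent energy in a recorded TTS speech corpus. First, the energy and fluctuation parameters of standard speech are obtained by analysis and used as a template. Second, the time-domain envelope fluctuation detection algorithm checks whether each speech sample to be adjusted is acceptable. Finally, following the frame energy averaging criterion, the time-domain amplitude of every acceptable sample is adjusted so that the overall energy of the corpus is as consistent as possible. Experimental results show that the proposed method effectively improves the quality of a TTS speech corpus and is of practical engineering value.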
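The abstract names a frame energy averaging step but does not specify it, so the following is only a sketch of one plausible reading: measure the mean per-frame energy of a template recording, then scale each sample recording by a single gain so that its mean frame energy matches the template. The function names, the frame length, and the sine-wave test signals are all assumptions; the envelope fluctuation check that the paper applies beforehand is not modeled.

```python
import math

def mean_frame_energy(samples, frame_len):
    """Average of per-frame mean-square energy over non-overlapping frames."""
    frames = [samples[i:i + frame_len]
              for i in range(0, len(samples) - frame_len + 1, frame_len)]
    energies = [sum(s * s for s in frame) / frame_len for frame in frames]
    return sum(energies) / len(energies)

def equalize(samples, target_energy, frame_len):
    """Scale the whole recording so its mean frame energy equals target_energy."""
    current = mean_frame_energy(samples, frame_len)
    if current == 0.0:
        raise ValueError("cannot equalize a silent recording")
    gain = math.sqrt(target_energy / current)  # energy scales with gain squared
    return [s * gain for s in samples]

FRAME_LEN = 200  # hypothetical frame length in samples

def tone(amplitude, n_samples=800):
    """Synthetic stand-in for a recording: a sine wave at a fixed amplitude."""
    return [amplitude * math.sin(2 * math.pi * 5 * n / 200) for n in range(n_samples)]

template = tone(0.5)   # "standard" recording used as the energy template
quiet = tone(0.1)      # later session recorded at a lower level
target = mean_frame_energy(template, FRAME_LEN)
balanced = equalize(quiet, target, FRAME_LEN)
```

A single global gain preserves the natural loudness contour within an utterance; per-frame gains would flatten it, which is usually undesirable for a synthesis corpus.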

16.
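Design and implementation of a large substation simulation training system   Total citations: 1, self-citations: 0, citations by others: 1
Taking the Lianyungang (连运港) simulation training substation system (TTS) as background, and with reference to the actual conditions of the Tianjin power grid, this paper describes in some detail the design philosophy and overall structure of the software for a large simulated substation. The TTS system is a large simulation training system that combines physical simulation with computer-based digital simulation. It emphasizes practicality, technical sophistication, and functional completeness, reaches a high level of application, and is of great significance for raising the technical skills of substation operators and for the safe operation of the power grid.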

17.
Dialect pronunciation influences learners' English pronunciation in many ways. This thesis studies the problematic English sounds of English learners. Analyzing the influence of dialect on English pronunciation learning can help teachers and learners correct bad pronunciation habits carried over from the first language and overcome the barrier that dialect poses to learning English pronunciation; it can also help learners grasp correct English pronunciation.

18.
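The development of network technology and the spread of the Internet have strongly promoted the use and deployment of telephone systems built on an Ethernet/IP network architecture. An IP PBX telephone switching system built on Ethernet/IP implements dedicated voice services using open packet-switched data network technology and integrates seamlessly with existing data networks, so it is widely used in data and voice communications. The characteristics of IP PBX systems and their application technology therefore merit study and discussion.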

19.
The authors describe an architecture and search organization for continuous speech recognition. The recognition module is part of the Siemens-Philips-Ipo project on continuous speech recognition and understanding (SPICOS), a system for understanding database queries spoken in natural language. The goal of this project is a man-machine dialogue system that is able to understand fluently spoken German sentences and thus to provide voice access to a database. The recognition strategy is based on the Bayes decision rule and attempts to find the best interpretation of the input speech data in terms of knowledge sources such as a language model, pronunciation lexicon, and inventory of subword units. The implementation of the search has been tested on a continuous speech database comprising up to 4000 words for each of several speakers. The efficiency and robustness of the search organization have been checked and evaluated along many dimensions, such as different speakers, phoneme models, and language models.
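For readers unfamiliar with the Bayes decision rule mentioned in this abstract, its standard form in statistical speech recognition can be written as follows. The symbols are assumptions, not taken from the paper: $W$ is a candidate word sequence and $X$ is the sequence of acoustic observations.

```latex
\hat{W}
  = \operatorname*{arg\,max}_{W} P(W \mid X)
  = \operatorname*{arg\,max}_{W} \frac{P(W)\, P(X \mid W)}{P(X)}
  = \operatorname*{arg\,max}_{W} P(W)\, P(X \mid W)
```

The denominator $P(X)$ is dropped because it does not depend on $W$. In this formulation $P(W)$ is supplied by the language model, and $P(X \mid W)$ by the acoustic models of the subword units, chained together through the pronunciation lexicon.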

20.
Dynamic programming search for continuous speech recognition   Total citations: 2, self-citations: 0, citations by others: 2
The authors give a unifying view of the dynamic programming approach to the search problem. They review the search problem from the statistical point of view and show how the search space results from the acoustic and language models required by the statistical approach. Starting from the baseline one-pass algorithm using a linear organization of the pronunciation lexicon, they have extended the baseline algorithm toward various dimensions. To handle a large vocabulary, they have shown how the search space can be structured in combination with a lexical prefix tree organization of the pronunciation lexicon. In addition, they have shown how this structure of the search space can be combined with a time-synchronous beam search concept and how the search space can be constructed dynamically during the recognition process. In particular, to increase the efficiency of the beam search concept, they have integrated the language model look-ahead into the pruning operation. To produce sentence alternatives rather than only the single best sentence, they have extended the search strategy to generate a word graph. Finally, they have reported experimental results on a 64k-word task that demonstrate the efficiency of the various search concepts presented.
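The time-synchronous beam search concept described above can be illustrated on a toy hidden Markov model. This is only a sketch under stated assumptions: the two-state model, its probabilities, the function name `beam_viterbi`, and the beam widths are all invented for illustration, and the lexical prefix tree, language model look-ahead, and word graph generation from the paper are not modeled. At each time frame, all hypotheses are extended together, and any hypothesis whose log score falls more than `beam` below the current best is pruned first.

```python
import math

def beam_viterbi(obs, states, log_init, log_trans, log_emit, beam):
    """Time-synchronous Viterbi search with beam pruning.

    Returns (log_score, state_path) of the best surviving hypothesis.
    """
    # One hypothesis per state: state -> (log score, best path ending in that state)
    hyps = {s: (log_init[s] + log_emit[s][obs[0]], [s]) for s in states}
    for symbol in obs[1:]:
        best = max(score for score, _ in hyps.values())
        # Prune hypotheses that fall outside the beam before extending them
        active = {s: h for s, h in hyps.items() if h[0] >= best - beam}
        new_hyps = {}
        for prev, (score, path) in active.items():
            for nxt in states:
                cand = score + log_trans[prev][nxt] + log_emit[nxt][symbol]
                # Dynamic programming recombination: keep the best path into each state
                if nxt not in new_hyps or cand > new_hyps[nxt][0]:
                    new_hyps[nxt] = (cand, path + [nxt])
        hyps = new_hyps
    return max(hyps.values(), key=lambda h: h[0])

def logs(table):
    """Convert a nested dict of probabilities to log probabilities."""
    return {k: {j: math.log(p) for j, p in row.items()} for k, row in table.items()}

# Hypothetical two-state model over observation symbols "x" and "y"
states = ["a", "b"]
log_init = {"a": math.log(0.5), "b": math.log(0.5)}
log_trans = logs({"a": {"a": 0.8, "b": 0.2}, "b": {"a": 0.2, "b": 0.8}})
log_emit = logs({"a": {"x": 0.9, "y": 0.1}, "b": {"x": 0.1, "y": 0.9}})

obs = ["x", "x", "y", "y"]
narrow_score, narrow_path = beam_viterbi(obs, states, log_init, log_trans, log_emit, beam=1.0)
full_score, full_path = beam_viterbi(obs, states, log_init, log_trans, log_emit, beam=math.inf)
```

With an infinite beam the function reduces to exact Viterbi decoding. On this toy input the narrow beam happens to find the same path while extending fewer hypotheses; in general a beam that is too narrow can prune the eventual best path, which is the efficiency versus accuracy trade-off that pruning strategies aim to manage.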
