首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Prosody is the change of F0 and intensity in time and the speed of articulation. The presence or absence of the realization of word accent is also examined as an important feature in prosody generation. During verbal communication various prosody forms contribute to the expression of the textual content of the message on the one hand and of the personal intention of the speaker on the other. In many cases in dialogues the same text can be (must be) pronounced with different intentions. Our goal was to find what kind of prosody patterns and rules are characteristic of these utterance types and what the acoustic relationship among them is for Hungarian. In this article the prosody structures of the most important dialogue components are described, and invariant structures are derived and verified by speech synthesis. Rules are also stated as generalized function structures to show the acoustic relationship of the prosody of these expressions to the prosody of statements. Using these rules, it is possible to convert the prosody of a given utterance type to another one by preserving the naturalness of the speech. The rules can be used in text to speech (TTS) conversion to generate spoken dialogues.  相似文献   

2.
不同的韵律层级可以将文本划分成适合朗读与理解的韵律组块,从而保证合成语音能够以自然的节奏表现出来。目前对韵律层级预测所采用的特征绝大多数是较为浅层的特征,如词性、词长等,但这些浅层特征对有的韵律层次如韵律短语的预测能力比较弱。实际上,句法结构同韵律层级之间有着非常紧密的联系,二者相互影响,相互制约。本文根据依存句法分析的结果,抽取出若干同韵律层级相关的深层句法特征对韵律层级进行预测。实验证明,其中内弧跨度和内弧类型等特征,对浅层特征较难解决的类似韵律短语这种中间层次的韵律单元划分问题,可以起到很大的提高作用,使韵律短语标注的综合F值提高了11%。  相似文献   

3.
One of the most serious challenges for speech synthesis is the systematic treatment of events in language and speech that are known to have low frequencies of occurrence. The problems that extremely unbalanced frequency distributions pose for rule-based or data-driven models are often underestimated or even unrecognized. This paper discusses the problems pertinent to rare events in four components of speech synthesis systems: in linguistic text analysis, where productive word formation processes generate a potentially unbounded lexicon and cause heavily skewed word frequency distributions; in syllabification, where some syllables occur very frequently but most phonotactically possible syllables are very infrequent; in speech timing, where most constellations of factors affecting segmental duration are sparsely or not at all represented in training databases; and in unit selection synthesis, where the uneven distribution of speech unit frequencies poses challenges to speech corpus design. Currently available techniques for coping with the problem of rare or unseen events in each of these components are reviewed. Finally, a distinction is made between a strictly closed domain with a fixed vocabulary and a merely restricted domain with loopholes for unseen words and names, and the consequences of the respective type of domain for appropriate synthesis strategies are discussed.  相似文献   

4.
汉语语音合成中基频曲线(F0 曲线)预测是决定合成语音声调自然度的关键因素,为了使生成的基频曲线过渡自然,提出应用连接段基频曲线模式连接各音节的方法.连接段和音节基频曲线模式使用聚类、分析修正的方法获得,相互问有重叠性,应用时根据参数来确定选取区域,进行连接.通过实验过程中分析总结得到的规则确定基频曲线模式参数.实际应用于 PSOLA 语音合成系统后,经实验证明合成语音声调自然度明显提升.  相似文献   

5.
《Ergonomics》2012,55(1):43-55
The aim of the study was to determine the influence of textual feedback on the content and outcome of spoken interaction with a natural language dialogue system. More specifically, the assumption that textual feedback could disrupt spoken interaction was tested in a human–computer dialogue situation. In total, 48 adult participants, familiar with the system, had to find restaurants based on simple or difficult scenarios using a real natural language service system in a speech-only (phone), speech plus textual dialogue history (multimodal) or text-only (web) modality. The linguistic contents of the dialogues differed as a function of modality, but were similar whether the textual feedback was included in the spoken condition or not. These results add to burgeoning research efforts on multimodal feedback, in suggesting that textual feedback may have little or no detrimental effect on information searching with a real system.

Statement of Relevance: The results suggest that adding textual feedback to interfaces for human–computer dialogue could enhance spoken interaction rather than create interference. The literature currently suggests that adding textual feedback to tasks that depend on the visual sense benefits human–computer interaction. The addition of textual output when the spoken modality is heavily taxed by the task was investigated.  相似文献   

6.
随着网络的发展,网络旅游信息系统成为开发旅游资源的一个重要信息平台,本文针对泉州当地旅游资源,利用WebGIS软件ArcIMS平台设计开发泉州旅游信息系统,阐述其系统架构原理及设计方案。  相似文献   

7.
    
In this article we argue that discourse structure constrains the set ofpossible constituents in a discourse that can provide the relevantcontext for structuring information in a target sentence, whileinformation structure critically constrains discourse structureambiguity. For the speaker, the discourse structure provides a set of possible contexts for continuation while information structure assignment is independent of discourse structure. For the hearer, the information structure of a sentence together with discourse structure instructs dynamic semantics how rhematic information should be used to update the meaning representation of the discourse (Polanyi and van den Berg, 1996).  相似文献   

8.
While information structure has traditionally been viewed as a singlepartition of information within an utterance, there are opposing viewsthat identify multiple such partitions in an utterance. The existenceof alternative proposals raises questions about the notion ofinformation structure and also its relation to discoursestructure. Exploring various linguistic aspects, this paper supports thetraditional view by arguing that there is no information structure partition within a subordinate clause.  相似文献   

9.
基于韵律特征参数的情感语音合成算法研究   总被引:1,自引:0,他引:1  
为了合成更为自然的情感语音,提出了基于语音信号声学韵律参数及时域基音同步叠加算法的情感语音合成系统.实验通过对情感语音数据库中生气、无聊、高兴和悲伤4种情感的韵律参数分析,建立4种情感模板,采用波形拼接语音合成技术,运用时域基音同步叠加算法合成含有目标感情色彩的语音信号.实验结果表明,运用波形拼接算法,调节自然状态下语音信号的韵律特征参数,可合成较理想的情感语音.合成的目标情感语音具有明显的感情色彩,其主观情感类别判别正确率较高.  相似文献   

10.
基于STM32的嵌入式语音识别模块设计   总被引:1,自引:0,他引:1  
介绍了一种以ARM为核心的嵌入式语音识别模块的设计与实现.模块的核心处理单元选用ST公司的基于ARM Cortex-M3内核的32位处理器STM32F103C8T6.本模块以对话管理单元为中心,通过以LD3320芯片为核心的硬件单元实现语音识别功能,采用嵌入式操作系统μC/OS-Ⅱ来实现统一的任务调度和外围设备管理.经...  相似文献   

11.
In some cases, to make a proper translation of an utterance in a dialogue, different pieces of contextual information are needed. Interpreting such utterances often requires dialogue analysis including speech acts and discourse analysis. In this paper, a statistical dialogue analysis model for Korean–English dialogue machine translation based on speech acts is proposed. The model uses syntactic patterns and n-grams of speech acts. The syntactic patterns include surface syntactic features which are related to the language-dependent expressions of speech acts. Speech-act n-grams are used to approximate the context of utterances. The key feature is the use of speech-act n-grams based on hierarchical recency. Experimental results with trigrams show that the proposed model achieves an accuracy of 66.87% for the top candidate and 82.35% for the top three candidates. It indicates that the proposed model based on hierarchical recency outperforms the model based on linear recency.  相似文献   

12.
“维基解密”事件爆发后,网络安全重要性问题上升到前所未有的认识高度,言论自由的底线、数据信息的保密势必会发展出全新且更为全面的法律标准。分析“维基解密”事件的启示是,掌握机密信息的计算机终端用户必须进行全面的信息保护,政府信息安全知识亟待提高,私人或者公众数据库保护级别必须提高。与其从法律与道德层面对“维基解密”进行谴责,不如夯实信息网络安全保护的工作基础。  相似文献   

13.
以B/S为架构,设计并实现了一个分布式数据处理的高校医院综合管理平台,将校园内不同校区的医院实行统一的信息管理。对高校医院的信息化管理水平和适应医疗管理制度的改革,具有一定的借鉴意义。  相似文献   

14.
该文在分析和研究了WCF原理特性的基础上结合作者多年的ERP系统架构与实现经验,提出了一种基于WCF的分布式的信息系统的结构模型(B/S/S与C/S/S模型结构),并基于此模型结构设计了一种多层的分布式软件体系架构,该结构模型与体系架枸有着更好的灵活性、安全性、可扩展性,并且该架构模型和体系架构史持多种网络终端设备。  相似文献   

15.
办公信息管理是企事业单位信息化管理的重要组成部分。为改变手工管理的低效现状,充分利用单位现有软、硬件资源,本文从单位的实际应用入手,提出并组织实施一种基于B/S架构的办公信息管理系统,提高办公效率和管理水平。  相似文献   

16.
该文在分析和研究了WCF原理特性的基础上结合作者多年的ERP系统架构与实现经验,提出了一种基于WCF的分布式的信息系统的结构模型(B/S/S与C/S/S模型结构),并基于此模型结构设计了一种多层的分布式软件体系架构,该结构模型与体系架构有着更好的灵活性、安全性、可扩展性,并且该架构模型和体系架构支持多种网络终端设备。  相似文献   

17.
多传感器信息融合技术综述   总被引:8,自引:0,他引:8  
文章首先介绍了多传感器信息融合技术的定义、优点、发展历程,接着介绍了多传感器信息融合的工作原理、主要研究方法、主要应用,最后对信息融合技术的未来发展做了展望。  相似文献   

18.
描述了建设学生信息系统对实现教育目标的重大意义。说明了基于B/S结构的学生信息系统的优点。给出了建设和使用学生信息系统的注意事项。  相似文献   

19.
本文根据企业加强预算管理和信息化的要求,提出了一种基于C/S体系结构的企业预算管理系统设计方案,详细阐述了系统的网络及硬件软件平台、数据库平台、系统功能模块和子系统结构,并对系统实施的关键技术一客户端程序的自动更新和业务工作流结构进行了分析。  相似文献   

20.
基于统一编码的信息孤岛集成技术研究   总被引:8,自引:0,他引:8  
信息编码是信息孤岛集成的基础和关键技术。文章分析了信息集成环境对信息分类编码的要求,从信息编码多约束目标出发,采用AHP法分析了各约束目标对编码结构的影响权重,构建了一种基于面向对象技术的统一编码柔性结构模型,并对模型进行了描述和实例化,该模型最终实现了各类信息编码结构形式的统一。以统一编码结构模型为基础,提出了基于统一编码的信息孤岛集成技术实现方案,从基于统一编码的应用系统接口技术和基于统一编码的标准视图访问接口技术两个方面描述了集成方案的实现机理。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号