期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

基于波形音频段处理的中文语音合成研究

罗三定贾建华等《电脑与信息技术》2002,10(1):16-19

针对语音合成自然度不够理想的问题，文章提出了语音单元之间平稳过渡的改进方法和在分词基础上的词的组合方法，给出一些短语的组合，并对合成中的语气处理规律进行了探讨，将这些规则和规律成功应用于汉语文本－语音转换系统，获得了比较好的效果。相似文献

2.

关系运算的综合及其优化方法研究 总被引：1，自引：0，他引：1

吴建国孙元《计算机研究与发展》1998,35(10):941-95

文中较全面地研究了ＶＨＬＤ等述语言中关系运算的综合及其优化方法，提出了根据关系运算在描述中的不同情开以不同综合方法的策略。相似文献

3.

On the impact of excitation and spectral parameters for expressive statistical parametric speech synthesis

《Computer Speech and Language》2014,28(5):1209-1232

This paper presents a study on the importance of short-term speech parameterizations for expressive statistical parametric synthesis. Assuming a source-filter model of speech production, the analysis is conducted over spectral parameters, here defined as features which represent a minimum-phase synthesis filter, and some excitation parameters, which are features used to construct a signal that is fed to the minimum-phase synthesis filter to generate speech. In the first part, different spectral and excitation parameters that are applicable to statistical parametric synthesis are tested to determine which ones are the most emotion dependent. The analysis is performed through two methods proposed to measure the relative emotion dependency of each feature: one based on K-means clustering, and another based on Gaussian mixture modeling for emotion identification. Two commonly used forms of parameters for the short-term speech spectral envelope, the Mel cepstrum and the Mel line spectrum pairs are utilized. As excitation parameters, the anti-causal cepstrum, the time-smoothed group delay, and band-aperiodicity coefficients are considered. According to the analysis, the line spectral pairs are the most emotion dependent parameters. Among the excitation features, the band-aperiodicity coefficients present the highest correlation with the speaker's emotion. The most emotion dependent parameters according to this analysis were selected to train an expressive statistical parametric synthesizer using a speaker and language factorization framework. Subjective test results indicate that the considered spectral parameters have a bigger impact on the synthesized speech emotion when compared with the excitation ones. 相似文献

4.

螺旋状匹配搜索的块拼贴纹理合成 总被引：3，自引：0，他引：3

李燕王永东吴文治吴晓东《计算机工程》2006,32(16):210-212

基于样图的纹理合成方法是继纹理映射、过程纹理合成等方法后发展起来的一种纹理拼贴方法。该文在Efros块拼贴算法和徐晓刚的螺旋状点匹配搜索算法基础上,提出了一种螺旋状匹配搜索的块拼贴算法。该算法利用纹理块的连惯性,在搜索待合成纹理块时,在已合成纹理块在样本图像中位置的邻域进行搜索,找到匹配纹理块后进行输出。该方法大大加快了纹理合成的速度,与Efros块拼贴算法相比,在合成质量不变的基础上,合成速度平均提高了10倍。对于不同的纹理进行实验,其结果也令人满意。相似文献

5.

Advances in human-interface engineering for reverse directory assistance (ACNA) services

Murray F. Spiegel 《International Journal of Speech Technology》1997,1(2):91-101

After years of productive research, speech synthesis is now profitably automating services by answering queries via constrained dialogs, directly accessing individual computer databases, and speaking text created from disparate sources of information. Directory-based services, such as Automated Customer Name and Address (ACNA), requires synthesis with high intelligibility and name pronunciation accuracy. Current synthesis technology achieves those goals. However, even the best of current speech technology is not good enough to mindlessly drop into complex services. Customized directory preprocessing is still necessary to transform listing data, which commonly contains unconventional abbreviations, unlabeled acronyms, and scrambled word ordering, into a sentence suitable for synthesis. This article describes state-of-the-art directory preprocessing programs that have led to successful implementations for synthesis services in 2 major U.S. telephone companies (Ameritech and Bell Atlantic). Of course, the basic capabilities of the synthesizer, such as pronuncïation accuracy, speech quality and naturalness, play a large role. Efforts ensured locality terms were pronounced in accordance with local custom. Finally, for prompts and other fixed messages, this article describes experiments that determined whether the naturalness of recorded speech offsets the undesirable discontinuity between recorded and synthesized utterances.An earlier version of this article was given at the AVIOS '95 conference. 相似文献

6.

The RTL Binding and Mapping Approach of VHDL High-Level Synthesis System HLS/BIT

下载免费PDF全文

Yan Zongfu Liu Mingye 《计算机科学技术学报》1996,11(6):562-569

This paper describes a VHDL high-level synthesis system HLS/BIT with emphasis on its register-transfer level(RTL)binding and technology mapping subsystem.In more detail,the component instantiation mechanism and the knowledge-driven approach to RTL technology mapping are also presented. 相似文献

7.

基于复用计算的大纹理实时合成

陈昕王文成《计算机学报》2010,33(4)

文中提出一种基于复用计算的纹理合成方法,逐步地利用已合成的部分纹理来生成更大的纹理块,以进行后续的纹理合成计算.由此,该方法可节省大量耗时的纹理块选择及缝合计算,提高了合成效率.实验表明,新方法可实时合成2048×2048像素的大纹理,而已有工作至多只能以交互的速度进行这样的合成. 相似文献

8.

Real‐time horse gait synthesis

Ting‐Chieh Huang Yi‐Jheng Huang Wen‐Chieh Lin 《Computer Animation and Virtual Worlds》2013,24(2):87-95

Horse locomotion exhibits rich variations in gaits and styles. Although there have been many approaches proposed for animating quadrupeds, there is not much research on synthesizing horse locomotion. In this paper, we present a horse locomotion synthesis approach. A user can arbitrarily change a horse's moving speed and direction, and our system would automatically adjust the horse's motion to fulfill the user's commands. At preprocessing, we manually capture horse locomotion data from Eadweard Muybridge's famous photographs of animal locomotion and expand the captured motion database to various speeds for each gait. At runtime, our approach automatically changes gaits based on speed, synthesizes the horse's root trajectory, and adjusts its body orientation based on the horse's turning direction. We propose an asynchronous time warping approach to handle gait transition, which is critical for generating realistic and controllable horse locomotion. Our experiments demonstrate that our system can produce smooth, rich, and controllable horse locomotion in real time. Copyright © 2012 John Wiley & Sons, Ltd. 相似文献

9.

基于虚拟不定长的语音库裁剪方法

张巍吴晓如赵志伟王仁华《软件学报》2006,17(5):983-990

语音库裁剪或语音库去冗余,是大语料库语音合成技术的一个重要问题.提出了虚拟不定长替换的概念,以弥补不定长的损失.结合合成使用变体的频度,构建了语音库裁剪算法StaRp-VPA.该算法能够以任意比例裁剪语音库.实验表明:当裁剪率小于50%时,合成自然度几乎没有下降;当裁剪率大于50%时,合成自然度也不会严重降低. 相似文献

10.

块拼贴的纹理合成算法的实现与改进

戴磊《计算机与现代化》2008,(11):80-83

基于样图的纹理合成方法是继纹理映射、过程纹理合成等方法后发展起来的一种新的纹理拼接技术。其中块拼贴的纹理合成算法由于合成速度快,效果良好,受到极大关注。对采用块拼接技术的纹理合成方法进行综述,运用图像切割方法较好解决了块拼贴算法中的最佳分割路径获取问题,并对合成的结果进行平滑优化。相似文献

11.

Secure-by-construction synthesis of cyber-physical systems

《Annual Reviews in Control》2022

Correct-by-construction synthesis is a cornerstone of the confluence of formal methods and control theory towards designing safety-critical systems. Instead of following the time-tested, albeit laborious (re)design-verify-validate loop, correct-by-construction methodology advocates the use of continual refinements of formal requirements – connected by chains of formal proofs – to build a system that assures the correctness by design. A remarkable progress has been made in scaling the scope of applicability of correct-by-construction synthesis – with a focus on cyber-physical systems that tie discrete-event control with continuous environment – to enlarge control systems by combining symbolic approaches with principled state-space reduction techniques.Unfortunately, in the security-critical control systems, the security properties are verified ex post facto the design process in a way that undermines the correct-by-construction paradigm. We posit that, to truly realize the dream of correct-by-construction synthesis for security-critical systems, security considerations must take center-stage with the safety considerations. Moreover, catalyzed by the recent progress on the opacity sub-classes of security properties and the notion of hyperproperties capable of combining security with safety properties, we believe that the time is ripe for the research community to holistically target the challenge of secure-by-construction synthesis. This paper details our vision by highlighting the recent progress and open challenges that may serve as bricks for providing a solid foundation for secure-by-construction synthesis of cyber-physical systems. 相似文献

12.

Lurgi型甲醇合成反应器的动态模拟

胡国静张树增王键红《计算机与应用化学》2006,23(9):849-852

为了能更好地预测甲醇合成系统的动态特性、合成与分析控制系统、模拟开停车及事故和培训操作人员等,对Lurgi型甲醇合成反应器的模型化与动态模拟技术进行了研究。根据物料及热量守恒方程建立了Lurgi型甲醇合成固定床反应器的动态模型,并根据模型的形式和特点选择了适当的数值计算方法,开发了动态模拟程序模块,并据此通过模拟计算获得了适宜的操作参数范围,这对于优化合成工艺,提高甲醇产量有明确的指导意义。应用结果表明,达到稳态时的结果能真实地反映生产实际情况,动态过程能很好地反映生产变化趋势。该模型对Lurgi型甲醇合成反应器动态特性和控制方案的研究,以及甲醇合成相关工艺仿真培训系统软件的开发等都有重要的意义。相似文献

13.

Challenges and Rewards in Using Parametric or Concatenative Speech Synthesis

Caroline Henton 《International Journal of Speech Technology》2002,5(2):117-131

Highest quality synthetic voices remain scarce in both parametric synthesis systems and in concatenative ones. Much synthetic speech lacks naturalness, pleasantness and flexibility. While great strides have been made over the past few years in the quality of synthetic speech, there is still much work that needs to be done. Now the major challenges facing developers are how to provide optimal size, performance, extensibility, and flexibility, together with developing improved signal processing techniques. This paper focuses on issues of performance and flexibility against a background containing a brief evolution of speech synthesis; some acoustic, phonetic and linguistic issues; and the merits and demerits of two commonly used synthesis techniques: parametric and concatenative. Shortcomings of both techniques are reviewed. Methodological developments in the variable size, selection and specification of the speech units used in concatenative systems are explored and shown to provide a more positive outlook for more natural, bearable synthetic speech. Differentiating considerations in making and improving concatenative systems are explored and evaluated. Acoustic and sociophonetic criteria are reviewed for the improvement of variable synthetic voices, and a ranking of their relative importance is suggested. Future rewards are weighed against current technical and developmental challenges. The conclusion indicates some of the current and future applications of TTS. 相似文献

14.

基于Tacotron模型和韵律修正的情感语音合成方法

张昕胡航烨曹欣怡王蔚《数据采集与处理》2022,37(4):909-916

语音合成技术日趋成熟,为了提高合成情感语音的质量,提出了一种端到端情感语音合成与韵律修正相结合的方法。在Tacotron模型合成的情感语音基础上,进行韵律参数的修改,提高合成系统的情感表达力。首先使用大型中性语料库训练Tacotron模型,再使用小型情感语料库训练,合成出具有情感的语音。然后采用Praat声学分析工具对语料库中的情感语音韵律特征进行分析并总结不同情感状态下的参数规律,最后借助该规律,对Tacotron合成的相应情感语音的基频、时长和能量进行修正,使情感表达更为精确。客观情感识别实验和主观评价的结果表明,该方法能够合成较为自然且表现力更加丰富的情感语音。相似文献

15.

Audiovisual Speech Synthesis

G. Bailly M. Bérar F. Elisei M. Odisio 《International Journal of Speech Technology》2003,6(4):331-346

相似文献

16.

一种局部曲面纹理合成的改进方法

解慧《电脑开发与应用》2014,(11):73-76

在局部纹理映射加速曲面纹理合成的算法的基础上,提出通过方向经验模型分解算法对其进行改进.方向经验模型分解算法利用纹理固有方向,在局部纹理的合成区和映射区实现了无缝接的纹理合成。实验结果表明,该算法合成质量高、算法简单、运行快速,能够达到令人满意的合成效果。相似文献

17.

DNA合成技术与仪器研发进展概述

江湘儿王勇沈玥《集成技术》2021,10(5):80-95

基因组解读促使生命进入数字化时代,合成生物学赋予人类探索生命本质并改造利用的能力,且在医疗、化工、农业及信息等交叉融合领域实现快速发展.DNA合成是合成生物学的基础性技术,其重要性堪比测序技术对基因组学与精准医学的支撑.该文围绕DNA合成方法、技术路径及仪器研制与产业化进展进行了系统性的分析比较,并结合未来需求,对DN... 相似文献

18.

虚拟人面部行为的合成 总被引：17，自引：2，他引：17

高文陈熙霖晏洁宋益波尹宝才《计算机学报》1998,21(8):694-703

虚拟人是虚拟现实环境中很重要的一部分，对于虚拟人行为的研究除了应从宏观上考虑虚拟人的群体行为属性之外，以个体行为属性的研究也非常重要。个体行为包括自然行为和意识行为。自然行为主要是和脸部、头部以及四肢运动有关的行为。而意识行为则包括与语言和心理活动相关联的表情、发声以及对应的唇动手势动作等。本文旨在研究与意识行为有关的虚拟人面部图像合成技术，讨论了标准人脸图像的参数合成方法，给出了特定人脸图像与标相似文献

19.

VHDL语言高级综合子集的确立及其实现方法 总被引：7，自引：2，他引：7

张东晓刘明业《计算机学报》1997,20(3):198-205

越来越多的高级综合系统采用或接受ＶＨＤＬ语言作为设计输入，但ＶＨＤＬ语言的语义本质是基于模拟而非基于高级综合的，许多语法现象不能或不适于进行综合。本文系统地分析了ＶＨＤＬ语言的可综合性问题，详细讨论了ＶＨＤＬ语言的各种语法现象的可综合性，并结合实际系统分析了ＶＨＤＬ语言高级综合子集的确立及实现方法。相似文献

20.

Phoneme Intelligibility of Four Text-to-Speech Products to Nonnative Speakers of English in Noise

H.?S.?Venkatagiri Email author 《International Journal of Speech Technology》2005,8(4):313-321

The study investigated the segmental intelligibility of four text-to-speech (TTS) products under 0 dB and 5 dB signal-to-noise ratios in a group of native and nonnative speakers of English. Each product—AT&T Next-Gen™, Festival version 1.4.2, FlexVoice™ 2, and IBM ViaVoice™ Version 5.1—uses a different algorithm for generating speech from text. The results, which benefit developers of TTS technology as well as developers of products that utilize TTS, showed that (1) all TTS products were less intelligible to nonnative speakers of English than native speakers, (2) the “hybrid” TTS product that combined concatenative and formant synthesis methods was the least intelligible of the four products investigated, (3) the remaining three products, which used formant, concatenative diphone based LPC, and concatenative waveform synthesis methods respectively, were equally intelligible to nonnative speakers, (4) none of the four TTS products was better at resisting intelligibility loss due to noise than others, and (5) listening to currently available unrestricted TTS under high noise conditions would probably require a greater amount of cognitive resources on the part of both native and nonnative speakers of English and may be difficult when other demanding activities are concurrently performed. 相似文献