首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
This paper describes the design of a speech coder called pitch synchronous innovation CELP (PSI-CELP) for low hit-rate mobile communications. PSI-CELP is based on CELP, but has more adaptive excitation structures. In voiced frames, instead of conventional random excitation vectors, PSI-CELP converts even the random excitation vectors to have pitch periodicity by repeating stored random vectors as well as by using an adaptive codebook, in silent, unvoiced, and transient frames, the coder stops using the adaptive codebook and switches to fixed random codebooks. The PSI-CELP coder also implements novel structures and techniques: an FIR-type perceptual weighting filter using unquantized LPC parameters, a random codebook with a conjugate structure trained to be robust against channel errors, codebook search with delayed decision, a gain quantization with sloped amplitude, and a moving average prediction coding of LSP parameters, Our speech coder is implemented by DSP chips. Its coded speech quality at 3.6 kb/s with 2.0 kb/s redundancy is comparable to that of the Japanese full-rate VSELP coder at 6.7 kb/s with 4.5 kb/s redundancy. The basic structure of this PSI-CELP coder has been chosen as the Japanese half-rate speech codec for digital cellular telecommunications  相似文献   

2.
该文提出了一种码率为 0.75-5.4kb/s可变速率的高质量语音编码讲法。该算法对CELP的激励进行了改进,根据语音的特征把语音分成4类,不同类型的语音采用不同的激励码本。特别是对于浊音,提出了一种基于基音同步的嵌入分裂式激励码本,该码本利用浊音具有准周期性的特点,使该算法在很低的码率下就可很好地恢复浊音信号,克服了CELP在4kb/s速率以下因码本尺寸小而导致合成语音质量差的缺点。经非正式听音测试,它的主观质量超过了1~8kb/s的可变速率QCELP系统,并且平均速率大约只有2kb/s,比QCELP的5kb/s平均速率低了很多、非常适用于 CDMA移动通信系统。  相似文献   

3.
基于局部余弦变换的低比特变速率语音编码算法研究   总被引:1,自引:0,他引:1  
提出将局部余弦变换(LCT)算法应用于语音编码中,系统设计了一个平均比特率近1.6kbit/s的低比特变速率语音编码器。在变比特率编码器设计中采用SVM算法进行VAD检测。激活语音帧的语音模式采用GSM半速率编码中的划分方法,但将其中的强浊音模式和中浊音模式合并为一个中强浊音模式。对各类语音模式和无声帧(背景噪声)的局部余弦变换系数采用分维矢量量化算法进行量化,码书设计采用LGB算法。编码中的码书搜索采用树形快速搜索算法。通过主观非正式听力测试表明设计的变比特率编码器编码的重建语音MOS约为3.15,与比特率为2.4kbit/s美国联邦声码器标准MELP的重建语音相当,具有较强的顽健性,适合于对存在各种环境噪声的语音进行编码。  相似文献   

4.
该文基于LPC的自适应前后向量化技术,提出了一种可变速率的混合激励线性预测MELP语音编码算法。该算法中,采用当前语音帧(前向LPC)或前面某帧已合成语音帧(后向LPC)进行线性预测,当采用后向LPC时,只需传输时间序列编码,故减少了LPC系数的平均编码比特。计算机模拟表明,该算法与标准MELP算法合成的语音质量相当,但显著减少了LPC的传输带宽,从而明显降低了MELP平均编码速率。  相似文献   

5.
李晔  樊燕红  郝秋赟  郭强 《电声技术》2010,34(12):51-53
基于增强型混合激励线性预测模型,提出一种高质量的300 bit/s声码器算法。每个语音帧仅提取少量参数,为提高量化效率,每8个语音帧组成一个超级帧,对超级帧参数进行矢量量化。算法采用基于模式转移的码本映射估计带通浊音度参数,改善其量化精度。对不同带通浊音度模式下的基音参数量化码本尺寸进行联合优化,提高量化效率。同时,对线谱频率参数采用带有级间预测的多级矢量量化以降低谱失真。主观听觉测试表明,此声码器具有较高的可懂度并具有一定的自然度,诊断押韵测试(DRT)的分数为84.2%。  相似文献   

6.
一种改进的MELP语音编码方法   总被引:1,自引:0,他引:1  
目前2.4kbps的混合激励线性预测(MELP)语音编码方法已经被确定为美国新的联邦语音编码标准。本文提出了一种改进的MELP语音编码方法,利用滤波器相似度和基于LPC系数分类的矢量量化技术,可以把MELP的码率降到1.7kbps以下,仍有较好的合成语音质量。  相似文献   

7.
基于增强型混合激励线性预测(MELPe)模型,设计了一款600bps低速率语音编码器。该编码器在保持MELPe算法特征的同时,利用相邻帧的帧间冗余,把连续的三帧构成一个超帧,对超帧采用多模式预测和多级矩阵量化技术进行联合量化。同时针对超帧的不同模式,通过预测系数对相邻超帧的模式转换进行处理,实现线谱对参数(LSF)的矢量量化。最后对基音周期与增益参数进行联合量化,进一步提高量化效率,完成一款在600bps下仍具有较好合成语音质量的语音编码器的设计。  相似文献   

8.
Low bit-rate speech coders for multimedia communication   总被引:10,自引:0,他引:10  
The International Telecommunications Union (ITU) has standardized three speech coders which are applicable to low-bit-rate multimedia communications. ITU Rec. G.729 8 kb/s CS-ACELP has a 15 ms algorithmic codec delay and provides network-quality speech. It was originally designed for wireless applications, but is applicable to multimedia communications as well. Annex A of Rec. G.729 is a reduced-complexity version of the CS-ACELP coder. It was designed explicitly for simultaneous voice and data applications that are prevalent in low-bit-rate multimedia communications. These two coders use the same bitstream format and can interoperate. The ITU Rec. G.723.1 6.3 and 5.3 kb/s speech coder for multimedia communications was designed originally for low-bit-rate videophones. Its frame size of 30 ms and one-way algorithmic codec delay of 37.5 ms allow for a further reduction in bit rate compared to the G.729 coder. In applications where low delay is important, the delay of G.723.1 may be too large. However, if the delay is acceptable, G.723.1 provides a lower-complexity alternative to G.729 at the expense of a slight degradation in quality. This article describes the attributes of speech coders such as bit rate, complexity, delay, and quality. Then it discusses the basic concepts of the three new ITU coders by comparing their specific attributes. The second part of this article describes the standardization process for each of these coders  相似文献   

9.
A method for encoding the spectral characteristics of speech, at rates below 180 bit/s, using hierarchical temporal decomposition (HTD) is proposed. A set of the log-area-ratio (LAR) parameters, extracted from a given block of speech, are approximated through Gaussian interpolation between the most-steady frames detected by the HTD. This results in a smaller set of parameters which are encoded using vector quantisation. It is shown that the same spectral distortion is obtained with the new coder at a rate of 180 bit/s as that using a scalar quantisation, TD-based coder, at 600 bit/s  相似文献   

10.
设计了一种数码率为1.8kb/s的多带线性预测(MBLP)语音压缩编码算法。该算法采用基于谐振结构的线性预测分析和对激励信号采用多带处理的方法。试验结果表明,本算法提供了相当于码率为2.4kb/s美国联邦声码器标准MELP的重建语音质量,具有较高的清晰度和自然度。  相似文献   

11.
A new video coding algorithm called the first-order-residual/second-order-residual (FOR/SOR) codec is proposed for high definition (HD) video coding in this work. Several advanced coding techniques are adopted in the proposed FOR/SOR codec. For the FOR codec, the well known block-based motion compensated predictive codec is used to exploit temporal and spatial correlations in input image frames. However, it is observed that there still exists structured residual signal after the FOR coding, and a SOR coder is developed to encode residual image frames efficiently. To improve the coding performance furthermore, we consider bit allocation between the FOR and SOR coders at the same block and determine their optimal quantization parameters systematically. It is shown by experimental results that the proposed FOR/SOR codec outperforms H.264/AVC significantly in HD video coding.  相似文献   

12.
针对极低速率语音通信的要求,提出了一种基于MELP(Mixed-Excitation Linear Prediction)的0.6Kb/s语音编码算法。把MELP算法中3个连续语音帧组成一个超级帧,充分利用参数的帧间相关性,进行联合量化,从而获得了高质量的合成语音。采用对线谱对频率的两帧联合量化与双向预测矢量量化对基音周期的按清浊音分模式量化,对子带清浊参数量化的统计码本构造,对能量参数采用分离均值矢量量化解码端对能量参数采用了一种效果更好的插值算法等。  相似文献   

13.
This article is an overview of the standardization, architecture, and performance of the new ITU-T Recommendation G.718. G.718 is an embedded variable bit rate codec providing a scalable solution for compression of 8 and 16 kHz sampled speech and audio signals at rates between 8 kb/s and 32 kb/s. It comprises five layers where higher-layer bitstreams can be discarded without affecting the lower layersiquest decoding. The codec also has an optional core layer interoperable with ITU-T G.722.2 (3GPP AMR-WB) at 12.65 kb/s. G.718 was designed to provide high speech quality at low bit rates and to be robust to significant rates of frame erasures or packet losses. It is also targeting good quality for generic audio at higher rates.  相似文献   

14.
MELP低比特率数字语音编码技术研究   总被引:6,自引:0,他引:6  
主要介绍了一种新的低比特率MELP (MixedExcitationLinearPrediction)混合激励线性预测语音编码技术 ,其中着重分析了该编码算法所采用的几项关键技术。给出了采用MELP压缩编码算法后的输出比特流在各个参数上的比特分配表 ,并通过计算机仿真 ,对MELP合成语音与原始语音的质量做了比较 ,最后就MELP语音编码技术与现今其它几种不同的低速率语音编码技术的合成语音质量在DRT、DAM及MOS得分三个方面做了比较。  相似文献   

15.
ITU-T.G.723.1为国际电信联盟(ITU)制定的5·3bit/s和6.3kbit/s双速率语音编码建议,分别采用代数码激励线性预测(ACELP)算法和多脉冲最大似然量化(MP-MLQ)算法。在阐述G.723.1建议编译码算法的原理和实现的基础上,重点介绍了在开发基于TMS320VC5409实时实现该建议的全双工编译器过程中所做的工作。该语音编译码器通过了G.723.1所有测试矢量的验证。  相似文献   

16.
The effects of digital transmission errors on a family of variable-rate embedded subband speech coders (SBC) are analyzed in detail. It is shown that there is a difference in error sensitivity of four orders of magnitude between the most and the least sensitive bits of the speech coder. As a result, a family of rate-compatible punctured convolutional codes with flexible unequal error protection capabilities have been matched to the speech coder. These codes are optimally decoded with the Viterbi algorithm. Among the results, analysis and informal listening tests show that with a 4-level unequal error protection scheme transmission of 12 kb/s speech is possible with very little degradation in quality over a 16 kb/s channel with an average bit error rate (BER) of 2×10-2 at a vehicle speed of 60 m.p.h. and with interleaving over two 16 ms speech frames  相似文献   

17.
In this paper, we present a novel coding technique that makes use of the nonstationary characteristics of an image sequence displacement field to estimate and encode motion information. We utilize an MPEG style codec in which the anchor frames in a sequence are encoded with a hybrid approach using quadtree, DCT, and wavelet-based coding techniques. A quadtree structured approach is also utilized for the interframe information. The main objective of the overall design is to demonstrate the coding potential of a newly developed motion estimator called the coupled linearized MAP (CLMAP) estimator. This estimator can be used as a means for producing motion vectors that may be regenerated at the decoder with a coarsely quantized error term created in the encoder. The motion estimator generates highly accurate motion estimates from this coarsely quantized data. This permits the elimination of a separately coded displaced frame difference (DFD) and coded motion vectors. For low bit rate applications, this is especially important because the overhead associated with the transmission of motion vectors may become prohibitive. We exploit both the advantages of the nonstationary motion estimator and the effective compression of the anchor frame coder to improve the visual quality of reconstructed QCIF format color image sequences at low bit rates. Comparisons are made with other video coding methods, including the H.261 and MPEG standards and a pel-recursive-based codec.  相似文献   

18.
The joint development of a medium bit-rate speech coder along with an effective channel coding technique to provide a robust, spectrally efficient, high-quality mobile communication system is described. A subband coder operating at 12 kb/s is used; in the absence of channel errors, it provides speech quality comparable to current analog land-mobile radio systems. The coder design incorporates a unique coding of the side information to facilitate the use of forward-error-correction coding without the need to code the entire bit stream. The use of excessive overhead for redundancy is avoided while the harsh effects of frequent channels are mitigated. These techniques have been used in an experimental FDMA (frequency-division multiple access) digital land-mobile radio system. The combined speech and channel coder operates at 15 kb/s and provides intelligible speech at fading channel error rates up to 8%  相似文献   

19.
An adaptive predictive coder providing almost toll quality at 16 kb/s and minimal degradation when the bit rate is lowered to 9.6 kb/s is described. The coder can operate at intermediate bit rates and can also change bit rate on a packet-by-packet basis. Variable bit rate operation is achieved through the use of switched quantization, thus eliminating the need for buffering of the output. A noise shaping filter provides flexible control of the output noise spectrum. The filter, in conjunction with an enhanced way to adapt the quantizer step size, which tries to accommodate the quantization noise feedback, accounts for the toll quality. By quantizing the residue with more than one quantizer, the effective number of bits per sample can be controlled in a deterministic way regardless of the entropy residue. The lower limit of operation is at 9.6 kb/s. Performance of the coder under random bit errors is also presented. It has been found that only at error rates of 10-2 and higher does the degradation becomes objectionable  相似文献   

20.
为满足无线通信的要求,文中在传统的MELP的基础上,提出了一种速率为600b/s的语音编码算法。该算法利用帧间冗余,把连续的4帧构成一个超级帧进行联合量化。对线谱对采用两帧联合量化、双向线性内插技术,对能量参数采用分离均值矢量量化等技术。仿真实验证明该语音编码算法有较好质量。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号