Similar literature
20 similar documents found (search time: 0 ms)
1.
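To meet the needs of digital communication and other commercial applications, speech compression coding has developed rapidly. In recent years, mainstream low-rate speech coding schemes have been based mainly on LPC-10, mixed excitation linear prediction (MELP), multi-band excitation (MBE) coding, sinusoidal transform coding (STC), and waveform interpolation (WI) coding, most of which operate at 2.4 kb/s. As an important low-rate speech coding algorithm, MELP substantially improves on the LPC-10 scheme by introducing five features: mixed excitation, aperiodic pulses, Fourier magnitudes of the residual, pulse dispersion, and adaptive spectral filtering. Experimental results show that mixed excitation linear prediction coding produces better synthesized speech at 2.4 kb/s, and that the synthesized speech matches natural speech more closely.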

2.
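Research on Low-Rate Speech Coding with Mixed Excitation Linear Prediction   Total citations: 1, self-citations: 0, citations by others: 1
To meet the needs of digital communication and other commercial applications, speech compression coding has developed rapidly. In recent years, mainstream low-rate speech coding schemes have been based mainly on LPC-10, mixed excitation linear prediction (MELP), multi-band excitation (MBE) coding, sinusoidal transform coding (STC), and waveform interpolation (WI) coding, most of which operate at 2.4 kb/s. As an important low-rate speech coding algorithm, MELP substantially improves on the LPC-10 scheme by introducing five features: mixed excitation, aperiodic pulses, Fourier magnitudes of the residual, pulse dispersion, and adaptive spectral filtering. Experimental results show that mixed excitation linear prediction coding produces better synthesized speech at 2.4 kb/s, and that the synthesized speech matches natural speech more closely.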

3.
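Research on MELP Low-Bit-Rate Digital Speech Coding   Total citations: 6, self-citations: 0, citations by others: 6
This paper introduces a new low-bit-rate speech coding technique, MELP (Mixed Excitation Linear Prediction), with emphasis on several key techniques used in the coding algorithm. It gives the bit allocation table of the output bitstream across the parameters for the MELP compression coding algorithm, compares the quality of MELP synthesized speech with the original speech through computer simulation, and finally compares the synthesized speech quality of MELP with that of several other current low-rate speech coding techniques in terms of DRT, DAM, and MOS scores.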

4.
Kwon C.H., Un C.K. Electronics Letters, 1993, 29(2): 156-157
A CELP-based mixed-source model is described. It uses a mixed excitation which combines a lowpass-filtered adaptive source and a highpass-filtered stochastic source. In addition, one more stochastic source is newly employed for more natural sounding speech. In informal listening tests, the proposed model at 3 kbit/s shows very good performance both in speech quality and intelligibility.
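The mixing idea in this abstract (a lowpass-filtered periodic source plus a highpass-filtered noise source) can be illustrated with a minimal sketch. This is not the authors' model: the pulse train stands in for the adaptive source, the moving-average filter and its complementary highpass stand in for the actual filters, and the function names, `noise_gain`, and `taps` are all hypothetical choices made for illustration.

```python
import random


def moving_average(x, taps=5):
    """Crude FIR lowpass (assumed stand-in): centered moving average, zero padded."""
    half = taps // 2
    padded = [0.0] * half + list(x) + [0.0] * half
    return [sum(padded[i:i + taps]) / taps for i in range(len(x))]


def highpass(x, taps=5):
    """Complementary highpass: the signal minus its lowpass part."""
    low = moving_average(x, taps)
    return [a - b for a, b in zip(x, low)]


def mixed_excitation(n, pitch_period, noise_gain=0.3, seed=0):
    """Sum a lowpassed pulse train and a highpassed Gaussian noise source."""
    rng = random.Random(seed)
    # Periodic source: one unit pulse per pitch period (stand-in for the adaptive source).
    pulses = [1.0 if i % pitch_period == 0 else 0.0 for i in range(n)]
    # Stochastic source: white Gaussian noise.
    noise = [rng.gauss(0.0, 1.0) for _ in range(n)]
    low_part = moving_average(pulses)
    high_part = highpass(noise)
    return [p + noise_gain * q for p, q in zip(low_part, high_part)]


if __name__ == "__main__":
    excitation = mixed_excitation(n=160, pitch_period=40)
    print(len(excitation))
```

Because the highpass branch is defined as the signal minus its lowpass part, the two branches split the spectrum in a complementary way, which is the property the sketch is meant to show; a real coder would use properly designed filters and gains.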

5.
The authors describe the multiband linear predictive (MB-LPC) vocoder and its operation at 2.4 kb/s and 1.2 kb/s. The MB-LPC vocoder uses mixed excitation and exploits the advantages of both time and frequency domain speech coding techniques to produce natural sounding, good quality speech. Subjective performance of speech at 2.4 kb/s produced by the MB-LPC is very close to that for the 4.15 kb/s INMARSAT-M IMBE speech coder. Informal listening tests have shown that in most cases people could not tell the difference between the new 2.4 kb/s MB-LPC coder and the 4.15 kb/s INMARSAT-M IMBE coder.

6.
In November 1995 the International Telecommunication Union Telecommunications Sector (ITU-T) approved an 8-kb/s speech coding algorithm with wireline quality. This culminated the effort that the CCITT had set in motion in 1990. This article presents the methods for managing the project through its major milestones from setting the terms of reference to the selection, testing, optimization, and dissemination of the algorithm. While G.729 was being finalized, a new requirement for a low complexity 8-kb/s speech coding algorithm arose. This article explains how the change in scope was accommodated without the unnecessary proliferation of incompatible algorithms.

7.
High compression rates of speech signals may be achieved by coding schemes based on relevant linguistic segments. A system is described that relies on a diphone recogniser as the coder and on a speech synthesiser reproducing speech starting from a diphone codebook as the decoder. The spoken message is encoded in textual (phoneme labels) plus prosody representation. This speech coding technique may be used for voice mail or phone communication over low bit rate channels.

8.
The ITU near term standard for very low bitrate video coding, H.263 (ITU-T SG 15/1 Rapporteurs Group for Very Low Bitrate Visual Telephony, 1995), is described. Both QCIF and a sub-QCIF format (128 × 96) are mandatory picture formats for the decoder; the CIF picture format is optional. The H.263 algorithm consists of a mandatory core algorithm and four negotiable options. With H.263 a significantly better picture quality than with H.261 can be achieved, depending on the content of the video scene and the coding parameters. Also, the cost of the H.263 video codec can be kept low if only the minimum required is implemented. The negotiable options of H.263 increase the complexity of the video codec, but also significantly improve the picture quality. H.263 is part of a set of recommendations for a very low bitrate audio visual terminal that was frozen in January 1995 and is based on existing technology. A long term activity is planned by ITU for the development of a new video coding algorithm (H.263/L) with a considerably better picture quality than H.263. This standard will be developed in joint co-operation with MPEG4.

9.
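Research on a Low-Bit-Rate Variable-Rate Speech Coding Algorithm Based on the Local Cosine Transform   Total citations: 1, self-citations: 0, citations by others: 1
This paper proposes applying the local cosine transform (LCT) to speech coding and designs a low-bit-rate variable-rate speech coder with an average bit rate of about 1.6 kbit/s. In the variable-rate coder, an SVM algorithm performs voice activity detection (VAD). The speech modes of active speech frames follow the classification used in GSM half-rate coding, except that the strongly voiced and moderately voiced modes are merged into a single medium-to-strong voiced mode. The local cosine transform coefficients of each speech mode and of silent frames (background noise) are quantized with split vector quantization; the codebooks are designed with the LBG algorithm, and codebook search during encoding uses a fast tree-structured search. Informal subjective listening tests show that speech reconstructed by the variable-rate coder has a MOS of about 3.15, comparable to speech reconstructed by the 2.4 kbit/s U.S. federal vocoder standard MELP. The coder is robust and suited to coding speech in the presence of various kinds of environmental noise.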

10.
Known coding techniques for transmitting moving images at very low bit rates are explained by the source models on which these coding techniques are based. It is shown that with motion-compensated hybrid coding, object-based analysis-synthesis coding, knowledge-based coding and semantic coding, there is a consistent development of source models. In consequence these coding techniques can be combined in a layered coding system. From experimental results obtained for object-based analysis-synthesis coding, estimates for the coding efficiency of such a layered coding system are derived using head and shoulder video telephone test sequences. It is shown that an additional compression factor of about 3 can be expected with such a complex layered coding system, when compared to block-based hybrid coding.

11.
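This paper briefly introduces the G.728 codec algorithm and the TMS320VC5402 fixed-point digital signal processor, and discusses the key techniques for implementing dual-channel, full-duplex ITU G.728 speech coding and decoding on the TMS320VC5402, including software design methods and algorithm optimization techniques. The results show that the resulting codec program delivers good speech quality and good real-time performance.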

12.
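This paper proposes a high-quality variable-rate speech coding algorithm with a bit rate of 0.75-5.4 kb/s. The algorithm improves the CELP excitation: speech is divided into four classes according to its characteristics, and each class uses a different excitation codebook. For voiced speech in particular, it proposes a pitch-synchronous embedded split excitation codebook, which exploits the quasi-periodicity of voiced speech so that the algorithm can recover voiced signals well at very low bit rates, overcoming CELP's poor synthesized speech quality below 4 kb/s, which results from small codebook size. In informal listening tests, its subjective quality exceeded that of the 1-8 kb/s variable-rate QCELP system, while its average rate was only about 2 kb/s, much lower than QCELP's 5 kb/s average, making it well suited to CDMA mobile communication systems.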

13.
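A Blocking-Artifact Removal Method for Low-Bit-Rate Video Compression   Total citations: 2, self-citations: 0, citations by others: 2
Low-bit-rate video compression is a key technology for multimedia communication. In block-based low-bit-rate video compression, blocking artifacts become a prominent problem. This paper discusses methods for removing blocking artifacts in low-bit-rate video compression and proposes a specific removal method based on the principle of constrained restoration. From an analysis of a software implementation, it concludes that the constrained-restoration method both raises the objective PSNR and reduces blocking artifacts.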

14.
This paper describes an object-based video coding system with new ideas in both the motion analysis and source encoding procedures. The moving objects in a video are extracted by means of a joint motion estimation and segmentation algorithm based on the Markov random field (MRF) model. The two important features of the presented technique are the temporal linking of the objects, and the guidance of the motion segmentation with spatial color information. This facilitates several aspects of an object-based coder. First, a new temporal updating scheme greatly reduces the bit rate to code the object boundaries without resorting to crude lossy approximations. Next, the uncovered regions can be extracted and encoded in an efficient manner by observing their revealed contents. The objects are classified adaptively as P objects or I objects and encoded accordingly. Subband/wavelet coding is applied in encoding the object interiors. Simulations at very low bit rates yielded comparable performance in terms of reconstructed PSNR to the H.263 coder. The object-based coder produced visually more pleasing video with less blurriness and devoid of block artifacts, thus confirming the advantages of object-based coding at very low bit-rates.

15.
Analysis of low bit rate image transform coding   Total citations: 8, self-citations: 0, citations by others: 8
Calculations based on high-resolution quantizations prove that the distortion rate $D(\bar{R})$ of an image transform coding is proportional to $2^{-2\bar{R}}$ when $\bar{R}$ is large enough. In wavelet and block cosine bases, we show that if $\bar{R} < 1$ bit/pixel, then $D(\bar{R})$ varies like $\bar{R}^{1-2\gamma}$, where $\gamma$ remains of the order of 1 for most natural images. The improved performance of embedded codings in wavelet bases is analyzed. At low bit rates, we show that the compression performance of an orthonormal basis depends mostly on its ability to approximate images with a few nonzero vectors.
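For reference, the two regimes stated in this abstract can be written side by side. This only restates the abstract; the proportionality constants $C_1$ and $C_2$ are placeholders that the source does not give.

```latex
\[
D(\bar{R}) \approx
\begin{cases}
C_1 \, 2^{-2\bar{R}}, & \bar{R} \text{ large enough (high-resolution regime)},\\[4pt]
C_2 \, \bar{R}^{\,1-2\gamma}, & \bar{R} < 1 \text{ bit/pixel (low bit rate regime)}.
\end{cases}
\]
```

With $\gamma$ of the order of 1, the exponent $1-2\gamma$ is close to $-1$, so at low bit rates the distortion falls off roughly like $1/\bar{R}$ rather than exponentially.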

16.
We present a semi-fixed-length motion vector coding method for H.263-based low bit rate video compression. The method exploits structural constraints within the motion field. The motion vectors are encoded using semi-fixed-length codes, yielding essentially the same levels of rate-distortion performance and subjective quality achieved by H.263's Huffman-based variable length codes in a noiseless environment. However, such codes provide substantially higher error resilience in a noisy environment.

17.
In this paper a low bit rate subband coding scheme for image sequences is described. Typically, the scheme is based on temporal DPCM in combination with an intraframe subband coder. In contrast to previous work, however, the subbands are divided into blocks onto which conditional replenishment is applied, while a bit allocation algorithm divides the bits among the blocks assigned for replenishment. A solution is given for the ‘dirty window’ effect by setting blocks to zero that were assigned to be replenished but received no bits. The effect of motion compensation and the extension to color images are discussed as well. Finally, several image sequence coding results are given for a bit rate of 300 kbit/s.

18.
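Considering the transient, gradual, and slowly varying characteristics of image signals, and starting from transient non-stationary analysis and reconstruction of image signals, this paper proposes a transient time-frequency interpolation scheme for very-low-rate image compression coding that features transient analysis, parameter interpolation, and time-frequency reconstruction. The application of transient time-frequency analysis theory offers a new approach for future research in image signal processing.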

19.
This article presents an efficient rate control scheme for H.264/AVC video coding in low bit rate environments. In the proposed scheme, an improved rate-distortion (RD) model is developed by both analytical and empirical approaches. It involves an enhanced mean absolute difference estimating method and a more rate-robust distortion model. Based on this RD model, an efficient macroblock-layer rate control scheme for H.264/AVC video coding is proposed. Experimental results show that this model encodes video sequences with higher peak signal-to-noise ratio gains and generates a bit stream closer to the target rate.

20.
We present an efficient computation constrained block-based motion vector estimation algorithm for low bit rate video coding that yields good tradeoffs between motion estimation distortion and number of computations. A reliable predictor determines the search origin, localizing the search process. An efficient search pattern exploits structural constraints within the motion field. A flexible cost measure used to terminate the search allows simultaneous control of the motion estimation distortion and the computational cost. Experimental results demonstrate the viability of the proposed algorithm in low bit rate video coding applications. The resulting low bit rate video encoder yields essentially the same levels of rate-distortion performance and subjective quality achieved by the UBC H.263+ video coding reference software. However, the proposed motion estimation algorithm provides substantially higher encoding speed as well as graceful computational degradation capabilities.
