共查询到20条相似文献,搜索用时 312 毫秒
1.
YANG Zhen 《中国邮电高校学报(英文版)》2002,9(4)
The author designs a new speech codec in this paper, which is based on ANN to carry out nonlinear prediction. This new codec synthesizes speeches with better quality than the conventional waveform or hybrid codecs does at the same bit rate. Moreover, the most important characteristic of this codec is the low coding delay, which will benefit the enhancement of the speech communication QoS when we transmit speech signals in IP or ATM networks. 相似文献
2.
3.
Hardware implementation aspects of sophisticated speech codecs are addressed. The major points discussed are approaches to implementation of the sophisticated speech codecs, requirements for DSP (digital signal-processing) implementation of the codecs, such as the type of arithmetic processing and the necessity of bit-level specifications, and codec implementation and DSP programming techniques for three specific coding algorithms: 32-kb/s ADPCM (adaptive digital pulse code modulation), 64-kb/s (7 kHz) SB (subband)-ADPCM, and 16-kb/s APC-AB (adaptive predictive coding with adaptive but allocation) codecs 相似文献
4.
OpenH323是一个开放源码的VoIP(Voice over IP)协议栈,支持H.323和SIP等多媒体通信协议,为多媒体应用提供了一个很好的开发平台。G.723.1是ITU-T建议在中低速率多媒体通信中使用的语音压缩算法,目前该算法已在IP电话系统中得到广泛应啊。基于OpenH323协议栈实现G.723.1Codec有着十分重要的应用价值。介绍在OpenH323的软件终端上实现G.723.1Codec的基本方法,并可推广到G.729等其它多种语音压缩算法。 相似文献
5.
重点讨论了iLBC编解码器独立于帧的长期预测。独立于帧的长期预测是用来在编码语音没有遭受与传输丢失相关的多帧语音退化情况下,开发斜度标记相关的办法。然后介绍了iLBC,G.729A和G.723.1编解码器的平均主观得分MOS,并用信号为例说明基于独立于帧的长期预测编解码器和CELP编解码器之间的不同,最后用语音重构的例子说明二者语音质量间的差别。 相似文献
6.
在AMR的基础上介绍了一种用来提高系统容量的源自适应(SA-AMR)算法.简要阐述了算法编解码原理,分析了在无线通信系统中引入SA-AMR技术的优点,并与AMR进行性能比较,测试结果表明:SA-AMR编码速率和模式可变,能够提高语音质量和发射稳健性,有效增大系统容量,性能优于AMR. 相似文献
7.
8.
This study considers the problem of transmitting generic (non-speech) data through compressed voice channels such as those
used in wireless communications networks. These networks employ voice codecs that are designed to efficiently encode and reproduce
the relatively slow-changing signals of human speech, which leads to communication channels that are nonlinear and have long-term
memory. A data modem is presented that utilizes finite alphabets of waveforms that are numerically optimized to be as separable
as possible after passing through the voice codec. The optimization of the finite alphabet is performed using a pattern search
algorithm. When used with the GSM Enhanced Full-Rate Voice Codec, this system demonstrated improved performance, in terms
of error rates, compared to previously reported results. Simulation results for four other voice codecs are also presented. 相似文献
9.
《Networking, IEEE/ACM Transactions on》2009,17(4):1042-1051
10.
Renaud Di Francesco 《电信纪事》1992,47(5-6):214-226
This article presents new speech coding methods for real time application (telephone, videophone) or offline applications (storage). Speech quality is in the classical telephone range, with a 4 kHz bandwidth and a sampling at 8 kHz. An elementary approach leads to a 16 kbit/s codec and a 24 kbit/s codec, using integer codebooks and fast computations. The speech quality of the two codecs has been measured in comparison with more complex ones and in realistic conditions, with noisy telecommunication channels. The elementary approach is completed by a synthetic model, with a systematic generalization of the algorithms (e.g. for a generalized vselp). Some methods for channel protection, which are already known by the speech coding researchers, are summed up in the Appendix. A change of representation for low density codes (less than 1 bit/sample) is proposed. 相似文献
11.
本文提出一种新的用于LPC语音编码器的BSP激励信号,即根据语音产生的原理,以一个幅度受到二项式调制的正弦波BSP(Binomial Sine Pulse)作为LPC激励源,该二项式反映了激励信号在一个基音周期内的变化趋势。本文同时推导了BSP激励参数的求取和改进方法。实验结果表明,在此基础上构造的BSP语音编解码器具有低复杂度、低时延的优点,同时编码速率在低至2.65kb/s时,具有较高的合成语音质量。 相似文献
12.
Some of the many factors that go into the design of a personal communications system (PCS) are considered. The ways in which these factors affect the speech codec are discussed. Cell size, multiple access methods, and the trade-offs in PCS design are examined. Possible future designs of PCSs and their codecs are described 相似文献
13.
Wideband speech is the major differentiation and attraction of third-generation network services in both the circuit and packet switched domain. Increased audio bandwidth introduces a significant leap in perceived quality of service compared to currently utilized narrowband telephony in second-generation mobile communications and the PSTN. The adaptive multirate wideband (AMR-WB) speech codec is the service enabler for improved user experience. It is an established 3GPP and ITU-T wideband speech codec standard and represents the state-of-the-art in speech quality as well as robustness in error prone radio channels. It is also the first codec algorithm standardized for wideband speech for mobile communications. 相似文献
14.
S. DIMOLITSAS F. L. CORCORAN C. RAVISHANKAR A. WONG S. F. DE CAMPOS NETO 《International Journal of Satellite Communications and Networking》1996,14(5):381-387
In this paper are presented the method and results of a subjective evaluation, which was conducted in order to select a new speech codec for the Inmarsat mini-M system. The mini-M system is designed to provide the next generation of global, notebook-sized satellite terminals for transportable, land-mobile and maritime voice, facsimile, and data communications. Overall, six different codecs operating at a combined source and channel rate of 4⋅8 kbit/s were evaluated in a series of six subjective tests. From this, it was concluded that one codec was able to deliver performance that is equivalent to, or better than, the IS-54 full-rate ditigal cellular 8 kbit/s VSELP codec, and was selected for use in the mini-M system. 相似文献
15.
A digital cellular mobile radio system has been under development in Europe since 1982 under the coordination of the working group CEPT GSM (groupe speciale mobile). In a recent coordinated experiment, listening opinion tests were performed on the speech output of six candidate 16 kb/s speech coding schemes for this system: one regular-pulse excited coder, one multiple-excited coder, and four subband coders. For comparison purposes, test conditions from a companded cellular FM system currently in operation were included in the experiment. The six codecs were companded in terms of subjective quality, transmission delay, and ease of implementation. In this overall comparison, no single codec was superior in all respects. However, the regular-phase-excited linear predictive coder, which provided the best speech quality, had acceptable complexity and delay and was singled out for further improvement. Ultimately, an improved version of this codec, a regular-pulse-excited/long-term-prediction LPC coder was selected 相似文献
16.
This paper proposes an adaptive noise canceller (ANC) with low signal distortion for speech codecs. The proposed ANC has two adaptive filters: a main filter (MF) and a subfilter (SF). The signal-to-noise ratio (SNR) of input signals is estimated using the SF. To reduce signal distortion in the output signal of the ANC, a step size for coefficient update in the MF is controlled according to the estimated SNR. Computer simulation results using speech and diesel engine noise recorded in a special-purpose vehicle show that the proposed ANC reduces signal distortion in the output signal by up to 15 dB compared with a conventional ANC. Results of subjective listening tests show that the mean opinion scores (MOSs) for the proposed ANC with and without a speech codec are one point higher than the scores for the conventional ANC 相似文献
17.
A new adaptive quantizer which uses a combination of instantaneous and syllabic adaptation is presented for use in speech codecs. It can be designed to adapt to changes in the mean, variance, and pdf shape of its input signal, and to quantize the signal using one or more bits/sample. It is therefore called the generalized hybrid adaptive quantizer (GHAQ). An efficient procedure for optimizing the GHAQ using a training sequence of signal samples is described, and the effects on the performance of the GHAQ of varying the memory length and the syllabic compandor time constant are investigated. It is found that an optimized version of the two-bit GHAQ offers improved signal-to-noise ratio over Jayant's adaptive quantizer with a one-word memory when it is used in a predictive speech codec with a zero-, first-, or second-order fixed predictor 相似文献
18.
Yuh-Feng Hsu Yung-Chang Chen Chaur-Jia Huang Ming-Jyh Sun 《Broadcasting, IEEE Transactions on》1998,44(1):77-86
An effective MPEG-2 spatial scalable video codec is designed, and error concealment technique of associated transport stream is proposed. The spatial scalability can provide robust error resilience for Ka-band rain attenuation, as well as the co-existence of HDTV and SDTV systems. In MPEG-2, the spatial scalable encoder combines both spatial and temporal predictions. This paper proposes a near optimal spatial-temporal weighting analyzer to properly assign weights. Simulation shows that the proposed spatial scalable codec structure outperforms the performance of the other MPEG-2 codecs for TV broadcasting in Ka-band using satellites. We also propose a suitable error protection and concealment method for MPEG-2 transport stream. With the combination of scalable coding and error protection systems, the proposed system will achieve high link availability 相似文献
19.
20.
Current practice for multichannel delta modulation (DM) terminals uses one DM codec per channel-end. This paper describes methods by which one high-speed DM codec can be timeshared over a large number of channels. The methods are also applied to multichannel differential PCM (DPCM) terminals. In both cases the time-shared codecs include step-size adaptation determined by the recent past history of the coded digital signal. A method for digital conversions between multichannel linear PCM and adaptive DPCM (ADPCM) formats is also described. 相似文献