Found 15 similar documents (search time: 62 ms)
1.
2.
3.
4.
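Estimating the pitch period of brown planthopper calls by homomorphic deconvolution. Total citations: 2; self-citations: 0; citations by others: 2
This paper introduces the basic principles of homomorphic deconvolution and its application to the analysis of brown planthopper calls. Homomorphic filtering is briefly reviewed. A sound-production mechanism for the brown planthopper and an acoustic model of how its calls are generated are proposed. Based on this model, the real cepstrum is used to classify a call segment as "voiced" or "unvoiced"; if it is voiced, its pitch period is estimated. Finally, the variation of the pitch period within a single call is measured.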
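The voiced/unvoiced decision and pitch-period estimate described in this abstract can be illustrated with a generic real-cepstrum sketch. This is not the authors' implementation: the function names, the lag search range, and the fixed peak threshold are all assumptions made for illustration, and a naive O(N²) DFT is used only to keep the example free of dependencies.

```python
import cmath
import math

def dft(x, inverse=False):
    """Naive discrete Fourier transform (O(N^2)); adequate for short frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        out.append(s / n if inverse else s)
    return out

def real_cepstrum(frame):
    """Real cepstrum: inverse DFT of the log magnitude spectrum."""
    spectrum = dft(frame)
    # The small constant avoids log(0); its value is an assumption.
    log_mag = [math.log(abs(v) + 1e-12) for v in spectrum]
    return [v.real for v in dft(log_mag, inverse=True)]

def estimate_pitch_period(frame, min_lag, max_lag, threshold):
    """Return the pitch period in samples, or None if the frame looks unvoiced.

    The frame is treated as voiced when the largest cepstral value in
    [min_lag, max_lag] exceeds `threshold` (a hypothetical decision rule).
    """
    cep = real_cepstrum(frame)
    lag = max(range(min_lag, max_lag + 1), key=lambda q: cep[q])
    return lag if cep[lag] > threshold else None
```

Calling `estimate_pitch_period` on successive frames of one call would give a pitch-period track like the one the abstract reports measuring; suitable frame lengths and thresholds depend on the recording and are not given in the source.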
5.
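Among speech coding algorithms, mixed excitation linear prediction (MELP) models the characteristics of natural speech well and can synthesize relatively high-quality speech at low bit rates, which makes it one of the most promising algorithms in modern low-rate speech coding. In wireless, satellite, military, and secure communications, however, channel bandwidth is a pressing constraint, so research on lower-rate and even ultra-low-rate speech compression coding is necessary. Addressing the very-low-rate requirements of speech communication, this paper analyzes several current MELP-based low-rate speech coding algorithms in depth, summarizes their principles and key techniques, and compares their speech quality.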
6.
Abstract: A new scheme that aims to cut down on the computational cost of the vector quantization (VQ) encoding procedure is proposed in this paper. In this scheme, the correlation between the codewords in the codebook is exploited and three test conditions are designed to filter out the impossible codewords in the codebook. The design of test conditions is based on the concept of integral projection. From the experimental results, it is shown that the new scheme outperforms all the other schemes proposed so far in speeding up the VQ encoding procedure. When the codebook of 1024 codewords is used in the proposed scheme, the execution time it consumes is less than 2 per cent of that needed by the full search algorithm. The average time reduction rate is approximately 97.7 per cent compared to the execution time for the full search algorithm. In other words, the proposed scheme indeed provides an effective approach to speed up the VQ encoding procedure.
7.
Abstract: To achieve high coding efficiency, modern speech coders adopt hybrid coding approaches, which utilize different coding mechanisms for various classified speech segments. With known voiced/unvoiced detection, in this paper, a classified LPC quantization (CLPQ) scheme is presented to effectively encode line spectral frequencies (LSF). The proposed CLPQ scheme improves the performance of the classified LSF vector quantizer, which adopts two LSF codebooks derived separately from voiced and unvoiced speech frames. With an objective spectral distortion measure, the CLPQ scheme successfully reduces the bit rate by about 1 bit/frame. Many classified LSF quantizers with different codebook structures and bit rates were evaluated. The results should help in designing a classified LSF quantizer that reaches a compromise between distortion, bit rate and computational complexity.
8.
Wen‐Shiung Chen, Lili Hsieh 《International journal of imaging systems and technology》 2002,12(4):166-174
Wavelet transform coding (WTC) with vector quantization (VQ) has been shown to be efficient in the application of image compression. An adaptive vector quantization coding scheme with the Gold‐Washing dynamic codebook‐refining mechanism in the wavelet domain, called symmetric wavelet transform‐based adaptive vector quantization (SWT‐GW‐AVQ), is proposed for still‐image coding in this article. The experimental results show that the GW codebook‐refining mechanism working in the wavelet domain rather than the spatial domain is very efficient, and the SWT‐GW‐AVQ coding scheme may improve the peak signal‐to‐noise ratio (PSNR) of the reconstructed images with a lower encoding time. © 2002 Wiley Periodicals, Inc. Int J Imaging Syst Technol 12, 166–174, 2002; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ima.10024
9.
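The coding performance of MPEG-4 AAC depends heavily on the coding efficiency and convergence speed of the quantization module, but the commonly used rate-distortion controller, which is based on a two-loop search structure, degrades encoder performance, especially at low bit rates. A new quantization optimization algorithm is proposed. The new scheme uses a single-loop structure: the quantization information of the preceding several frames is used to linearly predict the initial quantization step size of the current frame, and the BFOS algorithm, which approaches optimal bit allocation, then controls the adjustment of the step size. Simulations show that the coding performance of the new scheme is clearly better than that of the MPEG-4 AAC VM and that, compared with the BFOS algorithm, the computational load is greatly reduced.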
10.
Han‐Gyu Kim, Gil‐Jin Jang, Jeong‐Sik Park, Ji‐Hwan Kim, Yung‐Hwan Oh 《International journal of imaging systems and technology》 2013,23(1):64-70
This article proposes a novel speech and sound segregation framework incorporating a technique for correcting a series of pitch periods based on particle filtering. The conventional pitch track correction method finds the peak locations of the autocorrelation functions to estimate the pitch period, and only the longest reliable pitch streak is used to correct unreliable pitch tracks. Especially in noisy environments, it is hard to find long and reliable pitch streaks, resulting in the degradation of the speech segregation performance. The proposed algorithm based on particle filtering considers all the reliable pitch streaks rather than the longest one and smoothly connects the scattered pitch streaks. To apply the particle filtering algorithm to pitch track correction, the importance weight computation to account for the degree of matchness of the found pitch to the individual spectro‐temporal components is also proposed. The performance of the proposed method is evaluated by the results of speech segregation experiments for the mixtures of speech and various noise sources in various mixing signal‐to‐noise ratios (SNRs). The evaluation measures were SNR, energy loss ratio, and noise residue ratio of the segregated speech, and all these measures showed that the proposed segregation method achieved superior performance compared to the conventional approach. © 2013 Wiley Periodicals, Inc. Int J Imaging Syst Technol, 23, 64–70, 2013.
11.
Wen‐Shiung Chen, Lili Hsieh, Shang‐Yuan Yuan 《International journal of imaging systems and technology》 2002,12(5):204-210
One of the major difficulties arising in vector quantization (VQ) is high encoding time complexity. Based on the well‐known partial distance search (PDS) method and a special order of codewords in VQ codebook, two simple and efficient methods are introduced in fast full search vector quantization to reduce encoding time complexity. The exploitation of the “move‐to‐front” method, which may get a smaller distortion as early as possible, combined with the PDS algorithm, is shown to improve the encoding efficiency of the PDS method. Because of the feature of energy compaction in DCT domain, search in DCT domain codebook may be further speeded up. The experimental results show that our fast algorithms may significantly reduce search time of VQ encoding. © 2003 Wiley Periodicals, Inc. Int J Imaging Syst Technol 12, 204–210, 2002; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ima.10030
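The partial distance search (PDS) idea that this abstract builds on can be sketched as follows. This is a minimal illustration of generic PDS only; it does not include the authors' move-to-front ordering or DCT-domain search, and the function name `pds_nearest` is hypothetical.

```python
def pds_nearest(vector, codebook):
    """Return (index, squared_distance) of the codeword nearest to `vector`.

    The squared Euclidean distance is accumulated one dimension at a time,
    and a codeword is abandoned as soon as its partial distance reaches the
    best full distance found so far.
    """
    best_index, best_dist = -1, float("inf")
    for index, codeword in enumerate(codebook):
        dist = 0.0
        for x, c in zip(vector, codeword):
            dist += (x - c) ** 2
            if dist >= best_dist:
                break  # this codeword cannot beat the current best
        else:
            # The loop finished without a break: new best match.
            best_index, best_dist = index, dist
    return best_index, best_dist
```

The result is identical to a full search; only the amount of arithmetic changes. The earlier a close codeword is met, the sooner later candidates are rejected, which is why the order of codewords in the codebook matters for this kind of search.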
12.
13.
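An improved speech enhancement algorithm based on log-spectral estimation is proposed. Unlike conventional speech enhancement algorithms, under speech presence uncertainty it uses soft-decision gain modification to adjust the log-spectral amplitude of the noisy speech signal and suppress background noise. The improved a priori SNR estimator and the estimator of the a priori speech absence probability that are introduced allow the speech presence probability to be estimated effectively. From this, the spectral gain function for the speech-present case is obtained and combined, by weighting, with the gain function set for the speech-absent case to give the overall spectral gain function. Computer simulations show that, even at low SNR, when the input background noise is additive noise such as white Gaussian noise or pink noise, the proposed algorithm suppresses noise very markedly and effectively overcomes the "musical noise" and speech distortion introduced by conventional algorithms.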
14.
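This paper proposes a new adaptive-quantization digital audio watermarking algorithm. The algorithm first reduces a visually recognizable binary watermark image to a one-dimensional watermark sequence and applies random scrambling and BCH error-correction coding to it. It then divides the original digital audio signal into audio segments. Finally, it applies the fast Fourier transform (FFT) to selected segments and embeds the watermark by quantizing the FFT coefficients with a quantization step size determined adaptively from a human auditory system (HAS) model. Watermark extraction does not require the original audio signal. Simulation results show that the algorithm offers good transparency and good robustness against attacks such as additive noise, lossy compression, low-pass filtering, and resampling.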
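The abstract does not give the exact quantization rule, so the following is only a generic sketch of quantization-based (blind) embedding of one bit into one coefficient value, assuming a parity rule on the quantization bin index. The names `embed_bit` and `extract_bit` are hypothetical, and `step` stands in for the step size that the paper derives adaptively from the HAS model.

```python
import math

def embed_bit(value, bit, step):
    """Move `value` to the centre of the nearest bin whose index parity equals `bit`."""
    position = value / step
    index = math.floor(position)
    if index % 2 != bit:
        # Choose the neighbouring bin that lies closer to the original value.
        index += 1 if (position - index) >= 0.5 else -1
    return (index + 0.5) * step

def extract_bit(value, step):
    """Recover the bit from the bin index parity; the original signal is not needed."""
    return math.floor(value / step) % 2
```

Because the marked value sits at a bin centre, any disturbance smaller than half a step leaves the extracted bit unchanged, while the embedding distortion is at most one step. A larger step therefore trades transparency for robustness, which is the balance an adaptive step size is meant to manage.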
15.
Abstract: This paper presents a novel algorithm for the joint design of source and channel codes. In the algorithm, channel‐optimized vector quantization (COVQ) and rate‐compatible punctured convolutional (RCPC) coding are used for design of the source code and the channel code, respectively. We employ the genetic algorithm (GA) to prevent the design of COVQ from falling into a poor local optimum. We also adopt the GA to reduce the computational time needed for realizing the unequal error protection scheme best matched to the COVQ. The GA‐based source coding and channel coding schemes are then iteratively combined to achieve a near global optimal solution for the joint design. Numerical results show that the algorithm can be an effective alternative for applications where high rate‐distortion performance and low computational complexity are desired.