共查询到18条相似文献,搜索用时 140 毫秒
1.
为实现高质量的极低速语音编码,提出一种基于压缩感知理论的线谱对(LSP)参数降维量化算法。编码端利用压缩感知理论对超帧LSP高维矢量进行降维处理,将原始LSP参数投影到低维空间,得到低维测量值,然后采用分裂矢量量化算法对测量值进行量化;解码端以量化后的测量值为已知条件,利用正交匹配追踪算法重构出原始LSP高维矢量。实验结果表明,本算法相对低速语音编码中的矩阵量化方案,平均谱失真降低了0.23dB,相对基于DCT变换的降维量化方案,平均谱失真降低了0.13dB。这种先降维再量化的思想可以大幅减少编码所需的比特数及码本存储复杂度,有效降低语音编码速率,并且合成语音可懂度、自然度较高,音质虽有所失真,但基本上感觉不到明显的听觉质量下降。 相似文献
2.
3.
4.
混合型中速率语音编码系统 总被引:1,自引:0,他引:1
朱琦 《南京邮电学院学报(自然科学版)》1997,17(3):35-37
设计了一种新型的中速率混合语压编码系统,该系统把语音分割成基带(0.3~1kHz)和高频部分(1~3.4kHz)对于重要的基带信号,采用高质量的4bit/样点的ADPCM技术;对于相对次要的高频信号,采用高效的VQ(矢量量化)技术,以压缩码率,对于矢量量化,还提出了一种新的快速算法,通过某种预处理使得搜索码本的速度提高10倍以上,且质量等效于全搜索方法,本系统具有实现简单,时延短的特点,且主观质量 相似文献
5.
一种指数型模糊学习矢量量化图像编码算法 总被引:6,自引:0,他引:6
本文分析了模糊矢量量化(FVQ)图像编码的原理,提出了一种指数型模糊学习矢量量化算法(EFLVQ)。实验结果表明,该算法具有快速收敛性能,设计的图像码书峰值信噪比与FVQ算法相比也略有改善。 相似文献
6.
一种基于自组织神经网络的图像压缩编码算法 总被引:2,自引:0,他引:2
本文提出了一种基于自组织特征映射神经网络的图像压缩编码算法,即VQ+DPCM+DCT算法,实验表明,在压缩比为31.8∶1时,其峰峰信噪比为35.82dB(Lenna亮度图像),且主观效果良好,这是至今为止使用矢量量化(VQ)方法压缩图像所获得的最好结果。 相似文献
7.
本文提出一种在离散多频(DMT)调制中采用区域分割,依次沿着各子信道进行标量矢量量化器(SVQ)成形的方法,并对其性能进行了分析和模拟。 相似文献
8.
9.
本文针对波形内插(WI)语音编码模型和参数量化等技术进行了研究,并最终提出了一种基于二维非负矩阵分解的1kb/s波形内插(2DNMF-WI)语音编码算法. 文中采用二维非负矩阵分解(2D-NMF)方法来分解语音特征波形(CW),该分解方法在行和列两个方向上同时压缩CW幅度谱矩阵的维数,使得CW幅度谱矩阵降维后得到的编码矩阵维数较小,易于量化. 此外,在甚低速率语音编码中,由于没有足够的比特数来描述编码参数,往往很难得到高质量的合成语音. 本算法采用两帧联合编码、帧间后向预测三级矢量量化、离散余弦变换(DCT)和分裂式矩阵量化等技术来降低编码速率和改善音质. 非正式主观听觉测试显示,1kb/s 2DNMF-WI编码器合成语音的质量稍差于2kb/s的NMF-WI语音编码算法. 相似文献
10.
11.
A variable dimension vector quantizer (VDVQ) has codewords of unequal dimensions. Here, a trellis-based sequential optimal VDVQ encoding algorithm is proposed. Also, a VDVQ codebook design algorithm based on splitting a node with equal or reduced dimensions is proposed that does not require any codebook parameter to be prespecified unlike known schemes. The VDVQ system is shown to outperform a few known VQ systems for AR(1) sources 相似文献
12.
Many image compression techniques require the quantization of multiple vector sources with significantly different distributions. With vector quantization (VQ), these sources are optimally quantized using separate codebooks, which may collectively require an enormous memory space. Since storage is limited in most applications, a convenient way to gracefully trade between performance and storage is needed. Earlier work addressed this problem by clustering the multiple sources into a small number of source groups, where each group shares a codebook. We propose a new solution based on a size-limited universal codebook that can be viewed as the union of overlapping source codebooks. This framework allows each source codebook to consist of any desired subset of the universal code vectors and provides greater design flexibility which improves the storage-constrained performance. A key feature of this approach is that no two sources need be encoded at the same rate. An additional advantage of the proposed method is its close relation to universal, adaptive, finite-state and classified quantization. Necessary conditions for optimality of the universal codebook and the extracted source codebooks are derived. An iterative design algorithm is introduced to obtain a solution satisfying these conditions. Possible applications of the proposed technique are enumerated, and its effectiveness is illustrated for coding of images using finite-state vector quantization, multistage vector quantization, and tree-structured vector quantization. 相似文献
13.
波形码书的二次设计方法研究 总被引:3,自引:0,他引:3
一个实用的矢量量化码书应该具有体积小、代表性强的特点 ,本文提出了对已知波形码书进行二次设计的两种方法 ,一种是基于码字使用频率 ,另一种是基于码字能量 ,二者都可降低码书复杂度 ,获得高质合成语音。进一步的分析揭示了两种方法的联系。 相似文献
14.
Two speech compression systems based on codebooks of inverse filters produced by off-line linear predictive coding (LPC) and vector quantization (VQ) techniques are considered. The first system is a pitch excited vocoder that is a variation on a speech coding system based upon vector quantization. The encoder selects an LPC reverse filter from a finite codebook that best "matches" an observed frame of sampled speech. This filter is in turn used to determine the voicing and digitized pitch information. Unlike LPC systems, the digitization is performed in a single step on the data rather than separate modeling and digitization steps. The second system is a tree encoding system that uses the filter selected by an inverse filter matching vocoder to "color" a tree that is then searched for a minimum distortion path for the original sampled speech waveform. This system can be viewed as a hybrid between an adaptive predictive coder and a universal tree encoder. The two systems are described, simulated, and compared with other similar systems. 相似文献
15.
The nonlinear principal component analysis (NLPCA) method is combined with vector quantization for the coding of images. The NLPCA is realized using the backpropagation neural network (NN), while vector quantization is performed using the learning vector quantizer (LVQ) NN. The effects of quantization in the quality of the reconstructed images are then compensated by using a novel codebook vector optimization procedure. 相似文献
16.
一种高质量的8kb/sACELP语音编码算法及其实时实现 总被引:2,自引:0,他引:2
本文介绍了一种编码速率的8kb/s的高质量实时语音编码器,它采用了代数码本激励线性预测(ACELP)的编码方法,并采用高效的码本结构,码本搜索技术和矢量量化技术来获得较高的语音合成质量和较低的算法复杂度,在无需外部RAM和ROM的情况下,该算法已用TMC320C50实时实现并用于一个实时的全双工通信系统,通过信噪比及人耳主观听视实验等性能测试表明,该算法的性能明显优于优于北美的8kb/sVSELP 相似文献
17.
一种基于改进的矢量量化技术的语音波形编码 总被引:1,自引:0,他引:1
针对GLA(Generalized Lloyd Algorithm)对初始码书的敏感性,用PNN(成对最近邻)算法训练初始码书,并将该改进措施用于语音波形编码。实验证明,此改进措施有助于克服GLA对初始码书的敏感性,并且语音恢复效果良好,失真度较低。 相似文献