首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 281 毫秒
1.
提出了一种新的适用于离散HMM说话人辨认系统的VQ码本训练方法,码本的训练准则是使码本中各码字的利用率趋于均等.将新方法训练的码本与用LBG算法训练的码本进行了比较,实验表明,在基于离散HMM的说话人辨认系统中,用新方法训练的码本性能优于用LBG算法训练的码本,特别是在与文本无关的情况下,使系统的正确辨认率显著提高.  相似文献   

2.
The author considers vector quantization that uses the L (1) distortion measure for its implementation. A gradient-based approach for codebook design that does not require any multiplications or median computation is proposed. Convergence of this method is proved rigorously under very mild conditions. Simulation examples comparing the performance of this technique with the LBG algorithm show that the gradient-based method, in spite of its simplicity, produces codebooks with average distortions that are comparable to the LBG algorithm. The codebook design algorithm is then extended to a distortion measure that has piecewise-linear characteristics. Once again, by appropriate selection of the parameters of the distortion measure, the encoding as well as the codebook design can be implemented with zero multiplications. The author applies the techniques in predictive vector quantization of images and demonstrates the viability of multiplication-free predictive vector quantization of image data.  相似文献   

3.
李霆  王东进  刘发林 《电讯技术》2007,47(1):151-153
将遗传算法与LBG算法相结合,得到了一种矢量量化码书设计算法.利用遗传算法的全局优化能力得到最优的矢量量化码书;同时,克服了传统遗传算法收敛速度慢的缺点.实验结果表明,文中提出的算法性能上优于LBG算法,且收敛速度较快.  相似文献   

4.
The Linde-Buzo-Gray (LBG) algorithm is usually used to design a codebook for encoding images in vector quantization. In each iteration of this algorithm, one must search the full codebook in order to assign the training vectors to their corresponding codewords. Therefore, the LBG algorithm needs a large computation effort to obtain a good codebook from the training set. The authors propose a finite-state LBG (FSLBG) algorithm for reducing the computation time. Instead of searching the entire codebook, they search only those codewords that are close to the codeword for a training vector in the previous iteration. In general, the number of these possible codewords can be made very small without sacrificing performance. By only searching a small part of the codebook, the computation time is reduced. In experiments, the performance of the FSLBG algorithm in terms of signal-to-noise ratio is very close to that of the LBG algorithm. However, the computation time of the FSLBG algorithm is about 10% of the time required by the LBG algorithm  相似文献   

5.
采用遗传算法的VQ码本设计及说话人识别   总被引:2,自引:0,他引:2  
芮贤义  俞一彪 《信号处理》2005,21(3):289-292
矢量量化(VQ)方法是文本无关说话人识别中广泛应用的建模方法之一。在矢量量化过程中,经典的LBG算法收敛速度快,但极易收敛于局部最优点,无法保证根据有限样本数据得到最优码本,并最终影响系统识别性能。考虑到遗传算法(GA)是一种具有全局化寻优搜索能力的算法,本文提出了遗传算法和K均值算法相结合的综合分析方法GA-K进行码本设计,改善了码本的质量。讨论了具体的算法实现,分析了在不同的特征参数LPCC及MFCC、不同测试语音长度下的说话人识别性能。实验结果显示,GA-K方法优于传统的LBG算法,可以很好地协调收敛性和识别率之间的关系。  相似文献   

6.
该文提出分区域收敛的快速码书训练算法LC,它与LBC算法相比,结构简单、速度快,用典型的测试图像Lena和Barbara做实验,表明LC算法峰值信噪比只比LBG算法少2%左右,但运行速度东LBG的4.61-13.6倍。在比特率为0.375 bpp条件下,LC算法与LBG算法的重建图像质量无明显差别。  相似文献   

7.
A comparison of several vector quantization codebook generationapproaches   总被引:1,自引:0,他引:1  
A review and a performance comparison of several often-used vector quantization (VQ) codebook generation algorithms are presented. The codebook generation algorithms discussed include the Linde-Buzo-Gray (LBG) binary-splitting algorithm, the pairwise nearest-neighbor algorithm, the simulated annealing algorithm, and the fuzzy c-means clustering analysis algorithm. A new directed-search binary-splitting method which reduces the complexity of the LBG algorithm, is presented. Also, a new initial codebook selection method which can obtain a good initial codebook is presented. By using this initial codebook selection algorithm, the overall LBG codebook generation time can be reduced by a factor of 1.5-2.  相似文献   

8.
矢量量化(VQ)是一种极其重要的信号压缩方法,广泛地应用于图像信号压缩、语音信号压缩领域。它I的主要问题是码本设计,在码本设计过程中,有许多算法被提出。本文提出了PSO和LBG算法相结合的1PSO—LBG算法采设计码本,改善了码本质量,提高了收敛速度。  相似文献   

9.
Vector quantization (VQ) is an efficient technique for data compression and has been successfully used in various applications. The methods most commonly used to generate a codebook are the Linde, Buzo, Gray (LBG) algorithm, fuzzy vector quantization (FVQ) algorithm, Kekre‘s Fast Codebook Generation (KFCG) algorithm, discrete cosine transform based (DCT-based) codebook generation method, and k-principle component analysis (K-PCA) algorithm. However, if the separation boundaries in codebook generation are nonlinear, their performance can degrade fast. In this paper, we present a kernel fuzzy learning (KFL) algorithm, which takes advantages of the distance kernel trick and the gradient-based fuzzy clustering method, to create a codebook automatically. Experiments with real data show that the proposed algorithm is more efficient in its performance compared to that of the LBG, FVQ, KFCG, and DCT-based method, and to the K-PCA algorithm.  相似文献   

10.
The aim of this study is to generate vector quantisation (VQ) codebooks by integrating principle component analysis (PCA) algorithm, Linde-Buzo-Gray (LBG) algorithm, and evolutionary algorithms (EAs). The EAs include genetic algorithm (GA), particle swarm optimisation (PSO), honey bee mating optimisation (HBMO), and firefly algorithm (FF). The study is to provide performance comparisons between PCA-EA-LBG and PCA-LBG-EA approaches. The PCA-EA-LBG approaches contain PCA-GA-LBG, PCA-PSO-LBG, PCA-HBMO-LBG, and PCA-FF-LBG, while the PCA-LBG-EA approaches contain PCA-LBG, PCA-LBG-GA, PCA-LBG-PSO, PCA-LBG-HBMO, and PCA-LBG-FF. All training vectors of test images are grouped according to PCA. The PCA-EA-LBG used the vectors grouped by PCA as initial individuals, and the best solution gained by the EAs was given for LBG to discover a codebook. The PCA-LBG approach is to use the PCA to select vectors as initial individuals for LBG to find a codebook. The PCA-LBG-EA used the final result of PCA-LBG as an initial individual for EAs to find a codebook. The search schemes in PCA-EA-LBG first used global search and then applied local search skill, while in PCA-LBG-EA first used local search and then employed global search skill. The results verify that the PCA-EA-LBG indeed gain superior results compared to the PCA-LBG-EA, because the PCA-EA-LBG explores a global area to find a solution, and then exploits a better one from the local area of the solution. Furthermore the proposed PCA-EA-LBG approaches in designing VQ codebooks outperform existing approaches shown in the literature.  相似文献   

11.
Vector quantisation (VQ) has been extensively used as an effective image coding technique. One of the most important steps in the whole process is the design of the codebook. The codebook is generally designed using the LBG algorithm which uses a large training set of empirical data that is statistically representative of the images to be encoded. The LBG algorithm, although quite effective for practical applications, is computationally very expensive and the resulting codebook has to be recalculated each time the type of image to be encoded changes. Stochastic vector quantisation (SVQ) provides an alternative way for the generation of the codebook. In SVQ, a model for the image is computed first, and then the codewords are generated according to this model and not according to some specific training sequence. The SVQ approach presents good coding performance for moderate compression ratios and different type of images. On the other hand, in the context of synthetic and natural hybrid coding (SNHC), there is always need for techniques which may provide very high compression and high quality for homogeneous textures. A new stochastic vector quantisation approach using linear prediction which is able to provide very high compression ratios with graceful degradation for homogeneous textures is presented. Owing to the specific construction of the method, there is no block effect in the synthetised image. Results, implementation details, generation of the bit stream and comparisons with the verification model of MPEG-4 are presented which prove the validity of the approach. The technique has been proposed as a still image coding technique in the SNHC standardisation group of MPEG  相似文献   

12.
基于改进禁止搜索算法的矢量量化码书设计   总被引:9,自引:0,他引:9       下载免费PDF全文
本文提出了基于改进禁止搜索(TS)算法的矢量量化(VQ)码书设计方法.禁止搜索算法的关键是如何定义一个解以及如何在当前解的基础上生成邻域解.由于码书设计的两个优化准则是最邻近条件和聚类质心条件,本文提出了两种禁止搜索算法的解描述方案,其相应算法分别叫基于码书的禁止搜索(CB-TS)算法和基于聚类划分的禁止搜索(PB-TS)算法.为了提高禁止搜索算法的性能,文中在禁止搜索算法中融入了模拟退火(SA)机制.为了进一步提高码书性能,文中还将码书设计的传统LBG算法融入禁止搜索算法中.结果表明,基于禁止搜索的两种码书设计方案所生成的码书性能都比LBG算法有明显提高.  相似文献   

13.
针对在LBG算法中存在初始码书的选择极易影响码书训练的收敛速度和最终码书性能的缺陷,提出了一种基于微粒群的矢量量化码书设计算法.首先产生具有一定全局性特点的初始码书,然后再应用LBG算法进行优化得到同时具有局部特性的码书.实验结果验证了该算法的合理性.  相似文献   

14.
矢量量化(VQ)是语音识别中广泛应用的一种数据压缩和编码方法。在矢量量化的码本设计过程中,经典的LBG算法收敛速度快,但极易陷入局部最优,且初始码本的生成对最佳码本的设计影响很大。考虑到遗传算法(GA)是一种具有全局优化搜索能力的算法,提出了GA和LBG算法相结合的GA L算法来优化码本,改善了码本质量,并将其应用于非特定人汉语连续数字语音识别中。实验结果表明,GA L算法优于传统的LBG算法。  相似文献   

15.
基于Hadamard变换和K-means理论,针对Chen的初始码书设计算法的随机性较强和峰值信噪比(PSNR)不高这两个缺点,提出了一种改进的码书设计算法。本算法利用统计特征量的分类平均法生成初始码书,然后提高求质心的频率,每当一个训练矢量被分类到胞腔时,就求出相应胞腔的质心来代替原有的码字。该算法结合LBG算法的优点,调整后的码字代表了整个胞腔的特性,加速了码书的收敛速度,提升了码书的性能。仿真实验结果表明,较Chen的算法图像效果,即峰值信噪比(PSNR),平均提高了0.5 dB,在迭代次数较小时甚至达0.9 dB。  相似文献   

16.
在粒子群优化(Particle Swarm Optimization, PSO)和混合蛙跳算法(Shuffled Frog-Leaping Algorithm, SFLA)的基础上,该文提出了一种新的混合粒子对优化(Shuffled Particle-Pair Optimizer, SPPO)算法,应用于矢量量化的说话人识别。该算法将全局信息交换和局部深度搜索相结合寻求最佳的说话人码本。群体按适应值分为3个粒子对,每个粒子对由两个粒子构成,按先后顺序执行PSO算法中的速度位置更新和LBG算法以实现局部细致搜索,间隔一定的迭代次数通过SFLA混合策略实现粒子对间的信息交换,从而使群体向全局最优解靠近。实验结果表明,本算法始终稳定地取得显著优于LBG,FCM,FRLVQ-FVQ和PSO算法的说话人识别性能,较好地解决了初始码本影响的识别性能的问题,且在计算时间和收敛速度方面有相当的优势。  相似文献   

17.
LSF(线谱频率)码书的性能对合成语音质量有着重要影响.经典的LBG算法容易陷入局部最优,而目前的一些码书进化算法搜索空间较大、搜索效率不明显.本文提出了一种新型的基于对LSF矢量空间进行拉伸变化的混合进化码书优化算法.该算法编码空间与矢量同维,相对较小,便于优化操作.算法中引入EP中的变异操作对PSO位置、速度矢量进行控制,以提高优化搜索算法的效率.实验结果表明,本文算法有效地改善了码书性能.  相似文献   

18.
刘燕  郭英 《通信技术》2008,41(2):81-82,88
为了提高模拟退火算法的最终解的质量,文中对控制算法进程的冷却进度表进行了优化选取,尤其在控制马尔可夫链长方面,给出了依据算法搜索过程的反馈信息来控制马尔可夫链长的方法.将该算法与LBG算法相结合,应用于矢量量化图像编码,既保持了模拟退火对初始码书依赖性小、不容易陷入局部极值的优点,又具备LBG算法的易于实现和计算量小的特点.仿真实验表明,该算法提高了码书的编码性能.  相似文献   

19.
文中将频率敏感算法引入到基本的蚁群算法中,提出了一种改进的蚁群聚类码书设计算法。在提出的码书设计算法中采用LBG码书优化准则,引入了频率敏感算法。仿真实验表明,提出的算法避免了停滞现象发生,有效地提高了其全局搜索能力。  相似文献   

20.
基于遗传算法的最优码本设计   总被引:2,自引:0,他引:2  
矢量量化是一种极其重要的信号压缩方法,广泛应用于图像信号压缩、语音信号压缩等领域,它的主要问题是码本设计问题,传统的LBG算法及其改进算法只能获得局部最优的码本。本文详细讨论了如何利用遗传算法来获得全局最优的码本,目的应用于说话人识别。本文具体给出了基于遗传算法的最优码本设计算法的实现方法和实验结果。实验表明,本文实现的算法有效。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号