首页 | 本学科首页   官方微博 | 高级检索  
     


A low latency modular-level deeply integrated MFCC feature extraction architecture for speech recognition
Affiliation:1. Scientific Secretary, ISRO, Bengaluru, India;2. Department of ECE, NIT, Tiruchirappalli, India;3. Avionics Entity, VSSC, Department of Space, Thiruvananthapuram, India
Abstract:In this paper, a low-complex chip to extract the Mel Frequency Cepstral Coefficient for a speech recognition system is presented. The architecture can operate in a continuous-flow manner to process streaming or the stored speech signal at high speed. The frame-overlap Hamming window, DFT and Mel-filter bank computations are deeply integrated to share memory buffers and avoid bit-reversal circuit to reduce area and latency. Moreover, normalised energy consumption and area delay product are reduced by 32%, and speed is increased by 5.2 times compared to prior works. Further, the fixed-point word-length is optimised to minimise the area without affecting the accuracy.
Keywords:Automatic speech recognition  MFCC  ASIC  FPGA  Fast fourier transform
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号