期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Comparing ANN to HMM in implementing limited Arabic vocabulary ASR systems

Yousef Ajami Alotaibi 《International Journal of Speech Technology》2012,15(1):25-32

In this paper we investigated Artificial Neural Networks (ANN) based Automatic Speech Recognition (ASR) by using limited Arabic vocabulary corpora. These limited Arabic vocabulary subsets are digits and vowels carried by specific carrier words. In addition to this, Hidden Markov Model (HMM) based ASR systems are designed and compared to two ANN based systems, namely Multilayer Perceptron (MLP) and recurrent architectures, by using the same corpora. All systems are isolated word speech recognizers. The ANN based recognition system achieved 99.5% correct digit recognition. On the other hand, the HMM based recognition system achieved 98.1% correct digit recognition. With vowels carrier words, the MLP and recurrent ANN based recognition systems achieved 92.13% and 98.06, respectively, correct vowel recognition; but the HMM based recognition system achieved 91.6% correct vowel recognition. 相似文献

2.

Recognition of Bangla compound characters using structural decomposition

Soumen Bag Gaurav Harit Partha Bhowmick 《Pattern recognition》2014

In this paper we propose a novel character recognition method for Bangla compound characters. Accurate recognition of compound characters is a difficult problem due to their complex shapes. Our strategy is to decompose a compound character into skeletal segments. The compound character is then recognized by extracting the convex shape primitives and using a template matching scheme. The novelty of our approach lies in the formulation of appropriate rules of character decomposition for segmenting the character skeleton into stroke segments and then grouping them for extraction of meaningful shape components. Our technique is applicable to both printed and handwritten characters. The proposed method performs well for complex-shaped compound characters, which were confusing to the existing methods. 相似文献

3.

利用空间相关性的改进HMM模型 总被引：1，自引：0，他引：1

苏腾荣吴及王作英吕萍《计算机工程与设计》2010,31(5)

语音识别领域中所采用的经典HMM模型,忽略了语音信号间的相关信息.针对这一问题,利用语音信号的空间相关性对经典HMM模型进行补偿,得到一种改进模型.该方法通过空间相关变换,描述了当前语音特征与历史数据之间的空间相关性,从而对联合状态输出分布进行建模.改进模型的解码算法利用空间相关性变换的参数更新算法在经典ⅧⅥM的解码算法基础上得到.实验结果表明,上述方法在说话人无关连续语音识别系统上获得了明显的性能改进. 相似文献

4.

Robust visual speakingness detection using bi-level HMM

P. Tiawongsombat Mun-Ho Jeong Joo-Seop Yun Bum-Jae You Sang-Rok Oh 《Pattern recognition》2012,45(2):783-793

Visual voice activity detection (V-VAD) plays an important role in both HCI and HRI, affecting both the conversation strategy and sync between humans and robots/computers. The typical speakingness decision of V-VAD consists of post-processing for signal smoothing and classification using thresholding. Several parameters, ensuring a good trade-off between hit rate and false alarm, are usually heuristically defined. This makes the V-VAD approaches vulnerable to noisy observation and changes of environment conditions, resulting in poor performance and robustness to undesired frequent speaking state changes. To overcome those difficulties, this paper proposes a new probabilistic approach, naming bi-level HMM and analyzing lip activity energy for V-VAD in HRI. The designing idea is based on lip movement and speaking assumptions, embracing two essential procedures into a single model. A bi-level HMM is an HMM with two state variables in different levels, where state occurrence in a lower level conditionally depends on the state in an upper level. The approach works online with low-resolution image and in various lighting conditions, and has been successfully tested in 21 image sequences (22,927 frames). It achieved over 90% of probabilities of detection, in which it brought improvements of almost 20% compared to four other V-VAD approaches. 相似文献

5.

Segmentation-free word spotting in historical Bangla handwritten document using Wave Kernel Signature

Das Sugata Mandal Sekhar 《Pattern Analysis & Applications》2020,23(2):593-610

Pattern Analysis and Applications - In this paper, we present a segmentation-free word spotting method based on Wave Kernel Signature (WKS) under the foundation of quantum mechanics. The query word... 相似文献

6.

An interoperable context sensitive model of trust 总被引：2，自引：0，他引：2

Indrakshi Ray Indrajit Ray Sudip Chakraborty 《Journal of Intelligent Information Systems》2009,32(1):75-104

Although the notion of trust is widely used in secure information systems, very few works attempt to formally define it or reason about it. Moreover, in most works, trust is defined as a binary concept—either an entity is completely trusted or not at all. Absolute trust on an entity requires one to have complete knowledge about the entity. This is rarely the case in real-world applications. Not trusting an entity, on the other hand, prohibits all communications with the entity rendering it useless. In short, treating trust as a binary concept is not acceptable in practice. Consequently, a model is needed that incorporates the notion of different degrees of trust. We propose a model that allows us to formalize trust relationships. The trust relationship between a truster and a trustee is associated with a context and depends on the experience, knowledge, and recommendation that the truster has with respect to the trustee in the given context. We show how our model can measure trust and compare two trust relationships in a given context. Sometimes enough information is not available about a given context to evaluate trust. Towards this end we show how the relationships between different contexts can be captured using a context graph. Formalizing the relationships between contexts allows us to extrapolate values from related contexts to approximate the trust of an entity even when all the information needed to calculate the trust is not available. Finally, we show how the semantic mismatch that arises because of different sources using different context graphs can be resolved and the trust of information obtained from these different sources compared.

Sudip ChakrabortyEmail:

相似文献

7.

基于上下文相关的软件体系结构求精方法

申利民马川王建龙牛景春彭思维《计算机工程与设计》2009,30(7)

为了提高软件体系结构求精的精确性与可追溯性,使处于不同抽象层次之间的体系结构之间形成规范的映射体系,引入了形式化方法,定义了一种基于上下文相关文法的形式化的求精文法,并将该文法应用到体系结构求精中,给出了基于构件的体系结构形式化求精过程.最后,基于体系结构求精方法建立了相应的用于指导软件开发的模型. 相似文献

8.

Filterbank optimization for robust ASR using GA and PSO

R. K. Aggarwal M. Dave 《International Journal of Speech Technology》2012,15(2):191-201

Automatic speech recognition (ASR) systems follow a well established approach of pattern recognition, that is signal processing based feature extraction at front-end and likelihood evaluation of feature vectors at back-end. Mel-frequency cepstral coefficients (MFCCs) are the features widely used in state-of-the-art ASR systems, which are derived by logarithmic spectral energies of the speech signal using Mel-scale filterbank. In filterbank analysis of MFCC there is no consensus for the spacing and number of filters used in various noise conditions and applications. In this paper, we propose a novel approach to use particle swarm optimization (PSO) and genetic algorithm (GA) to optimize the parameters of MFCC filterbank such as the central and side frequencies. The experimental results show that the new front-end outperforms the conventional MFCC technique. All the investigations are conducted using two separate classifiers, HMM and MLP, for Hindi vowels recognition in typical field condition as well as in noisy environment. 相似文献

9.

Handwritten Bangla city name word recognition using CNN-based transfer learning and FCN

Pramanik Rahul Bag Soumen 《Neural computing & applications》2021,33(15):9329-9341

相似文献

10.

用隐马尔柯夫模型对汉语进行切分和标注排歧 总被引：8，自引：2，他引：6

刘颖《计算机工程与设计》2001,22(4):58-62,68

对汉语进行切分和标注,不可避免要产生歧义,文中对切分和标注阶段采用相同的模型-隐马尔柯夫模型（HMM）来消歧,在切分阶段,使用基于HMM的切分评分,而在标沐阶段,使用基于HMM的词汇评分,并按最大可能原理和多结果输出原理进行词汇评分实验,实验结果表明,用HMM对汉语进行标注排歧,正确率很高。相似文献

11.

一种基于HMM的维吾尔文联机手写识别的方法

陈晓娇哈力木拉提·买买提《计算机工程与应用》2013,(24):175-178,237

在维吾尔文联机手写识别过程的训练阶段,单词被切分成字母,经过特征提取和聚类形成特征向量作为模型的输入。构造出以字符为基元的隐马尔可夫模型（HMM）,将其嵌入到识别字典网络中。通过基于HMM的分类识别器,最终得到识别结果。首次将消除延迟笔画、建立有延迟笔画和无延迟笔画的字典的方法应用于维吾尔文手写识别中,取得了较高的识别率。相似文献

12.

基于局部敏感直方图的时空上下文跟踪

葛骁倩陈秀宏傅俊鹏《传感器与微系统》2017,36(1)

针对当前目标跟踪算法在目标区域光照剧烈变化、长时间遮挡或者平面内旋转时会发生偏移甚至跟丢这一现象,提出了基于局部敏感直方图的时空上下文跟踪算法.该算法以贝叶斯框架为基础,利用生物视觉特性,结合底层灰度特征,基于局部敏感直方图提取光照不变特征,建立目标与背景的统计相关模型来实现跟踪,使跟踪时偏移较小且不会跟丢目标.在对不同视频序列的实验表明:基于局部敏感直方图的时空上下文算法和多示例学习算法相比,在光照变化、平面内旋转或者遮挡时都表现出比较好的跟踪效果且中心误差较小,具有较强鲁棒性. 相似文献

13.

Formal systems and analysis of context sensitive languages

Ghandour Z. J. 《Computer Journal》1972,15(3):229-237

相似文献

14.

A context sensitive line finder for recognition of polyhedra

Yoshiaki Shirai 《Artificial Intelligence》1973,4(2):95-119

A program to recognize polyhedra by a context sensitive line finder is presented. The program is based on the strategy of recognizing objects step by step, at each time making use of the previous results. At each stage, the most obvious and simple assumption is made and the assumption is tested. To find a line segment, a range of search is proposed. Once a line segment is found, more of the line is determined by tracking along it. Whenever a new fact is found, the program tries to reinterpret the scene taking the obtained information into consideration. Results of the experiment using an image dissector are satisfactory for scenes containing a few blocks and wedges. Some limitations of the present program and proposals for future developments are described. 相似文献

15.

A novel context sensitive multilevel thresholding for image segmentation

《Applied Soft Computing》2014

Most of the traditional histogram-based thresholding techniques are effective for bi-level thresholding and unable to consider spatial contextual information of the image for selecting optimal threshold. In this article a novel thresholding technique is presented by proposing an energy function to generate the energy curve of an image by taking into an account the spatial contextual information of the image. The behavior of this energy curve is very much similar to the histogram of the image. To incorporate spatial contextual information of the image for threshold selection process, this energy curve is used as an input of our technique instead of histogram. Moreover, to mitigate multilevel thresholding problem the properties of genetic algorithm are exploited. The proposed algorithm is evaluated on the number of different types of images using a validity measure. The results of the proposed technique are compared with those obtained by using histogram of the image and also with an existing genetic algorithm based context sensitive technique. The comparisons confirmed the effectiveness of the proposed technique. 相似文献

16.

Binary object extraction using bi-directional self-organizing neural network (BDSONN) architecture with fuzzy context sensitive thresholding

Siddhartha Bhattacharyya Paramartha Dutta Ujjwal Maulik 《Pattern Analysis & Applications》2007,10(4):345-360

A novel neural network architecture suitable for image processing applications and comprising three interconnected fuzzy layers of neurons and devoid of any back-propagation algorithm for weight adjustment is proposed in this article. The fuzzy layers of neurons represent the fuzzy membership information of the image scene to be processed. One of the fuzzy layers of neurons acts as an input layer of the network. The two remaining layers viz. the intermediate layer and the output layer are counter-propagating fuzzy layers of neurons. These layers are meant for processing the input image information available from the input layer. The constituent neurons within each layer of the network architecture are fully connected to each other. The intermediate layer neurons are also connected to the corresponding neurons and to a set of neighbors in the input layer. The neurons at the intermediate layer and the output layer are also connected to each other and to the respective neighbors of the corresponding other layer following a neighborhood based connectivity. The proposed architecture uses fuzzy membership based weight assignment and subsequent updating procedure. Some fuzzy cardinality based image context sensitive information are used for deciding the thresholding capabilities of the network. The network self organizes the input image information by counter-propagation of the fuzzy network states between the intermediate and the output layers of the network. The attainment of stability of the fuzzy neighborhood hostility measures at the output layer of the network or the corresponding fuzzy entropy measures determine the convergence of the network operation. An application of the proposed architecture for the extraction of binary objects from various degrees of noisy backgrounds is demonstrated using a synthetic and a real life image.

Ujjwal MaulikEmail:

相似文献

17.

A writer identification and verification system using HMM based recognizers

Andreas Schlapbach Horst Bunke 《Pattern Analysis & Applications》2007,10(1):33-43

相似文献

18.

Face recognition using the embedded HMM with second-order block-specific observations

Min-Sub KimAuthor Vitae Sang-Youn Lee^{Author Vitae} 《Pattern recognition》2003,36(11):2723-2735

The paper is concerned with face recognition using the embedded hidden Markov model (EHMM) with second-order block-specific observations. The proposed method partitions a face image into a 2-D lattice type, composed of many blocks. Each block is represented by the second-order block-specific observation that consists of a combination of first- and second-order feature vectors. The first-order (or second-order) feature vector is obtained by projecting the original (or residual) block image onto the first (or second) basis vector that is obtained block-specifically by applying the PCA to a set of original (or residual) block images. A sequence of feature vectors obtained from the top-to-bottom and the left-to-right scanned blocks are used as an observation sequence to train EHMM. The EHMM models the face image in a hierarchical manner as follows. Several super states are used to model the vertical facial features such as the forehead, eyes, nose, mouth, and chin, and several states in the super state are used to model the localized features in a vertical face feature. Recognition is performed by identifying the person of the model that provides the highest value of observation probability. Experimental results show that the proposed recognition method outperforms many existing methods, such as the second-order eigenface method, the EHMM with DCT observations, and the second-order eigenface method using a confidence factor in terms of average of the normalized modified retrieval rank and false identification rate. 相似文献

19.

HMM based soccer video event detection using enhanced mid-level semantic

Xueming Qian Huan Wang Guizhong Liu Xingsong Hou 《Multimedia Tools and Applications》2012,60(1):233-255

Highlight detection is a fundamental step in semantics based video retrieval and personalized sports video browsing. In this paper, an effective hidden Markov models (HMMs) based soccer video event detection method based on a hierarchical video analysis framework is proposed. Soccer video shots are classified into four coarse mid-level semantics: global, median, close-up and audience. Global and local motion information is utilized for the refinement of coarse mid-level semantics. Sequential soccer video is segmented into event clips. Both the temporal transitions of the mid-level semantics and the overall features of an event clip are fused using HMMs to determine the type of event. Highlight detection performance of dynamic Bayesian networks (DBN), conditional random fields (CRF) and the proposed HMM based approach are compared. The average F-score of our highlights (including goal, shoot, foul and placed kick) detection approach is 82.92%, which outperforms that of DBN and CRF by 9.85% and 11.12% respectively. The effects of number of hidden states, overall features, and the refinement of mid-level semantics on the event detection performance are also discussed. 相似文献

20.

基于HMM和GMM的维吾尔语联机手写体识别研究

许辉热依曼.吐尔逊吾守尔.斯拉木《计算机工程与应用》2014,(11):202-205,222

给出了一个基于HMM和GMM双引擎识别模型的维吾尔语联机手写体整词识别系统。在GMM部分,系统提取了8-方向特征,生成8-方向特征样式图像、定位空间采样点以及提取模糊的方向特征。在对模型精细化迭代训练之后,得到GMM模型文件。HMM部分,系统采用了笔段特征的方法来获取笔段分段点特征序列,在对模型进行精细化迭代训练后,得到HMM模型文件。将GMM模型文件和HMM模型文件分别打包封装再进行联合封装成字典。在第一期的实验中,系统的识别率达到97%,第二期的实验中,系统的识别率高达99%。相似文献