
1.

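倪崇嘉 刘文举 徐波 《计算机工程》2011,37(23):20-23

To address the low accuracy of existing Chinese stress detection methods, this paper proposes an improved method that combines different classifiers over acoustic, lexical, and syntactic features, built on a complementary model of Boosting classification and regression trees plus conditional random fields. Experiments on the ASCCD corpus show that the method achieves a stress detection accuracy of 84.9%, an improvement of 2.7% over a baseline system based on a neural network plus decision tree.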

2.

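Automatic Detection of Chinese Prosodic Breaks Based on a Complementary Model

倪崇嘉 刘文举 徐波 《计算机科学》2011,38(12):242-246

Automatic detection and labeling of prosodic breaks plays a very important role in speech understanding and speech synthesis. This paper proposes a complementary-model method that uses acoustic, lexical, and syntactic features to detect Chinese prosodic breaks. The method has two advantages: (1) it drops the assumption that acoustic features are independent of lexical and syntactic features; (2) it exploits the context of the current syllable not only at the feature level but also at the model level. Experiments on the ASCCD corpus show that the method achieves a prosodic break detection accuracy of 90.34%, an improvement of 6.09% over the baseline system.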

3.

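Chinese Stress Detection Based on Sub-segment Concatenation Features

赵云雪 郑世杰 张珑 《计算机光盘软件与应用》2014,(13):99-101,92

Stress is an indispensable part of spoken communication and plays a very important role in it. Using the ASCCD read-discourse corpus, this paper extracts short-time spectral information based on sub-segment concatenation for each speech segment and builds two short-time spectral feature sets, one based on the MFCC algorithm and one based on the RASTA-PLP algorithm. A NaiveBayes classifier models both sub-segment concatenation feature sets; this classification approach makes full use of the speech characteristics of the current segment. On ASCCD, the sub-segment-concatenated MFCC and RASTA-PLP short-time spectral feature groups achieve Chinese stress detection accuracies of 82.1% and 80.8%, respectively. The experimental results show that the sub-segment concatenation feature normalization method can be used for Chinese stress detection.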

4.

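Chinese Stress Detection with Context-Fused Short-Time Spectral Features

赵云雪 张珑 郑世杰 《电脑学习》2014,(4)

Stress is an indispensable part of spoken communication and plays a very important role in it. Using the ASCCD read-discourse corpus, this paper applies the MFCC algorithm to extract context-fused, sub-segment-concatenated short-time spectral information for each speech segment and builds a contextual short-time spectral feature set based on MFCC. A NaiveBayes classifier models this feature set and assigns each object to the class with the maximum posterior probability; this classification approach makes full use of the speech characteristics of the current segment. On ASCCD, the context-fused MFCC short-time spectral feature group achieves a Chinese stress detection accuracy of 83.6%. The experimental results show that the context-fused sub-segment concatenation feature normalization method can be used for Chinese stress detection.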

5.

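Direct Acoustic Modeling of Fundamental Frequency Features in Chinese Speech Recognition (total citations: 1; self-citations: 1; citations by others: 0)

黄浩 哈力旦 《计算机工程与应用》2009,45(30):132-134

This paper proposes using hidden conditional random fields (HCRFs) to model discontinuous fundamental frequency (F0) sequences directly. The method targets a property of Chinese speech: F0 values are continuous in voiced segments but break off in unvoiced segments. It relies on an important property that distinguishes HCRFs from hidden Markov models (HMMs), namely that observations need not all be modeled in a uniform way, so discontinuous F0 values can be modeled acoustically together with continuous spectral feature observations. Tonal syllable classification experiments on a large-vocabulary speech corpus show that direct HCRF modeling of discontinuous F0 sequences gives a clearly higher recognition rate than using F0 features artificially smoothed across unvoiced segments. The paper also compares performance against HMM acoustic models trained with different discriminative criteria.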

6.

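Automatic Classification of Chinese Commas Based on Word Segmentation and Part-of-Speech Tagging

谷晶晶 周国栋 《计算机工程与应用》2015,51(18):120-125

In recent years, punctuation has gradually attracted researchers' attention as an important part of discourse. Research on the Chinese comma, however, has only just begun, and most existing methods build on syntactic parsing; no work yet classifies commas automatically from the surface information of Chinese sentences. This paper proposes a method that classifies commas automatically using word segmentation and part-of-speech tagging information, with two supervised machine learning classifiers: a maximum entropy classifier and a CRF classifier. Experiments on the CTB 6.0 corpus show that CRF performs better overall than maximum entropy, and that the accuracy of both classifiers is very close to that of methods based on syntactic parsing. This indicates that comma classification based on words and parts of speech is feasible.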

7.

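Chinese Prosodic Break Classification Based on the Prosodic Break Hierarchy

倪崇嘉 张爱英 刘文举 徐波 《计算机应用研究》2011,28(7):2452-2454

Given the hierarchical nature of prosodic breaks, this paper proposes a hierarchical prosodic break classification method. The method makes full use of both the hierarchy of prosodic structure and acoustic, lexical, and syntactic features to classify different types of prosodic breaks. In experiments on the prosodically annotated ASCCD corpus, the algorithm achieves an average detection accuracy of 78.25% on the combined test set.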

8.

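A Moving Shadow Detection Method Based on a Boosting Discriminative Model (total citations: 1; self-citations: 0; citations by others: 1)

查宇飞 楚瀛 王勋 马时平 毕笃彦 《计算机学报》2007,30(8):1295-1301

In video processing, moving shadows share properties with the moving foreground, so shadows can be mistaken for foreground during foreground extraction. In particular, when a shadow merges with other foreground regions, subsequent processing such as tracking and recognition can be seriously affected. This paper proposes a Boosting discriminative model for moving shadow detection. The method first uses Boosting to distinguish foreground from shadow in different feature spaces, then combines the spatio-temporal consistency of foreground and shadow in discriminative random fields (DRFs) to segment them. The procedure has three steps. First, differencing the foreground image and the background image yields a color-invariant subspace and a texture-invariant subspace. Next, Boosting is applied in these two subspaces to distinguish foreground from shadow. Finally, the spatio-temporal consistency of foreground and shadow is used in the DRF to segment the two accurately via graph cut. Experimental results show that the method outperforms traditional methods in both indoor and outdoor scenes.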

9.

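A Chinese Word Segmentation System Based on Conditional Random Fields (total citations: 6; self-citations: 1; citations by others: 6)

李双龙 刘群 王成耀 《微计算机信息》2006,22(28):178-180

Chinese word segmentation is the first basic task in natural language processing. This paper presents a Chinese word segmentation model based on conditional random fields (CRF). As a discriminative model, CRF can accommodate arbitrary non-independent features. Segmentation is first cast as a labeling process; the CRF model then labels each Chinese character; finally, the labels are converted into the corresponding segmentation. The system trains its parameters with the perceptron algorithm. Compared with earlier CRF-based segmentation models, the system defines and uses different feature functions and obtains better segmentation results. In the closed test on the PK test set of the 1st SIGHAN segmentation bakeoff, the F-score is 95.2%.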
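The last step of the pipeline in this abstract, turning per-character labels back into words, can be sketched as follows. This is only an illustration: the abstract does not name its tag set, so the common B/M/E/S scheme (begin, middle, end, single) is an assumption here, and the tag sequence is hard-coded in place of real CRF output.

```python
def tags_to_words(chars, tags):
    """Convert per-character B/M/E/S tags into a list of words.

    The B/M/E/S tag set is an assumption; the abstract does not say
    which labels the system uses.
    """
    if len(chars) != len(tags):
        raise ValueError("chars and tags must have the same length")
    words, current = [], ""
    for ch, tag in zip(chars, tags):
        current += ch
        # E (end of word) and S (single-character word) both close a word.
        if tag in ("E", "S"):
            words.append(current)
            current = ""
    if current:  # tolerate a tag sequence that ends mid-word
        words.append(current)
    return words


sentence = "我们喜欢语言"
tags = ["B", "E", "B", "E", "B", "E"]  # hypothetical tagger output
print(tags_to_words(sentence, tags))
```

Casting segmentation as tagging means any sequence labeler can be swapped in for the tagging step without changing this conversion.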

10.

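Chinese Stress Detection Using Short-Time Spectral Features (total citations: 1; self-citations: 0; citations by others: 1)

赵云雪 张珑 郑世杰 《计算机与生活》2014,(9):1120-1128

Stress is an indispensable part of spoken communication and plays a very important role in it. To verify how well short-time spectral feature sets based on auditory models work for Chinese stress detection, the MFCC (Mel frequency cepstrum coefficient) and RASTA-PLP (relative spectra perceptual linear prediction) algorithms are used to extract short-time spectral information from each speech segment, yielding one feature set based on MFCC and one based on RASTA-PLP. A NaiveBayes classifier models both feature sets and assigns each object to the class with the maximum posterior probability; this approach makes full use of the speech characteristics of the current segment. On ASCCD (annotated speech corpus of Chinese discourse), the MFCC-based and RASTA-PLP-based short-time spectral feature sets achieve Chinese stress detection accuracies of 82.1% and 80.8%, respectively. The experimental results show that both MFCC-based and RASTA-PLP-based short-time spectral features can be used for Chinese stress detection.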
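The maximum-posterior decision rule mentioned in this abstract can be sketched with a small naive Bayes classifier. This is not the authors' implementation: the Gaussian likelihood, the variance floor, the two-dimensional toy vectors, and the class names are all assumptions made for illustration, standing in for real MFCC or RASTA-PLP features.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y, var_floor=1e-6):
    """Estimate a prior plus per-feature mean and variance for each class."""
    by_class = defaultdict(list)
    for features, label in zip(X, y):
        by_class[label].append(features)
    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # var_floor (an assumption) keeps variances strictly positive.
        variances = [
            sum((v - m) ** 2 for v in col) / n + var_floor
            for col, m in zip(columns, means)
        ]
        model[label] = (n / len(X), means, variances)
    return model


def log_posteriors(model, x):
    """Unnormalized log posterior per class: log prior + sum of log likelihoods."""
    scores = {}
    for label, (prior, means, variances) in model.items():
        score = math.log(prior)
        for value, m, var in zip(x, means, variances):
            score += -0.5 * math.log(2 * math.pi * var)
            score -= (value - m) ** 2 / (2 * var)
        scores[label] = score
    return scores


def predict(model, x):
    """Return the class with the maximum posterior probability."""
    scores = log_posteriors(model, x)
    return max(scores, key=scores.get)


# Toy feature vectors, purely illustrative.
X = [[1.0, 2.0], [1.2, 1.8], [0.8, 2.2], [3.0, 0.5], [3.2, 0.7], [2.8, 0.4]]
y = ["unstressed"] * 3 + ["stressed"] * 3
model = fit_gaussian_nb(X, y)
print(predict(model, [3.1, 0.6]))
```

Working in log space avoids underflow when many feature likelihoods are multiplied, which matters for high-dimensional spectral features.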

11.

Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence

Ananthakrishnan S. Narayanan S.S. 《IEEE transactions on audio, speech, and language processing》2008,16(1):216-228

With the advent of prosody annotation standards such as tones and break indices (ToBI), speech technologists and linguists alike have been interested in automatically detecting prosodic events in speech. This is because the prosodic tier provides an additional layer of information over the short-term segment-level features and lexical representation of an utterance. As the prosody of an utterance is closely tied to its syntactic and semantic content in addition to its lexical content, knowledge of the prosodic events within and across utterances can assist spoken language applications such as automatic speech recognition and translation. On the other hand, corpora annotated with prosodic events are useful for building natural-sounding speech synthesizers. In this paper, we build an automatic detector and classifier for prosodic events in American English, based on their acoustic, lexical, and syntactic correlates. Following previous work in this area, we focus on accent (prominence, or "stress") and prosodic phrase boundary detection at the syllable level. Our experiments achieved a performance rate of 86.75% agreement on the accent detection task, and 91.61% agreement on the phrase boundary detection task on the Boston University Radio News Corpus.

12.

Textual Entailment Recognition Combining Syntactic Structure Transformation and Lexical Semantic Features

《计算机工程》2015,(9)


13.

Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework

Rangarajan Sridhar V.K. Bangalore S. Narayanan S.S. 《IEEE transactions on audio, speech, and language processing》2008,16(4):797-811

In this paper, we describe a maximum entropy-based automatic prosody labeling framework that exploits both language and speech information. We apply the proposed framework to both prominence and phrase structure detection within the Tones and Break Indices (ToBI) annotation scheme. Our framework utilizes novel syntactic features in the form of supertags and a quantized acoustic-prosodic feature representation that is similar to linear parameterizations of the prosodic contour. The proposed model is trained discriminatively and is robust in the selection of appropriate features for the task of prosody detection. The proposed maximum entropy acoustic-syntactic model achieves pitch accent and boundary tone detection accuracies of 86.0% and 93.1% on the Boston University Radio News corpus, and 79.8% and 90.3% on the Boston Directions corpus. The phrase structure detection through prosodic break index labeling provides accuracies of 84% and 87% on the two corpora, respectively. The reported results are significantly better than previously reported results and demonstrate the strength of the maximum entropy model in jointly modeling simple lexical, syntactic, and acoustic features for automatic prosody labeling.

14.

Tire Defect Detection Using Local and Global Features

XIANG Yuan-yuan 《计算机辅助绘图.设计与制造(英文版)》2013,(4):49-52

In this paper, we present a tire defect detection algorithm based on sparse representation. The dictionary learned from reference images can efficiently represent the test image. As the representation coefficients of normal images have a specific distribution, the local feature can be estimated by comparing representation coefficient distributions. Meanwhile, a coding length is used to measure the global features of the representation coefficients. The tire defect is located using both these local and global features. Experimental results demonstrate that the proposed method can accurately detect and locate tire defects.

15.

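Fast Object Detection Based on Local Edge Features

唐旭晟 陈丹 《计算机辅助设计与图形学学报》2011,23(11):1902-1907

To detect objects whose shape and scale vary widely in complex environments, this paper proposes an algorithm for fast object detection in images with complex backgrounds. The algorithm uses a new local edge matching feature that is computed quickly via the integral image technique. A machine learning algorithm automatically extracts local edge features from samples to build the object template, with no manual segmentation or manual screening. Experimental results on the UIUC general image test set show that the algorithm can, under translation, scale change, ...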
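The integral image technique that this abstract relies on for fast computation can be sketched in a few lines. This shows only the generic technique, not the paper's local edge matching feature; the 3x3 array and the function names are hypothetical.

```python
def integral_image(img):
    """Build a summed-area table with one extra row and column of zeros.

    ii[r][c] holds the sum of img[0:r][0:c], so ii has shape (h+1, w+1).
    """
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for r in range(h):
        row_sum = 0
        for c in range(w):
            row_sum += img[r][c]
            ii[r + 1][c + 1] = ii[r][c + 1] + row_sum
    return ii


def rect_sum(ii, top, left, bottom, right):
    """Sum over rows top..bottom-1 and columns left..right-1 in O(1)."""
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]


img = [
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9],
]
ii = integral_image(img)
print(rect_sum(ii, 1, 1, 3, 3))  # 5 + 6 + 8 + 9 = 28
```

After one pass to build the table, every rectangular sum costs four lookups regardless of rectangle size, which is what makes dense sliding-window features affordable.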

16.

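Road Sign Detection with Multi-Feature Fusion

周群 袁玉锦 《计算机仿真》2012,(6):332-335

This paper studies the localization accuracy of traffic signs. Localization and recognition based on a single feature are easily disturbed by environmental factors, so the paper proposes a traffic sign detection algorithm that performs edge detection with axially symmetric windows. First, axially symmetric sliding windows extract edge features in the horizontal and vertical directions, and connected components determine the initial sign positions. Candidate regions are then verified by color hue and by projecting the vertical histogram within each region to determine the final sign positions. Detecting traffic signs with several features combined gives more stable and effective results than single-feature methods. Experiments show that the multi-feature method accurately detects the regions containing traffic signs in complex scenes.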

17.

Classification of Battlefield Ground Vehicles Using Acoustic Features and Fuzzy Logic Rule-Based Classifiers (total citations: 3; self-citations: 0; citations by others: 3)

Hongwei Wu Jerry M. Mendel 《Fuzzy Systems, IEEE Transactions on》2007,15(1):56-72

In this paper, we demonstrate, through the multicategory classification of battlefield ground vehicles using acoustic features, how it is straightforward to directly exploit the information inherent in a problem to determine the number of rules, and subsequently the architecture, of fuzzy logic rule-based classifiers (FLRBC). We propose three FLRBC architectures, one non-hierarchical and two hierarchical (HFLRBC), conduct experiments to evaluate the performances of these architectures, and compare them to a Bayesian classifier. Our experimental results show that: 1) for each classifier the performance in the adaptive mode that uses simple majority voting is much better than in the non-adaptive mode; 2) all FLRBCs perform substantially better than the Bayesian classifier; 3) interval type-2 (T2) FLRBCs perform better than their competing type-1 (T1) FLRBCs, although sometimes not by much; 4) the interval T2 non-hierarchical and HFLRBC-series architectures perform the best; and 5) all FLRBCs achieve higher than the acceptable 80% classification accuracy.

18.

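Sentence Boundary Detection in Biomedical Literature Based on Contextual Word-Form Features (total citations: 1; self-citations: 0; citations by others: 1)

于中华 张容 唐常杰 左劼 张天庆 《小型微型计算机系统》2006,27(1):180-184

Given the characteristics of biomedical literature and the special requirements of information extraction, this paper proposes a sentence boundary detection algorithm based on contextual word-form features and supervised learning. Unlike algorithms designed for general written English, the proposed algorithm uses no special auxiliary word lists or grammatical-level features; it relies only on the word forms of context words to detect and disambiguate sentence boundaries. Using these features, a maximum entropy recognizer and a support vector machine recognizer were built and evaluated on Medline abstracts, reaching an accuracy above 99%. The results show that maximum entropy and support vector machines perform similarly on sentence boundary disambiguation. They also show that, for biomedical literature, lexical-level features alone, without auxiliary word lists or grammatical information such as part of speech, can match the performance that other algorithms reach on general written English with auxiliary word lists and part-of-speech information.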

19.

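A Hidden Markov Human Detection Method Based on Low-Level Image Features

徐翠 郑颖 汪增福 《模式识别与人工智能》2009,22(5)

This paper proposes a method for detecting humans in a single image. The method represents the human body with a hidden Markov model and estimates the image regions that generate a given body structure sequence, turning human detection into a hidden Markov decoding problem. First, the image is segmented with Mean-Shift and color information is used to find the regions belonging to the torso. Next, three low-level features (brightness, color, and edges) are combined to estimate feature matching probabilities, which yield candidate regions for the limbs. Finally, connection probabilities between candidate regions are estimated and a hidden Markov decoding algorithm finds the optimal body configuration. Experiments show that the method gives fairly satisfactory detection results for human images with varied poses against complex backgrounds. Unlike other detection methods, it does not merely return rectangular approximations of the body parts; it also produces a fairly complete segmentation of the human figure. It performs particularly well when image resolution is low, the human figure is small, and motion blur is present.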

20.

Holism, Conceptual-Role Semantics, and Syntactic Semantics

Rapaport William J. 《Minds and Machines》2002,12(1):3-59

This essay continues my investigation of 'syntactic semantics': the theory that, pace Searle's Chinese-Room Argument, syntax does suffice for semantics (in particular, for the semantics needed for a computational cognitive theory of natural-language understanding). Here, I argue that syntactic semantics (which is internal and first-person) is what has been called a conceptual-role semantics: The meaning of any expression is the role that it plays in the complete system of expressions. Such a 'narrow', conceptual-role semantics is the appropriate sort of semantics to account (from an 'internal', or first-person perspective) for how a cognitive agent understands language. Some have argued for the primacy of external, or 'wide', semantics, while others have argued for a two-factor analysis. But, although two factors can be specified (one internal and first-person, the other only specifiable in an external, third-person way), only the internal, first-person one is needed for understanding how someone understands. A truth-conditional semantics can still be provided, but only from a third-person perspective.