Chinese Sentential Semantic Type Recognition Based on Predicate and Sentential Semantic Type Chunk
Citation: WANG Qian, LUO Senlin, HAN Lei, PAN Limin. Chinese Sentential Semantic Type Recognition Based on Predicate and Sentential Semantic Type Chunk[J]. Journal of Chinese Information Processing (中文信息学报), 2014, 28(2): 8-16.
Authors: WANG Qian, LUO Senlin, HAN Lei, PAN Limin
Affiliation: Lab of Information Security & Countermeasures Technology, School of Information & Electronics, Beijing Institute of Technology, Beijing 100081, China
Funding: National 242 Project (2005C48); Beijing Institute of Technology Science and Technology Innovation Program, Major Project Cultivation Fund (2011CX01015)
Abstract: From the perspective of modern Chinese semantics, sentential semantic types fall into four classes: simple, complex, compound, and multiple. As one way of describing sentential semantic structure as a whole, sentential semantic type recognition is an important step in the complete analysis of the semantic structure of Chinese sentences. This paper proposes a method, based on predicates and sentential semantic type chunks, that recognizes all four types. The method first uses the number of predicates in a sentence to identify some of the simple sentences. For the remaining sentences, it applies the C4.5 machine learning algorithm to obtain the number of maximal sentential semantic type chunks that the predicates in the sentence pass through, then combines this count with the top sentence node of the syntactic structure to decide the sentential semantic type. In an open test on 10221 sentences from the BFS-CTC (Beijing Forest Studio-Chinese Tag Corpus) annotated Chinese corpus, the overall recognition accuracy reaches 97.6%, laying a technical foundation for research based on modern Chinese semantics.

Keywords: sentential semantic type recognition; sentential semantic type; semantic analysis; natural language processing
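
The staged procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The names `Sentence`, `recognize_type`, `toy_count_chunks`, and `toy_decide` are hypothetical, as are the top-node labels `"flat"` and `"nested"`. The abstract does not give the predicate-count threshold or the final decision rule, so the "at most one predicate" test and the rule in `toy_decide` are placeholders. The C4.5 classifier is replaced by a plain callable so the sketch stays self-contained.

```python
from dataclasses import dataclass
from typing import Callable, List

SEMANTIC_TYPES = ("simple", "complex", "compound", "multiple")


@dataclass
class Sentence:
    predicates: List[str]  # predicates found in the sentence
    top_node: str          # top sentence node of the parse (hypothetical labels)


def recognize_type(
    sentence: Sentence,
    count_chunks: Callable[[Sentence], int],
    decide: Callable[[int, str], str],
) -> str:
    """Staged recognition: predicate count first, then chunk count plus top node."""
    # Stage 1 (assumed threshold): at most one predicate is treated as simple.
    if len(sentence.predicates) <= 1:
        return "simple"
    # Stage 2: a learned model (C4.5 in the paper, a stub here) gives the chunk count.
    n_chunks = count_chunks(sentence)
    # Stage 3: combine the chunk count with the top sentence node.
    label = decide(n_chunks, sentence.top_node)
    if label not in SEMANTIC_TYPES:
        raise ValueError(f"unknown sentential semantic type: {label!r}")
    return label


def toy_count_chunks(sentence: Sentence) -> int:
    # Stand-in for a trained classifier: pretend each predicate has its own chunk.
    return len(sentence.predicates)


def toy_decide(n_chunks: int, top_node: str) -> str:
    # Illustrative rule only; the abstract does not state the actual rule.
    if n_chunks <= 1:
        return "complex"
    return "multiple" if top_node == "nested" else "compound"


if __name__ == "__main__":
    example = Sentence(predicates=["say", "go"], top_node="flat")
    print(recognize_type(example, toy_count_chunks, toy_decide))
```

Passing the chunk counter and the decision rule as callables keeps the staging visible while leaving both unspecified pieces easy to swap for a trained model and a rule taken from the full paper.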
This article is indexed in CNKI and other databases.