首页 | 本学科首页   官方微博 | 高级检索  
     

层级分类概率句法分析
引用本文:代印唐,吴承荣,马胜祥,钟亦平.层级分类概率句法分析[J].软件学报,2011,22(2):245-257.
作者姓名:代印唐  吴承荣  马胜祥  钟亦平
作者单位:复旦大学,计算机科学技术学院,上海200433
基金项目:上海市科委、上海市人力资源与社会保障局博士后科研资助计划(10R21421400); 上海市科委项目(075115008)
摘    要:对已有的句法分析中引入知识的方法进行了归纳分析,认为多种句法分析方法都可被看作是基于特征标记的分类,然后分析了其中的欠分类和过分类问题.在此基础上,提出一种层级分类短语结构文法和一种层级分类概率句法分析方法(hierarchically classified probabilistic context-free grammar),并设计了一种通过对实例进行聚类来消除句法规则的分类歧义方法.还进一步将层级分类扩展到概率上下文相关句法分析方法,利用上下文相关性的层级分类来解决引入上下文相关时的数据稀疏性问题.通过上述一系列方法有效地克服了过分类与前分类之间的矛盾.

关 键 词:短语结构文法  概率句法分析  层级分类
收稿时间:2009/4/20 0:00:00
修稿时间:2009/8/12 0:00:00

Hierarchically Classified Probabilistic Grammar Parsing
DAI Yin-Tang,WU Cheng-Rong,MA Sheng-Xiang and ZHONG Yi-Ping.Hierarchically Classified Probabilistic Grammar Parsing[J].Journal of Software,2011,22(2):245-257.
Authors:DAI Yin-Tang  WU Cheng-Rong  MA Sheng-Xiang and ZHONG Yi-Ping
Affiliation:School of Computer Science and Technology, Fudan University, Shanghai 200433, China;School of Computer Science and Technology, Fudan University, Shanghai 200433, China;School of Computer Science and Technology, Fudan University, Shanghai 200433, China;School of Computer Science and Technology, Fudan University, Shanghai 200433, China
Abstract:This paper analyzed various existing approaches of structural grammar parsing, and addressed the problem of over-classification and under-classification. Then a hierarchically classified phase structure grammar (HC-PSG) and a hierarchically classified probabilistic context-free grammar (HC-PCFG) parsing are proposed to respond to this challenge. A measure of class clustering is designed to eliminate the classification ambiguity of grammar rules. The HC approach implements a general learning rule from a small number of phrase instances. An instant clustering method is used to disambiguate rules learned from corpus. The HC method is also extended to context sensitive grammar parsing to improve performance. It employs the classification of the context relevancy to handle the problem of corpus sparsity. By all the means, it can leverage the conflicts between under-classification and over-classification.
Keywords:phrase structure grammar  probabilistic grammar parsing  hierarchical classification
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号