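A Structure Analysis Model for Chinese Base Noun Phrases
Cite this article:ZHAO Jun, HUANG Chang-ning. 汉语基本名词短语结构分析模型[J]. 计算机学报, 1999, 22(2): 141-146
Authors:ZHAO Jun (赵军)  HUANG Chang-ning (黄昌宁)
Affiliations:1. Department of Computer Science and Technology, Tsinghua University, Beijing 100084
2. State Key Laboratory of Intelligent Technology and Systems, Tsinghua University, Beijing 100084
Abstract:This paper proposes a model that analyzes the structure of Chinese baseNPs using potential dependency relations between words. It has the following characteristics: ① dependency grammar knowledge is integrated into the probabilistic model, so that baseNP structure analysis proceeds under the guidance of dependency grammar, and the model outperforms a purely probabilistic model, the adjacency model; ② the algorithm for acquiring the potential dependency strength between words is based on the MDL principle, taking into account both the fit to the data and the generality of the model during model construction, and it outperforms the acquisition algorithm based on the maximum likelihood principle; ③ the acquisition algorithm operates on complex feature sets and can effectively …

Keywords:natural language processing  corpus  base noun phrase
Revision received:1998-02-17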

THE MODEL FOR CHINESE BASENP STRUCTURE ANALYSIS
ZHAO Jun,HUANG Chang-ning. THE MODEL FOR CHINESE BASENP STRUCTURE ANALYSIS[J]. Chinese Journal of Computers, 1999, 22(2): 141-146
Authors:ZHAO Jun  HUANG Chang-ning
Abstract:This paper proposes a model for Chinese baseNP structure analysis based on potential dependency relations between words. The model has the following characteristics: (1) dependency grammar is integrated into the statistical model, so that baseNP structure is analyzed under the guidance of dependency grammar, and the model outperforms a purely statistical model, the adjacency model; (2) the proposed algorithm for acquiring potential dependency strength is based on the MDL (minimum description length) principle, which considers both the fit to the data and the generality of the model, and it outperforms the traditional ML (maximum likelihood) based algorithm; (3) the acquisition algorithm operates on complex feature sets, so that the data sparseness problem is solved effectively. Experiments show that the proposed model is suitable for Chinese baseNP structure analysis.
Keywords:natural language processing  corpus  baseNP
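
The abstract contrasts a dependency-guided model with the adjacency model. As a rough illustration only, the sketch below shows the classical form of that contrast for a three-word noun sequence: the adjacency model compares the two adjacent word pairs, while a dependency-style model asks which later word the first word modifies. This is not the authors' model (their dependency strengths are acquired with an MDL-based algorithm over complex feature sets, which is not reproduced here). The function names, the `TOY_STRENGTH` table, and all numeric values are hypothetical and invented for the example.

```python
from typing import Dict, Tuple

# Hypothetical association strengths for (modifier, head) word pairs.
# The values are invented for illustration and do not come from the paper.
Strength = Dict[Tuple[str, str], float]

TOY_STRENGTH: Strength = {
    ("自然", "语言"): 0.8,
    ("语言", "处理"): 0.9,
    ("自然", "处理"): 0.1,
}


def adjacency_bracket(w1: str, w2: str, w3: str, s: Strength) -> str:
    """Adjacency model: compare the adjacent pairs (w1, w2) and (w2, w3).

    Returns "left" for [[w1 w2] w3] and "right" for [w1 [w2 w3]].
    """
    left = s.get((w1, w2), 0.0)
    right = s.get((w2, w3), 0.0)
    return "left" if left >= right else "right"


def dependency_bracket(w1: str, w2: str, w3: str, s: Strength) -> str:
    """Dependency-style model: decide whether w1 modifies w2 or w3.

    Returns "left" for [[w1 w2] w3] and "right" for [w1 [w2 w3]].
    """
    left = s.get((w1, w2), 0.0)
    right = s.get((w1, w3), 0.0)
    return "left" if left >= right else "right"


if __name__ == "__main__":
    words = ("自然", "语言", "处理")
    print("adjacency :", adjacency_bracket(*words, TOY_STRENGTH))
    print("dependency:", dependency_bracket(*words, TOY_STRENGTH))
```

With these toy values the two criteria disagree on the same word sequence, which is the kind of difference the comparison in the abstract is concerned with; real strengths would have to be estimated from a corpus.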
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.