首页 | 本学科首页   官方微博 | 高级检索  
     

汉语概率型上下文无关语法的自动推导
引用本文:周强,黄昌宁.汉语概率型上下文无关语法的自动推导[J].计算机学报,1998,21(5):385-392.
作者姓名:周强  黄昌宁
作者单位:1. 智能技术与系统国家重点实验室,北京,100084
2. 清华大学计算机科学与技术系,北京,100084
基金项目:国家自然科学重点基金,中国博士后科学基金
摘    要:本文提出了一种汉语概率型上下文无关语法的自动推导方法,它在匹配分析机制上实现了无指导的EM迭代训练算法,并通过对训练语料的自动短语界定预处理以及在集成不同知识源基础上构造合适始规则集

关 键 词:语法推导  PCFG  语料库语言学  语言信息处理
修稿时间:1997年6月25日

AN INFERENCE APPROACH FOR CHINESE PROBABILISTIC CONTEXT-FREE GRAMMAR
ZHOU Qiang,HUANG Chang-ning.AN INFERENCE APPROACH FOR CHINESE PROBABILISTIC CONTEXT-FREE GRAMMAR[J].Chinese Journal of Computers,1998,21(5):385-392.
Authors:ZHOU Qiang  HUANG Chang-ning
Abstract:This paper proposes a new inference approach for Chinese probabilisticcontext-free grammar, which implements the EM algorithm based on the bracketmatching schemes. Two characteristics of the algorithm are as follows: 1) To pre-process the training texts with automatic constituent boundary prediction tools,which can provide stronger syntactic restriction upon training texts in lower compu-tational costs; 2) To develop an initial rule set by integrating different knowledgeresources, including a set of basic syntactic rules generated by an automatic gram-mar construction t00l and a set of special rules summarized by linguists or extractedfrom treebanks, and provide a better initialization for the learning process. There-fore, a linguistically-motivated and broad-coverage Chinese PCFG rule set can beeasily generated through this algorithm. Current experimental results prove goodlearning efficiency of this algorithm and high reliability of the generated rule set.
Keywords:Probabilistic context-free grammar  expectation-maximization algorithm  grammar inference
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号