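Chinese Sentence Parsing Based on Maximum Entropy Model
Cite this article: XU Yan-yong, ZHOU Xian-zhong, JING Xiang-he, GUO Zhong-wei. Chinese Sentence Parsing Based on Maximum Entropy Model[J]. Acta Electronica Sinica, 2003, 31(11): 1608-1612.
Authors: XU Yan-yong  ZHOU Xian-zhong  JING Xiang-he  GUO Zhong-wei
Affiliation: Department of Automation, Nanjing University of Science and Technology, Nanjing, Jiangsu 210094, China
Funding: National Natural Science Foundation of China (No. 60174028)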
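Abstract: Drawing on shallow parsing theory, this paper divides Chinese sentence parsing into three procedures: tagging, chunking, and build-and-check. Existing probabilistic evaluation models use few feature types and cannot fully exploit the contextual information that is useful for parsing. To address this, the paper proposes a probabilistic evaluation model based on maximum entropy that estimates the probability of each action in the parsing process. In this model, any information useful for parsing can serve as a feature. The paper defines a feature set and a training set with a static template structure, gives the corresponding feature selection strategy and a GIS-based parameter estimation algorithm, and uses the BFS algorithm to search efficiently for the candidate parse tree with the highest probability, which is taken as the final parsing result. Experimental results show that the model achieves high parsing efficiency and accuracy.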

Keywords: natural language processing  maximum entropy model  chunk  syntactic parsing  breadth-first search
Article ID: 0372-2112(2003)11-1608-05
Received: 2003-01-02
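
The abstract mentions GIS-based parameter estimation for the maximum entropy model but gives no details. The following is a minimal sketch of Generalized Iterative Scaling for a conditional maximum entropy model, not the authors' implementation. It assumes binary feature functions, and the names `train_gis`, `events`, `features`, and the toy chunk tags "B"/"I" are hypothetical choices made for illustration.

```python
import math

def train_gis(events, outcomes, features, iterations=100):
    """Fit a conditional maxent model p(y | x) with GIS.

    events:   list of (context, outcome) training pairs
    features: list of functions f(context, outcome) -> 0 or 1 (assumed binary)
    Returns a function prob(context) -> {outcome: probability}.
    """
    # GIS needs the feature values of every (context, outcome) pair to sum
    # to one constant C, so a slack feature pads each sum up to C.
    C = max(sum(f(x, y) for f in features) for x, _y in events for y in outcomes)

    def fvec(x, y):
        v = [f(x, y) for f in features]
        return v + [C - sum(v)]

    n = len(features) + 1
    lambdas = [0.0] * n

    def prob(x):
        scores = {y: math.exp(sum(l * v for l, v in zip(lambdas, fvec(x, y))))
                  for y in outcomes}
        z = sum(scores.values())
        return {y: s / z for y, s in scores.items()}

    # Empirical feature counts observed in the training events.
    emp = [0.0] * n
    for x, y in events:
        for j, v in enumerate(fvec(x, y)):
            emp[j] += v

    for _ in range(iterations):
        # Expected feature counts under the current model.
        model = [0.0] * n
        for x, _y in events:
            p = prob(x)
            for y in outcomes:
                for j, v in enumerate(fvec(x, y)):
                    model[j] += p[y] * v
        # GIS update; features never seen in training are left at weight 0
        # here (a simplification; feature selection would normally drop them).
        for j in range(n):
            if emp[j] > 0 and model[j] > 0:
                lambdas[j] += math.log(emp[j] / model[j]) / C
    return prob

# Toy data: context = part-of-speech tag of a word, outcome = chunk tag.
events = [("DT", "B")] * 3 + [("DT", "I")] + [("NN", "I")] * 2 + [("NN", "B")]
features = [
    lambda x, y: int(x == "DT" and y == "B"),
    lambda x, y: int(x == "NN" and y == "I"),
]
prob = train_gis(events, ["B", "I"], features)
```

On this toy data the fitted probabilities approach the relative frequencies in the training events, which is the behaviour expected of a maximum entropy model whose features cover each context.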

Chinese Sentence Parsing Based on Maximum Entropy Model
XU Yan-yong,ZHOU Xian-zhong,JING Xiang-he,GUO Zhong-wei.Chinese Sentence Parsing Based on Maximum Entropy Model[J].Acta Electronica Sinica,2003,31(11):1608-1612.
Authors:XU Yan-yong  ZHOU Xian-zhong  JING Xiang-he  GUO Zhong-wei
Affiliation: Department of Automation, Nanjing University of Science and Technology, Nanjing, Jiangsu 210094, China
Abstract: Shallow parsing theory is applied to partition Chinese sentence parsing into three procedures: TAG, CHUNK, and BUILD-and-CHECK. Existing probabilistic models use few feature types and cannot make full use of the contextual information that is useful for parsing. To address this, we present a probabilistic model based on maximum entropy that evaluates the probability of each action in the parsing procedures. In this model, any information in a context that is useful for parsing can serve as a feature. The features and training events are defined, and a feature selection strategy and a parameter estimation algorithm based on Generalized Iterative Scaling (GIS) are given. The final parsing result is the parse tree with the highest probability, found by breadth-first search (BFS). Experiments show that the model achieves high parsing efficiency and precision.
Keywords:natural language processing  maximum entropy models  chunk  sentence parsing  BFS
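
The abstract states that the highest-probability parse is found by breadth-first search over parsing actions. As a rough sketch of that idea, and not the authors' algorithm, the code below expands partial action sequences level by level and scores each by the product of its action probabilities. The pruning parameter `beam_width`, the function names, and the toy probability table are assumptions introduced here for illustration.

```python
import heapq
import math

def bfs_best_sequence(n_steps, actions, action_prob, beam_width=3):
    """Breadth-first search for the most probable action sequence.

    action_prob(history, action) returns the probability of taking `action`
    after the tuple `history`; at least one action must have nonzero
    probability at every step. Returns (log_probability, sequence).
    """
    frontier = [(0.0, ())]  # (log probability, actions taken so far)
    for _ in range(n_steps):
        candidates = []
        for logp, seq in frontier:
            for a in actions:
                p = action_prob(seq, a)
                if p > 0.0:
                    candidates.append((logp + math.log(p), seq + (a,)))
        # Keep only the best partial derivations at each level (assumed pruning).
        frontier = heapq.nlargest(beam_width, candidates)
    return max(frontier)

def toy_prob(history, action):
    """Hypothetical chunk-tag model: "I" cannot start a chunk."""
    if not history or history[-1] == "O":
        return {"B": 0.6, "I": 0.0, "O": 0.4}[action]
    return {"B": 0.2, "I": 0.5, "O": 0.3}[action]

best_logp, best_seq = bfs_best_sequence(3, ["B", "I", "O"], toy_prob)
```

In a parser of the kind the abstract describes, `action_prob` would be supplied by the trained maximum entropy model, and the sequence would cover tagging, chunking, and build-and-check actions rather than three toy tags.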
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.