首页 | 本学科首页   官方微博 | 高级检索  
     

汉语中人称代词的消解研究
引用本文:王厚峰,何婷婷. 汉语中人称代词的消解研究[J]. 计算机学报, 2001, 24(2): 136-143
作者姓名:王厚峰  何婷婷
作者单位:1. 北京大学计算机科学与技术系 中国科学院声学研究所
2. 华中师范大学计算机科学系
基金项目:国家“九七三”项目基金! (G19980 30 5 0 6 ),武汉“青年科技晨光计划”资助!(995 0 0 40 91G)资助
摘    要:人称代词的消解是自然语言处理中十分重要的问题,人称代词消解,就是确定人称代词与先行语之间的相互关系,从而明确人称代词究竟指代什么对象,现有的许多应用系统,如文本摘要、信息抽取等采取了从文本中直接抽取句子的做法,而结果可能会含有某些无先行语的人称代词,使理解变得非常困难,人称代词消解无疑可以解决类似的问题。该文主要结合句类基本知识,根据人称代词所在语义块中的语义角色和人称代词对应的先行语可能的语义角色,给出了消解人称代词的基本规则。同时,作者也从句法的角度,结合局部焦点法给出了优选性规则。

关 键 词:句类 语义块 人称代词 指代消解 自然语言处理 知识约束
修稿时间:2000-03-09

Research on Chinese Pronominal Anaphora Resolution
WANG Hou Feng ),) HE Ting Ting ) ). Research on Chinese Pronominal Anaphora Resolution[J]. Chinese Journal of Computers, 2001, 24(2): 136-143
Authors:WANG Hou Feng )  ) HE Ting Ting ) )
Affiliation:WANG Hou Feng 1),2) HE Ting Ting 3) 1)
Abstract:Natural Language provides people with rich mechanisms for varying the expression of the same meaning. However. It also challenges computational researchers to do with a great number of nontrivial problems, one of them is anaphora resolution. In some application fields of discourse processing, such as Automatic Summarization, Information Retrieval, and Information Extraction etc., the final processing result is usually generated by directly extracting some representative sentences from text or document, in which, some pronouns could be contained without their antecedents. So, It is very important to resolve pronominal anaphora in Natural Language Processing(NLP), and that has attracted the attention of increasing researchers. Many approaches developed offer approximation solutions, including principle based such as purely syntactic ones to semantic and pragmatic ones, and statistics based ones. In this paper, sentences category(SC) based resolution tactics of Chinese pronominal anaphora is presented. In this method, the semantic role of pronoun and potential objects will be analyzed, and antecedent will be chosen from these objects according to the semantic relation between them and pronoun. SC is a semantic category of sentences within HNC(Hierarchical Network of Concept) Pattern. According to the HNC theory, every sentence can be mapped into one and only SC and the number of SC in Natural Language is determinate; Every SC consists of definite primary semantic chunk and indefinite secondary semantic chunk, and there exist conceptual constraints between primary semantic chunks in a sentence, as well as between primary semantic chunk and parts of secondary semantic chunks. In order to resolve anaphora, constraint rules and preference rules are given in this paper, which are related to the above conceptual constraint, and four kinds of anaphora relations will be discussed as follow: (1) in one semantic chunk; (2) between two primary semantic chunks; (3)primary semantic chunk and secondary semantic chunk; and (4) between two semantic chunks in different sentences.
Keywords:sentences category   semantic chunk   pronoun   antecedent   pronominal anaphora resolution
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号