
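Chinese Named Entity Recognition Based on Part of Speech Feature with Edges
Cite this article: QIU Sha, WANG Fu-yan, SHEN Hao-ru, DUAN Bo, A Yuan, DING Hai-yan. Chinese Named Entity Recognition Based on Part of Speech Feature with Edges [J]. Computer Engineering, 2012, 38(13): 128-130.
Authors: QIU Sha  WANG Fu-yan  SHEN Hao-ru  DUAN Bo  A Yuan  DING Hai-yan
Affiliations: 1. Institute of Information Technology, Kunming University, Kunming 650214; School of Computer Science, Fudan University, Shanghai 201203
2. Institute of Information Technology, Kunming University, Kunming 650214
3. Institute of Information, Yunnan University, Kunming 650091
Funding: Supported by the Kunming University Scientific Research Project Fund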
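Abstract: Considering the ways part-of-speech (PoS) information can be expressed as a feature in the task, this paper studies the use of PoS features in Chinese named entity recognition at the character level, based on the conditional random field (CRF) model, and proposes a method that merges PoS and word boundary into a single feature item. In the same experimental environment, multiple Chinese named entity recognition experiments covering several ways of applying PoS features are run on a public corpus using sequence labeling. A comparative analysis of the results shows that the feature combining second-level PoS with word boundary is the best in both system execution performance and recognition quality.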

Keywords: Chinese named entity recognition  conditional random fields  feature template  part of speech  word boundary  label set
Received: 2011-08-23

Chinese Named Entity Recognition Based on Part of Speech Feature with Edges
QIU Sha, WANG Fu-yan, SHEN Hao-ru, DUAN Bo, A Yuan, DING Hai-yan. Chinese Named Entity Recognition Based on Part of Speech Feature with Edges [J]. Computer Engineering, 2012, 38(13): 128-130.
Authors: QIU Sha  WANG Fu-yan  SHEN Hao-ru  DUAN Bo  A Yuan  DING Hai-yan
Affiliation: 1. Institute of Information Technology, Kunming University, Kunming 650214, China; 2. Institute of Information, Yunnan University, Kunming 650091, China; 3. School of Computer Science, Fudan University, Shanghai 201203, China
Abstract: Considering the ways Part of Speech (PoS) can be expressed as a feature in the task, the application of PoS to Chinese named entity recognition is discussed at the character level, based on Conditional Random Fields (CRFs). A method that combines PoS and word-edge into a single feature item is put forward. Using sequence labeling on a common corpus, multiple Chinese named entity recognition experiments are conducted in the same experimental environment with several applications of PoS features. The results show that the combination of second-level PoS and word-edge achieves the best system performance and the best recognition of Chinese named entities.
Keywords:Chinese named entity recognition  Conditional Random Fields(CRFs)  feature template  Part of Speech(PoS)  word-edge  label set
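
The idea of merging PoS and word-edge into one feature item at the character level can be illustrated with a minimal sketch. It is not the authors' implementation: the B/M/E/S position scheme, the "position-PoS" string format, the helper names boundary_tags and char_features, and the sample sentence with its tags are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import List, Tuple


def boundary_tags(word_len: int) -> List[str]:
    """Position tag for each character of a word (assumed B/M/E/S scheme)."""
    if word_len == 1:
        return ["S"]  # single-character word
    return ["B"] + ["M"] * (word_len - 2) + ["E"]


def char_features(tagged_words: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Expand (word, pos) pairs into (character, combined feature) rows.

    The combined feature joins the character's position in its word with
    the word's PoS tag, so one column carries both kinds of information.
    """
    rows = []
    for word, pos in tagged_words:
        for ch, position in zip(word, boundary_tags(len(word))):
            rows.append((ch, f"{position}-{pos}"))
    return rows


# Hypothetical segmented and PoS-tagged input; the tags are illustrative.
sentence = [("我", "r"), ("在", "p"), ("昆明", "ns"), ("工作", "v")]
for ch, feature in char_features(sentence):
    print(f"{ch}\t{feature}")
```

Each output row pairs one character with a single combined feature, a one-token-per-line layout that a sequence labeler such as a CRF could consume alongside the entity labels. Keeping PoS and boundary in one column, rather than two, is the design choice the abstract reports as most effective when second-level PoS tags are used.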
This article is indexed in CNKI, Wanfang Data, and other databases.