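Chinese Temporal Phrase Recognition Based on Conditional Random Fields
Cite this article: ZHU Sha-sha, LIU Zong-tian, FU Jian-feng, ZHU Fang. Chinese Temporal Phrase Recognition Based on Conditional Random Fields[J]. Computer Engineering, 2011, 37(15): 164-167. DOI: 10.3969/j.issn.1000-3428.2011.15.052
Authors: ZHU Sha-sha  LIU Zong-tian  FU Jian-feng  ZHU Fang
Affiliation: School of Computer Engineering and Science, Shanghai University, Shanghai 200072, China
Funding: National Natural Science Foundation of China; Shanghai Leading Academic Discipline Project; Shanghai University Graduate Innovation Fund
Abstract: Traditional temporal phrase recognition methods locate the boundaries of temporal phrases in Chinese text inaccurately and handle long-distance dependencies poorly. To address this, a temporal phrase recognition method based on Conditional Random Fields (CRFs) is proposed. Temporal phrases are recognized with a machine learning approach: the linguistic features of temporal phrases in Chinese text, including lexical, syntactic, and contextual information, are analyzed; temporal phrases are divided into two types, date-type (time-denoting) and event-type (event-denoting); and three common word lists are constructed semi-automatically as external features. On this basis, CRFs, which can integrate features from different levels, are introduced, and recognition is recast as a sequence labeling problem. Experimental results show that the method achieves F1 scores of 95.70% on date-type temporal phrases and 85.75% on event-type temporal phrases.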

Keywords: Chinese temporal phrase  temporal phrase recognition  Conditional Random Fields  temporal information processing
Received: 2011-02-10
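
The abstract describes recasting temporal phrase recognition as sequence labeling over features drawn from several levels. A minimal sketch of what that setup could look like follows. The BIO tag scheme, the label names `DATE` and `EVENT`, the `TIME_SUFFIXES` word list, the feature names, and the example sentence are all illustrative assumptions; the abstract does not specify the paper's actual tag set, feature templates, or word lists.

```python
from typing import Dict, List, Tuple

# Hypothetical word list standing in for one of the three external lexicons;
# the actual lists are not given in the abstract.
TIME_SUFFIXES = {"后", "前", "时", "期间"}


def to_bio(n_tokens: int, spans: List[Tuple[int, int, str]]) -> List[str]:
    """Convert (start, end_exclusive, type) phrase spans into BIO tags."""
    tags = ["O"] * n_tokens
    for start, end, kind in spans:
        tags[start] = f"B-{kind}"
        for i in range(start + 1, end):
            tags[i] = f"I-{kind}"
    return tags


def token_features(tokens: List[str], i: int) -> Dict[str, object]:
    """Build a per-token feature dict mixing lexical, lexicon, and context cues."""
    tok = tokens[i]
    return {
        "word": tok,
        "has_digit": any(ch.isdigit() for ch in tok),
        "in_suffix_list": tok in TIME_SUFFIXES,
        "prev_word": tokens[i - 1] if i > 0 else "<BOS>",
        "next_word": tokens[i + 1] if i + 1 < len(tokens) else "<EOS>",
    }


# Assumed pre-segmented sentence: one date-type and one event-type phrase.
tokens = ["2008年", "5月", "12日", "，", "地震", "发生", "后", "，", "救援", "开始"]
spans = [(0, 3, "DATE"), (4, 7, "EVENT")]

tags = to_bio(len(tokens), spans)
features = [token_features(tokens, i) for i in range(len(tokens))]
```

Feature dicts and tag sequences of this shape are the usual training input for a linear-chain CRF toolkit, which would then learn to predict one tag per token.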

Chinese Temporal Phrase Recognition Based on Conditional Random Fields
ZHU Sha-sha, LIU Zong-tian, FU Jian-feng, ZHU Fang. Chinese Temporal Phrase Recognition Based on Conditional Random Fields[J]. Computer Engineering, 2011, 37(15): 164-167. DOI: 10.3969/j.issn.1000-3428.2011.15.052
Authors: ZHU Sha-sha  LIU Zong-tian  FU Jian-feng  ZHU Fang
Affiliation: School of Computer Engineering and Science, Shanghai University, Shanghai 200072, China
Abstract: Because temporal phrases take complex and diverse forms, traditional rule-based methods do not recognize them reliably. In Chinese text it is hard to match the exact boundaries of temporal phrases and to recognize long, multi-token temporal phrases with long-distance dependencies. To address these issues, this paper presents an approach to temporal phrase recognition based on Conditional Random Fields (CRFs), exploiting the model's ability to integrate features at different levels. Based on an analysis of the linguistic features of temporal phrases in Chinese text, such as lexical features, syntactic features, and context information, temporal phrases are divided into two types: time-denoting and event-denoting. Three common vocabularies are constructed semi-automatically as external features. Experimental results show F-measure scores of 95.70% for time-denoting temporal phrases and 85.75% for event-denoting temporal phrases.
Keywords: Chinese temporal phrase  temporal phrase recognition  Conditional Random Fields (CRFs)  temporal information processing
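
The F-measure figures reported in the abstract combine precision P and recall R as F1 = 2PR / (P + R). The sketch below shows one common way to compute it for phrase recognition, counting a predicted phrase as correct only when its boundaries and type both match the gold annotation. The exact-match criterion, the BIO tags, and the `DATE`/`EVENT` labels are assumptions for illustration; the abstract does not state the paper's scoring protocol.

```python
from typing import List, Set, Tuple

Span = Tuple[int, int, str]  # (start, end_exclusive, type)


def extract_spans(tags: List[str]) -> Set[Span]:
    """Collect phrase spans from a BIO tag sequence; stray I- tags are ignored."""
    spans: Set[Span] = set()
    start, kind = None, None
    for i, tag in enumerate(tags + ["O"]):  # trailing sentinel closes the last span
        continues = start is not None and tag == f"I-{kind}"
        if continues:
            continue
        if start is not None:
            spans.add((start, i, kind))
        if tag.startswith("B-"):
            start, kind = i, tag[2:]
        else:
            start, kind = None, None
    return spans


def f1_score(gold_tags: List[str], pred_tags: List[str]) -> float:
    """Exact-match phrase-level F1 = 2PR / (P + R)."""
    gold, pred = extract_spans(gold_tags), extract_spans(pred_tags)
    true_pos = len(gold & pred)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(pred)
    recall = true_pos / len(gold)
    return 2 * precision * recall / (precision + recall)


gold = ["B-DATE", "I-DATE", "I-DATE", "O", "B-EVENT", "I-EVENT", "I-EVENT", "O"]
pred = ["B-DATE", "I-DATE", "I-DATE", "O", "B-EVENT", "I-EVENT", "O", "O"]
score = f1_score(gold, pred)  # the event phrase boundary is wrong, so P = R = 0.5
```

Under this strict criterion a phrase whose right boundary is off by one token counts as both a false positive and a false negative, which is why boundary errors weigh heavily on the score.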
This article is indexed in databases including CNKI, VIP, and Wanfang Data.