首页 | 本学科首页   官方微博 | 高级检索  
     

融合字位置特征的铁路事故命名实体识别
引用本文:陈业明,戴齐,刘捷.融合字位置特征的铁路事故命名实体识别[J].计算机系统应用,2022,31(12):211-219.
作者姓名:陈业明  戴齐  刘捷
作者单位:西南交通大学 计算机与人工智能学院, 成都 611756
基金项目:国家铁路集团有限公司科技研究开发重点课题(N2020S009)
摘    要:铁路事故的相关信息以事故概况文本的形式存在,对于铁路安全工作有重要意义.但由于缺乏有效的信息抽取手段,导致分散在文本中的铁路事故知识没有得到充分的利用.命名实体识别是信息抽取的重要子任务,目前关于事故领域的命名实体识别问题研究较少.针对铁路事故命名实体识别问题,提出一种融合字位置特征的命名实体识别模型,该模型通过全连接神经网络获取字的位置特征,并与语义层面的字向量合并作为字的最终向量表示输入BiLSTM-CRF模型获取最优标签序列.实验结果表明,模型在铁路事故文本命名实体识别问题上的准确率、召回率和F1值分别为93.29%、94.77%和94.02%,相比于传统模型,取得了更好的效果,为铁路事故知识图谱的构建奠定基础.

关 键 词:命名实体识别  铁路事故  字位置特征  双向长短期记忆网络(BiLSTM)  条件随机场  知识图谱  自然语言处理
收稿时间:2022/4/14 0:00:00
修稿时间:2022/5/22 0:00:00

Named Entity Recognition of Railway Accident Texts with Character Position Features
CHEN Ye-Ming,DAI Qi,LIU Jie.Named Entity Recognition of Railway Accident Texts with Character Position Features[J].Computer Systems& Applications,2022,31(12):211-219.
Authors:CHEN Ye-Ming  DAI Qi  LIU Jie
Affiliation:School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu 611756, China
Abstract:Relevant information of railway accidents, existing in the form of accident overview texts, is of great significance to railway safety work. However, due to the lack of effective information extraction methods, the knowledge of railway accidents scattered in the texts has not been fully utilized. Named entity recognition is an important subtask of information extraction, and there are few studies on named entity recognition of accidents. A named entity recognition model fused with character position features is proposed for the named entity recognition of railway accidents. The model obtains the character position features through a fully connected neural network. It merges them with the character vectors at the semantic level as the final vector representation of the characters, which is then input to the BiLSTM-CRF model to obtain the optimal label sequence. The experimental results show that the accuracy, recall, and F1 value of the model on the named entity recognition of railway accident texts are 93.29%, 94.77%, and 94.02% respectively. This model yields better effects than traditional models and lays a foundation for the construction of a railway accident knowledge graph.
Keywords:named entity recognition  railway accident  character position features  bidirectional long short-term memory (BiLSTM)  conditional random field  knowledge graph  natural language processing (NLP)
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号