首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于时空分析的事件抽取方法
引用本文:梁月仙,郭智. 一种基于时空分析的事件抽取方法[J]. 国外电子测量技术, 2017, 36(6): 36-40
作者姓名:梁月仙  郭智
作者单位:1.中国科学院空间信息处理与应用系统技术重点实验室,北京 100190;2.中国科学院电子学研究所,北京 100190;3.中国科学院大学,北京 100190,1.中国科学院空间信息处理与应用系统技术重点实验室,北京 100190;2.中国科学院电子学研究所,北京 100190
基金项目:国家高新技术研究与发展(2015AA7013033)项目资助
摘    要:新闻媒体及社交网站每天呈现大规模的时空文本数据,人们难以从中获取有价值的事件信息。针对前人方法依赖大量的标注数据,同时以孤立的方式考虑事件的时间要素和空间要素等的问题。该文提出一种基于时空分析的事件抽取方法,该方法首先引入数据立方体结构存储事件信息,用户可基于不同的时空粒度抽取出重要的事件;然后提出一种基于语义相似性的实时事件聚类算法,该聚类算法采用GloVe 模型训练词的语义关联性,使聚在同一事件类的事件元素具有强的语义关联性。在大量未标注的网络文本中,该方法取得了77.4%的F1值,表明了该方法能够实现时空分析下的事件抽取任务。

关 键 词:事件抽取  时空分析   实时聚类  可视化

Approach of event extraction based on spatio-temporal analysis
Liang Yuexian and Guo Zhi. Approach of event extraction based on spatio-temporal analysis[J]. Foreign Electronic Measurement Technology, 2017, 36(6): 36-40
Authors:Liang Yuexian and Guo Zhi
Affiliation:1.Key Laboratory of Technology in Geo-Spatial Information Processing and Application System ,Institute of Electronics, Chinese Academy of Sciences ,Beijing 100190,China; 2.Institute of Electronics ,Chinese Academy of Sciences ,Beijing 100190,China; 3.University of Chinese Academy of Sciences ,Beijing 100190,China and 1.Key Laboratory of Technology in Geo-Spatial Information Processing and Application System ,Institute of Electronics, Chinese Academy of Sciences ,Beijing 100190,China; 2.Institute of Electronics ,Chinese Academy of Sciences ,Beijing 100190,China
Abstract:News media and social networking sites present a great volume of spatio-temporal text data every day. It is difficult for people to get valuable event information. Aiming at the existing methods which depend on a large number of annotation data and consider the temporal elements and spatial elements of events in an isolated way, this paper proposes an event extraction approach based on spatio-temporal analysis. Firstly, a data cube structure was introduced to store event information. Users can extract important events based on different temporal and spatial granularity. Secondly, a real-time event clustering algorithm based on semantic similarity is proposed. In the clustering algorithm, a GloVe model was adopted to learn the vector space of words, so that the event elements which collected in the same event cluster have strong semantic relevance. According to a large number of unlabeled network text, the approach achieves a 77.4% value of F1 which indicates that it can achieve event extraction task based on spatio-temporal analysis.
Keywords:event extraction   real-time clustering   spatio-temporal analysis   visualization
点击此处可从《国外电子测量技术》浏览原始摘要信息
点击此处可从《国外电子测量技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号