首页 | 本学科首页   官方微博 | 高级检索  
     

基于触发词指导的自相似度聚类事件检测
引用本文:张先飞,郭志刚,刘嵩,程磊,田雨暄.基于触发词指导的自相似度聚类事件检测[J].计算机科学,2010,37(3):212-214220.
作者姓名:张先飞  郭志刚  刘嵩  程磊  田雨暄
作者单位:1. 解放军信息工程大学信息工程学院,郑州,450002
2. 中国人民解放军61081部队,北京,100094
基金项目:863国家重点基金项目(2007AA01Z439)资助
摘    要:传统方法将事件检测任务看作分类问题,将词作为实例来训练分类器,容易导致训练正反例不平衡,同时,在语料库规模较小时存在一定的数据稀疏问题。首先避开以词为实例进行分类,在事件类别判断上引入聚类思想,在事件触发词的指导下,采用自相似度对K-means聚类算法中的K值进行自收敛,优化了聚类算法。然后结合命名实体及其位置信息,对事件类别进行详细定位,很好地解决了传统事件检测对类别模板的依赖性,所检测的事件在文本摘要、检索和主题检测与追踪上得到了很好的应用。

关 键 词:事件检测  触发词  自相似度  命名实体  聚类  
收稿时间:2009/4/17 0:00:00
修稿时间:2009/7/18 0:00:00

Self-similarity Clustering Event Detection Based on Triggers Guidance
ZHANG Xian-fei,GUO Zhi-gang,LIU Song,CHENG Lei,TIAN Yu-xuan.Self-similarity Clustering Event Detection Based on Triggers Guidance[J].Computer Science,2010,37(3):212-214220.
Authors:ZHANG Xian-fei  GUO Zhi-gang  LIU Song  CHENG Lei  TIAN Yu-xuan
Affiliation:Information Engineering Institute/a>;PLA Information Engineering University/a>;Zhengzhou 450002/a>;China;PLA 61081 Unit/a>;Beijing 100094/a>;China
Abstract:Traditional method of Event Detection and Characterization (EDC) regards event detection task as classificalion problem. It makes words as samples to train classifier, which can lead to positive and negative samples of classifier imbalance. Meanwhile, there is data sparseness problem of this method when the corpus is small. This paper didn't classify event using word as samples, but clustered event in judging event types. It adapted self-similarity to convergence the value of Kin K-means algorithm by the guidance of event triggers, and optimized clustering algorithm. hhen, combining with named entity and its comparative position information, the new method further ensures the pinpoint type of event.The new method avoids depending on template of event in tradition methods, and its result of event detection can well be used in automatic text summarization, text retrieval, and topic detection and tracking.
Keywords:Event detection  Trigger  Self-similarity  Named entity  Clustering  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号