首页 | 本学科首页   官方微博 | 高级检索  
     

基于基本要素向量空间的英文多文档自动摘要
引用本文:刘德喜,何炎祥,姬东鸿,杨华.基于基本要素向量空间的英文多文档自动摘要[J].计算机工程,2007,33(14):166-167.
作者姓名:刘德喜  何炎祥  姬东鸿  杨华
作者单位:1. 襄樊学院物理学系,襄樊,441053;武汉大学计算机学院,武汉,430079
2. 武汉大学计算机学院,武汉,430079
3. 新加坡信息通讯研究所,新加坡,119613
摘    要:在基于基本要素(BE)向量空间的英文多文档自动文摘中,句子不再用术语向量或词向量来表达,而是用基本要素向量来表示。在用k-均值聚类算法时,采用一种自动探测k值的技术。实验表明,基于基本要素的多文档自动文摘MSBEC比基于词更优越。

关 键 词:多文档自动文摘  基本要素  k-均值聚类
文章编号:1000-3428(2007)14-0166-02
修稿时间:2006-07-30

English Multi-document Summarization Based on Basic Element Vector Space
LIU Dexi,HE Yanxiang,JI Donghong,YANG Hua.English Multi-document Summarization Based on Basic Element Vector Space[J].Computer Engineering,2007,33(14):166-167.
Authors:LIU Dexi  HE Yanxiang  JI Donghong  YANG Hua
Affiliation:1. School of Physics, Xiangfan University, Xiangfan 441053; 2. School of Computer, Wuhan University, Wuhan 430079; 3. Institute for Information Research Comm., Singapore 119613
Abstract:This paper proposes a novel multi-document summarization strategy based on basic element(BE) vector clustering.In this strategy,sentences are represented by BE vectors instead of word or term vectors before clustering.The BE-vector clustering is realized by adopting the k-means clustering method,and a novel clustering analysis method is employed to automatically detect the number of clusters,k.The experimental results indicate a superiority of the proposed strategy over the traditional summarization strategy based on word vector clustering.
Keywords:multi-document summarization  basic element  k-means clustering
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号