首页 | 本学科首页   官方微博 | 高级检索  
     

基于WFC和MI的主题句提取方法
引用本文:薛扣英,原盛,张心严.基于WFC和MI的主题句提取方法[J].计算机工程,2009,35(20):184-186.
作者姓名:薛扣英  原盛  张心严
作者单位:1. 西安交通大学电子与信息工程学院,西安,710049
2. 西安交通大学软件学院,西安,710049
基金项目:中科院国际合作伙伴计划基金资助项目 
摘    要:提出一种基于加权模糊聚类(WFC)和互信息(MI)的主题句提取方法,使主题句尽可能全面覆盖全文主题的同时,缩减自身的冗余,以提高摘要效率,采用加权模糊聚类的方法对文本句子进行分类,对在同一类中的句子使用比较互信息的方法进行排名处理,从而获得高质量的摘要。实验结果表明,与传统聚类方法比较,该方法的正确率提高约15%,可以达到约70%的精确度,并在阅读摘要时能够基本正确地获取文本信息。

关 键 词:主题句  加权模糊聚类  互信息
修稿时间: 

Topic Sentence Extraction Method Based on Weight Fuzzy Clustering and Mutual Information
XUE Kou-ying,YUAN Sheng,ZHANG Xin-yan.Topic Sentence Extraction Method Based on Weight Fuzzy Clustering and Mutual Information[J].Computer Engineering,2009,35(20):184-186.
Authors:XUE Kou-ying  YUAN Sheng  ZHANG Xin-yan
Affiliation:(1. School of Electronics and Information Engineering, Xi’an Jiaotong University, Xi’an 710049; 2. School of Software, Xi’an Jiaotong University, Xi’an 710049)
Abstract:A topic sentence extraction method based on Weight Fuzzy Clustering(WFC) and Mutual Information(MI) is proposed, which is to cover more topics and lower the redundant information of the text. The abstract efficiency is promoted. Using WFC method, the sentences is classified. Sentences in each cluster is ranked by MI values. High qualified abstact is obtained. Experimental results show that, compared with former clustering method, this method can improve the precision by nearly 15%, and has about 70% accuracy. It can get text information correctly.
Keywords:topic sentence  Weight Fuzzy Clustering(WFC)  Mutual Information(MI)
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号