首页 | 本学科首页   官方微博 | 高级检索  
     

多文档文摘评价标准的研究
引用本文:魏继增,孙济洲,秦兵.多文档文摘评价标准的研究[J].计算机工程与应用,2007,43(2):180-183.
作者姓名:魏继增  孙济洲  秦兵
作者单位:1. 天津大学,计算机系,天津,300072
2. 哈尔滨工业大学,计算机学院,哈尔滨,150001
摘    要:多文档自动文摘是自然语言处理领域的一个重要研究方向。但对于多文档文摘的评价方法仍然存在方法单一,缺乏统一标准的问题。针对这些问题,就多文档文摘信息覆盖度尝试性地提出一套标准。该标准将涉及以下几个重要参数:改进BLEU参数(改进召回率),与原文档有效词覆盖度,高频词覆盖度。实验证明利用该标准能准确反映出文摘系统在信息覆盖度方面的优劣,并且接近人工评价结果。

关 键 词:BLEU  高频词覆盖度  有效词覆盖度  召回率
文章编号:1002-8331(2007)02-0180-04
修稿时间:2006-05

Research on standard of evaluation of multi-document summarization
WEI Ji-zeng,SUN Ji-zhou,QIN Bing.Research on standard of evaluation of multi-document summarization[J].Computer Engineering and Applications,2007,43(2):180-183.
Authors:WEI Ji-zeng  SUN Ji-zhou  QIN Bing
Affiliation:1.Computer Science Department,Tianjin University,Tianjin 300072 , China; 2.Computer Science Institute,Harbin Institute of Technology,Harbin 150001,China
Abstract:Multi-document automatic summarization is an important branch of natural language understanding.But the methods of evaluation of the Multi-document automatic summarization also have many problems,which are single and lack of uniform standard.The investigative point in this text is to attempt to give a standard aiming at the covered rate of information of Multi-document automatic summarization.This standard will use a few of parameters:improved BLEU parameter(recall),covered rate of effective phrase with original documents,high frequency phrase covered rate.The experiments have indicated this standard can reflect the covered rate of information of summarization system good or bad,and whether it is near to artificial evaluation results.
Keywords:BLEU  high frequency phrase covered rate  covered rate of effective phrase  recall
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号