Evaluation Method of Text Generation Based on Multi-granularity Features

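Funding:	National Natural Science Foundation of China (61732005, 61972186, 61762056, 61761026); Yunnan Provincial Major Science and Technology Special Plan Projects (202002AD080001-5, 202103AA080015); Yunnan High and New Technology Industry Project (201606); Yunnan Provincial Basic Research Program (202001AT070047, 2018FB104)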

Cite this article:	LAI Hua, GAO Yumeng, HUANG Yuxin, YU Zhengtao, ZHANG Yongbing. Evaluation Method of Text Generation Based on Multi-granularity Features[J]. Journal of Chinese Information Processing, 2022, 36(3): 45-53, 63

Authors:	LAI Hua, GAO Yumeng, HUANG Yuxin, YU Zhengtao, ZHANG Yongbing

Affiliation:	1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, Yunnan 650504, China; 2. Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming, Yunnan 650504, China

Abstract:	Text generation evaluation methods based on pre-trained language models have recently attracted wide attention; they assess the quality of generated text by computing sub-word-level similarity between two sentences. However, in languages with many bound morphemes, such as Vietnamese and Thai, a single syllable or sub-word cannot stand alone as a word that carries meaning, so matching at the sub-word level alone cannot fully capture the semantic similarity between two sentences. We therefore propose a text generation evaluation method that uses multi-granularity features: sub-words, syllables, and phrases. After text representations are obtained from MBERT, syllable-level and phrase-level semantic similarity are introduced to enhance the sub-word-level evaluation model. Experimental results on cross-lingual summarization, machine translation, and data screening show that the proposed metric correlates better with human judgments than ROUGE and BLEU, which are based on statistical matching, and BERTScore, which is based on deep semantic matching.
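
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact formulation, so the mean pooling of sub-word vectors into syllable and phrase vectors, the BERTScore-style greedy matching, the fixed weights, and every function name below are assumptions made for illustration. Toy two-dimensional vectors stand in for MBERT representations.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def greedy_f1(ref, cand):
    """BERTScore-style greedy matching: each unit is paired with its most similar counterpart."""
    if not ref or not cand:
        return 0.0
    recall = sum(max(cosine(r, c) for c in cand) for r in ref) / len(ref)
    precision = sum(max(cosine(c, r) for r in ref) for c in cand) / len(cand)
    total = precision + recall
    return 2 * precision * recall / total if total > 0 else 0.0


def mean_pool(vectors, spans):
    """Average sub-word vectors over [start, end) spans to get coarser units (assumed pooling)."""
    pooled = []
    for start, end in spans:
        chunk = vectors[start:end]
        pooled.append([sum(col) / len(chunk) for col in zip(*chunk)])
    return pooled


def multi_granularity_score(ref, cand, ref_spans, cand_spans, weights=(0.5, 0.25, 0.25)):
    """Weighted mix of sub-word, syllable, and phrase F1 scores (weights are hypothetical)."""
    scores = [greedy_f1(ref, cand)]  # sub-word level
    for level in ("syllable", "phrase"):
        scores.append(greedy_f1(mean_pool(ref, ref_spans[level]),
                                mean_pool(cand, cand_spans[level])))
    return sum(w * s for w, s in zip(weights, scores)) / sum(weights)


# Toy sub-word vectors; in practice these would come from a multilingual encoder.
reference = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, 0.0]]
reference_spans = {"syllable": [(0, 2), (2, 4)], "phrase": [(0, 4)]}
candidate = [[0.0, 1.0], [0.0, 1.0]]
candidate_spans = {"syllable": [(0, 2)], "phrase": [(0, 2)]}

same = multi_granularity_score(reference, reference, reference_spans, reference_spans)
different = multi_granularity_score(reference, candidate, reference_spans, candidate_spans)
print(same, different)
```

In this sketch a candidate identical to the reference scores approximately 1, and a candidate that shares only some sub-word vectors scores lower at every granularity. The span boundaries would come from a syllable or word segmenter for the target language, which is outside the scope of this example.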

Keywords:	text generation evaluation method; bound morphemes; multi-granularity features; MBERT
