首页 | 本学科首页   官方微博 | 高级检索  
     

基于多粒度特征的文本生成评价方法
引用本文:赖华,高玉梦,黄于欣,余正涛,张勇丙. 基于多粒度特征的文本生成评价方法[J]. 中文信息学报, 2022, 36(3): 45-53,63
作者姓名:赖华  高玉梦  黄于欣  余正涛  张勇丙
作者单位:1.昆明理工大学 信息工程与自动化学院,云南 昆明 650504;
2.昆明理工大学 云南省人工智能重点实验室,云南 昆明 650504
基金项目:国家自然科学基金(61732005, 61972186, 61762056, 61761026);云南省重大科技专项计划项目(202002AD080001-5);云南省重大科技专项计划项目(202103AA080015);云南省高新技术产业专项(201606);云南省基础研究计划项目(202001AT070047,2018FB104)
摘    要:近年来,基于预训练语言模型的文本生成评价方法得到了广泛关注,其通过计算两个句子间子词粒度的相似度来评价生成文本的质量.但是对于越南语、泰语等存在大量黏着语素的语言,单个音节或子词不能独立成词表达语义,仅基于子词粒度匹配的方法并不能够完整表征两个句子间的语义相似关系.基于此,该文提出一种基于子词、音节、词组等多粒度特征的...

关 键 词:文本生成  评价方法  黏着语素  多粒度特征  MBERT

Evaluation Method of Text Generation Based on Multi-granularity Features
LAI Hua,GAO Yumeng,HUANG Yuxin,YU Zhengtao,ZHANG Yongbing. Evaluation Method of Text Generation Based on Multi-granularity Features[J]. Journal of Chinese Information Processing, 2022, 36(3): 45-53,63
Authors:LAI Hua  GAO Yumeng  HUANG Yuxin  YU Zhengtao  ZHANG Yongbing
Affiliation:1.Faculty of Information Engineering and Automation, Kunming University of Science and Technology,Kunming,Yunnan 650504, China;
2.Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology,Kunming, Yunnan 650504, China
Abstract:Recently, the evaluation method of text generation based on pre-trained language model has gained attention, which evaluates the quality of generated text by computing the granularity similarity of sub-words of two sentences. However, for languages that contain many adhesive morphemes, such as Vietnamese and Thai, a single syllable or sub-word cannot form the semantic integrity, which means that the sub-word granularity matching method cannot fully represent the semantic relationship between two sentences. Therefore, we propose a text generation evaluation method with multi-granularity features of sub-words, syllables, and phrases. After the representation of text is obtained by MBERT, the semantic similarity of syllables and phrases is introduced to enhance the evaluation model of sub-words. Experimental results on such tasks as cross-language summarization, machine translation, and data screening show that, compared with ROUGE, BLEU based on statistical evaluation and Bertscore based on deep semantic matching, the proposed metric correlates better with human judgments.
Keywords:text generation    evaluation method    adhesive morphemes    multi-granularity feature    MBERT  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号