首页 | 本学科首页   官方微博 | 高级检索  
     

基于单向Transformer和孪生网络的多轮任务型对话技术
引用本文:王涛,刘超辉,郑青青,黄嘉曦. 基于单向Transformer和孪生网络的多轮任务型对话技术[J]. 计算机工程, 2021, 47(7): 55-58,66. DOI: 10.19678/j.issn.1000-3428.0058557
作者姓名:王涛  刘超辉  郑青青  黄嘉曦
作者单位:深圳市易马达科技有限公司, 广东 深圳 518055
基金项目:中美绿色基金(MA009RX18)。
摘    要:循环神经网络和Transformer在多轮对话系统的建模上依赖大量的样本数据且回复准确率过低.为此,提出一种针对任务型对话系统的建模方法.引入预训练模型对句子语意和对话过程进行深度编码,对Transformer模型进行精简,仅保留编码器部分的单向Transformer,将应答部分抽象成不同的指令,采用孪生网络对指令进行...

关 键 词:循环神经网络  多轮对话系统  预训练模型  Transformer模型  孪生网络
收稿时间:2020-06-05
修稿时间:2020-07-12

Multi-turn Task-oriented Dialogue Technology Based on Unidirectional Transformer and Siamese Network
WANG Tao,LIU Chaohui,ZHENG Qingqing,HUANG Jiaxi. Multi-turn Task-oriented Dialogue Technology Based on Unidirectional Transformer and Siamese Network[J]. Computer Engineering, 2021, 47(7): 55-58,66. DOI: 10.19678/j.issn.1000-3428.0058557
Authors:WANG Tao  LIU Chaohui  ZHENG Qingqing  HUANG Jiaxi
Affiliation:Shenzhen Immotor Technology Co., Ltd., Shenzhen, Guangdong 518055, China
Abstract:The existing Recurrent Neural Network(RNN) and Transformer models rely on a large amount of sample data for the modeling of the multi-turn dialogue system,and the accuracy of answering is low.To address the problem,a new modeling method for the task-oriented dialogue system is proposed.Some pre-trained models are introduced for deep encoding of the sentence semantics and the dialog contents.At the same time,the Transformer model is simplified to a unidirectional transformer with only the encoder retained.On this basis,the answering part is abstracted to different commands,which are sorted based on similarity by using the siamese network.The command with the highest similarity is chosen to generate the answer.The experimental results on the MultiWOZ dataset show that compared to LSTM and Transformer-based models,the proposed method has a faster prediction speed,providing better performance on small datasets and equal performance on large datasets.
Keywords:Recurrent Neural Network(RNN)  multi-turn dialogue system  pre-training model  Transformer model  siamese network  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号