首页 | 本学科首页   官方微博 | 高级检索  
     

基于生成式-判别式混合模型的可解释性文档分类
引用本文:王强,陈志豪,徐庆,鲍亮,廖祥文.基于生成式-判别式混合模型的可解释性文档分类[J].模式识别与人工智能,2020,33(11):995-1003.
作者姓名:王强  陈志豪  徐庆  鲍亮  廖祥文
作者单位:1.福州大学 数学与计算机科学学院 福州 350116;
2.福州大学 福建省网络计算与智能信息处理重点实验室 福州 350116;
3.福州大学 数字福建金融大数据研究所 福州 350116
基金项目:国家自然科学基金;国家自然科学基金;国家自然科学基金;福建省自然科学基金面上项目;模式识别国家重点实验室开放基金
摘    要:现有可解释性文档分类常忽略对文本信息的深度挖掘,未考虑单词与单词上下文、句子与句子上下文之间的语义关系.为此,文中提出基于生成式-判别式混合模型的可解释性文档分类方法,在文档编码器中引入分层注意力机制,获得富含上下文语义信息的文档表示,生成精确的分类结果及解释性信息,解决现有模型对文本信息挖掘不够充分的问题.在PCMag、Skytrax评论数据集上的实验表明,文中方法在文档分类上性能较优,生成较准确的解释性信息,提升方法的整体性能.

关 键 词:可解释性  分层注意力机制  文本分类  文本摘要  视角级情感分类  
收稿时间:2020-07-02

Interpretability Document Classification Based on Generative-Discriminatory Hybrid Model
WANG Qiang,CHEN Zhihao,XU Qing,BAO Liang,LIAO Xiangwen.Interpretability Document Classification Based on Generative-Discriminatory Hybrid Model[J].Pattern Recognition and Artificial Intelligence,2020,33(11):995-1003.
Authors:WANG Qiang  CHEN Zhihao  XU Qing  BAO Liang  LIAO Xiangwen
Affiliation:1. College of Mathematics and Computer Science,Fuzhou University,Fuzhou 350116;
2. Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing,Fuzhou University,Fuzhou 350116;
3. Digital Fujian Institute of Financial Big Data,Fuzhou University,Fuzhou 350116
Abstract:The deep mining of text information and the semantic relationships between word and word context and between sentence and sentence context are not taken into account in the existing interpretability document classification.Therefore,an interpretable document classification method based on a generative-discriminatory-hybrid model is proposed.The hierarchical attention mechanism is introduced into the document encoder to obtain the document representations rich in contextual semantic information.And thus more accurate classification results and explanatory information are generated,and the problem of insufficient text information mining in the existing models is handled.Experiments on PCMag and Skytrax comment datasets show that the proposed method has better performance in document classification,generates more accurate explanatory information and improves the overall performance of the method.
Keywords:Interpretability  Hierarchical Attention Mechanism  Document Classification  Text Summarization  Aspect Level Sentiment Classification  
本文献已被 万方数据 等数据库收录!
点击此处可从《模式识别与人工智能》浏览原始摘要信息
点击此处可从《模式识别与人工智能》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号