首页 | 本学科首页   官方微博 | 高级检索  
     

基于biRNN的海军军械不均衡文本数据集处理方法
引用本文:齐玉东,丁海强,赵锦超,孙明玮. 基于biRNN的海军军械不均衡文本数据集处理方法[J]. 计算机与现代化, 2019, 0(12): 21. DOI: 10.3969/j.issn.1006-2475.2019.12.005
作者姓名:齐玉东  丁海强  赵锦超  孙明玮
作者单位:海军航空大学,山东 烟台,264001;海军92665部队,湖南 张家界,427000
摘    要:传统的不均衡数据集处理方法存在人工设置特征繁琐、普适性差等缺陷,难以适用于海军军械不均衡文本数据集处理。针对此问题,本文提出一种基于biRNN模型的海军军械不均衡文本数据集处理方法。通过biRNN模型自动学习文本序列特征,以双向文本序列预测方式扩展少数类文本,达到文本数据均衡目的,并在均衡数据集的基础上将整个文本数据集进行扩充。分别对原始数据集、均衡数据集、扩充数据集进行文本分类实验,实验结果表明,基于biRNN的不均衡数据集扩展方法对原始数据集进行均衡、扩展处理能够有效提高文本分类的性能。

关 键 词:深度学习  海军军械  不均衡数据集  双向循环神经网络  文本数据挖掘
收稿时间:2019-12-11

biRNN-based Method for Processing Unbalanced Text Data Sets of Naval Ordnance
QI Yu-dong,DING Hai-qiang,ZHAO Jin-chao,SUN Ming-wei. biRNN-based Method for Processing Unbalanced Text Data Sets of Naval Ordnance[J]. Computer and Modernization, 2019, 0(12): 21. DOI: 10.3969/j.issn.1006-2475.2019.12.005
Authors:QI Yu-dong  DING Hai-qiang  ZHAO Jin-chao  SUN Ming-wei
Abstract:Traditional unbalanced data sets processing methods are characterized by complicated artificial settings and poor universality, which are difficult to be applied to naval ordnance unbalanced text data sets processing. Aiming at this problem, this paper proposes a method of processing unbalanced text data sets of naval ordnance based on biRNN model. The biRNN model is used to automatically learn the features of text sequences and expand a few types of texts by two-way text sequence prediction to achieve the goal of text data balancing. The whole text data set is expanded on the basis of balanced data set. Text classification experiments are carried out on the original data set, the balanced data set and the extended data set. The experimental results show that the unbalanced data set expansion method based on biRNN can effectively improve the performance of text classification by balancing and extending the original data set.
Keywords:deep learning  naval ordnance  unbalanced data set  bidirectional recurrent neural network  text data mining
  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机与现代化》浏览原始摘要信息
点击此处可从《计算机与现代化》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号