首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进生成式对抗网络的电网异常数据辨识方法
引用本文:陈杰,张浩天,汤奕.基于改进生成式对抗网络的电网异常数据辨识方法[J].电力建设,2021,42(5):9-15.
作者姓名:陈杰  张浩天  汤奕
作者单位:国网江苏省电力有限公司溧阳市供电分公司,江苏省溧阳市213300;东南大学网络空间安全学院,南京市210096;东南大学电气工程学院,南京市210096
基金项目:国家重点研发计划资助"互联大电网高性能分析和态势感知技术"
摘    要:基于数据驱动的电网异常数据辨识方法已成为电网安全领域研究的重点,由于实际电力发电统计数据中异常数据样本数极少,给通过数据挖掘方法辨识异常数据情况带来了极大困难。文章提出了一种基于Wasserstein生成式对抗网络(Wasserstein generative adversarial networks, WGAN)和孤立森林算法(isolation forest,iForest)的发电统计异常数据辨识方法。首先,利用WGAN交替训练生成器和判别器学习发电统计数据的分布特性并生成样本,用生成异常样本对原始异常样本进行增强,根据异常数据辨识精度确定异常样本的扩充比例;然后,在扩充后得到的平衡数据集上利用孤立森林算法实现异常数据辨识;最后,通过扩充样本前后模型的准确率、查全率以及查准率来比较模型异常数据的辨识效果。算例结果表明,文章提出的异常样本增强方法能够有效地改善辨识模型对于多数类的分类偏好问题,提升整体辨识精度。

关 键 词:生成式对抗网络  Wasserstein距离  样本生成  非均衡数据  异常数据辨识
收稿时间:2020-08-09

An Abnormal Data Identification Method Based on Improved Generative Adversarial Network
CHEN Jie,ZHANG Haotian,TANG Yi.An Abnormal Data Identification Method Based on Improved Generative Adversarial Network[J].Electric Power Construction,2021,42(5):9-15.
Authors:CHEN Jie  ZHANG Haotian  TANG Yi
Affiliation:1. State Grid Liyang County Electric Power Supply Company, Liyang 213300, Jiangsu Province, China2. School of Cyber Science and Engineering, Southeast University, Nanjing 210096, China3. School of Electrical Engineering, Southeast University, Nanjing 210096, China
Abstract:The identification method based on data driving for identifying abnormal data in power grid has become the focus of research in the field of power grid security. Due to the small number of abnormal data in the actual statistical data of power generation, it is extremely difficult to identify abnormal data through data mining. This paper proposes an improved generative adversarial network and isolated forests based abnormal data identification method. Firstly, by using WGAN alternating training generator and discriminator, the distribution characteristics of power generation statistical data are learned and samples are generated, which will generate abnormal samples to enhance the original abnormal samples. According to the accuracy of outlier data identification, the expansion ratio of outlier samples is determined. Then the isolated forest algorithm is used to identify the abnormal data on the expanded balanced dataset. Finally, an evaluation index system consisting of accuracy, recall and precision is selected to evaluate and compare the identification effects of models before and after category equalization. The results show that the proposed method can effectively improve the classification preference of the identification model for most classes and improve the overall identification accuracy.
Keywords:generative adversarial networks                                                                                                                        Wasserstein distance                                                                                                                        sample generating                                                                                                                        unbanced data                                                                                                                        abnormal data identification
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《电力建设》浏览原始摘要信息
点击此处可从《电力建设》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号