首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于深度学习的异常数据清洗算法
引用本文:匡俊搴,赵畅,杨柳,王海峰,钱骅.一种基于深度学习的异常数据清洗算法[J].电子与信息学报,2022,44(2):507-513.
作者姓名:匡俊搴  赵畅  杨柳  王海峰  钱骅
作者单位:1.中国科学院上海高等研究院 上海 2012102.上海科技大学信息科学与技术学院 上海 2012103.中国科学院大学 北京 1000494.中国科学院大学微电子学院 北京 1000495.中国科学院上海微系统与信息技术研究所 上海 200050
基金项目:国家自然科学基金(61971286),国家重点研究发展计划(2020YFB2205603),上海市科学技术委员会科技创新行动计划(19DZ1204300)
摘    要:在物联网(IoT)中采用合适的异常数据清洗算法能极大地提升数据质量.许多研究人员采用统计学方法或分类聚类等方法对时-空相关数据进行清洗.但这些方法需要额外的先验知识,会给汇聚节点带来额外的计算开销.该文根据低秩-稀疏矩阵分解模型,提出一种基于深度神经网络的快速异常数据清洗算法,来解决物联网中时-空相关数据的清洗问题.结...

关 键 词:物联网  异常数据清洗  迭代阈值收缩算法  展开  深度神经网络
收稿时间:2020-12-30

An Outlier Cleaning Algorithm Based on Deep Learning
KUANG Junqian,ZHAO Chang,YANG Liu,WANG Haifeng,QIAN Hua.An Outlier Cleaning Algorithm Based on Deep Learning[J].Journal of Electronics & Information Technology,2022,44(2):507-513.
Authors:KUANG Junqian  ZHAO Chang  YANG Liu  WANG Haifeng  QIAN Hua
Affiliation:1.Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai 201210, China2.School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China3.University of Chinese Academy of Sciences, Beijing 100049, China4.School of Microelectronics, University of Chinese Academy of Sciences, Beijing 100049, China5.Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai 200050, China
Abstract:The use of appropriate abnormal data cleaning algorithms in the Internet of Things (IoT) can greatly improve data quality. Statistical methods or clustering methods are utilized to clean anomalies in Spatio-temporal data. However, these methods require additional prior knowledge, which will incur additional computational overhead for the sink node. In this paper, in line with the low-rank sparse matrix decomposition model, a fast anomaly cleaning algorithm based on a deep neural network is proposed to solve the Spatio-temporal data cleaning problem in IoT. Both the Spatio-temporal correlation of sensing data and the abnormal values' sparsity are considered in an optimization problem. The Iterative Shrinkage-Thresholding Algorithm (ISTA) is used to solve it. Then the ISTA is unfolded into a fixed-length deep neural network. The real-world dataset’s experimental results show that the proposed method can automatically update the thresholds faster and more accurately than the traditional ISTA.
Keywords:Internet of Things (IoT)  Outlier cleaning  Iterative Shrinkage-Thresholding Algorithm (ISTA)  Unfolding  Deep neural network
本文献已被 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号