首页 | 本学科首页   官方微博 | 高级检索  
     


PM2.5 concentration prediction using hidden semi-Markov model-based times series data mining
Authors:Ming Dong  Dong Yang  Yan Kuang  David He  Serap Erdal  Donna Kenski
Affiliation:1. Department of Industrial Engineering and Management, School of Mechanical Engineering, Shanghai Jiao Tong University, 800 Dong-chuan Road, Shanghai 200240, PR China;2. General Electric (Shanghai) Corporation, 1800 Cai Lun Road, Shanghai 201203, PR China;3. Department of Mechanical and Industrial Engineering, 842 West Taylor Street, University of Illinois-Chicago, Chicago, IL 60607, USA;4. Environmental and Occupational Health Sciences, School of Public Health, University of Illinois-Chicago, Chicago, IL 60612, USA;5. Lake Michigan Air Directors Consortium, 2250 E. Devon Ave., Suite 250, Des Plaines, IL 60018, USA;1. National Research Base of Intelligent Manufacturing Service, Chongqing Technology and Business University, Chongqing 400067, China;2. Institute of Groundwater and Earth Sciences, Jinan University, Guangzhou 510632, China;1. Center for Geospatial Technology, Texas Tech University, Lubbock, TX, 79409, USA;2. Department of Geosciences, Texas Tech University, Lubbock, TX, 79409, USA;3. Scripps Institution of Oceanography and School of Medicine, University of California San Diego, La Jolla, CA, 92093, USA;4. School of Sustainability, Arizona State University, Tempe, AZ, 85281, USA;1. College of Environmental & Resource Sciences, Shanxi University, Taiyuan, China;2. School of Environmental Economics, Shanxi University of Finance & Economics, Taiyuan, China;1. State Key Laboratory of Remote Sensing Science, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, Beijing 100012, China;2. University of Chinese Academy of Sciences, Beijing, 100049, China;3. Zhejiang-CAS Application Center for Geoinformatics, Jiashan, 314100, China;4. School of Surveying and Geo-Informatics, Shandong Jianzhu University, Jinan, 250101, Shandong, China
Abstract:In this paper, a novel framework and methodology based on hidden semi-Markov models (HSMMs) for high PM2.5 concentration value prediction is presented. Due to lack of explicit time structure and its short-term memory of past history, a standard hidden Markov model (HMM) has limited power in modeling the temporal structures of the prediction problems. To overcome the limitations of HMMs in prediction, we develop the HSMMs by adding the temporal structures into the HMMs and use them to predict the concentration levels of PM2.5. As a model-driven statistical learning method, HSMM assumes that both data and a mathematical model are available. In contrast to other data-driven statistical prediction models such as neural networks, a mathematical functional mapping between the parameters and the selected input variables can be established in HSMMs. In the proposed framework, states of HSMMs are used to represent the PM2.5 concentration levels. The model parameters are estimated through modified forward–backward training algorithm. The re-estimation formulae for model parameters are derived. The trained HSMMs can be used to predict high PM2.5 concentration levels. The validation of the proposed framework and methodology is carried out in real world applications: prediction of high PM2.5 concentrations at O’Hare airport in Chicago. The results show that the HSMMs provide accurate predictions of high PM2.5 concentration levels for the next 24 h.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号