首页 | 本学科首页   官方微博 | 高级检索  
     

数据预处理技术和机器学习方法在质子转移反应质谱中的应用
引用本文:孙运,陈一冰,褚美娟,蒋学慧,汪曣,郭冰清. 数据预处理技术和机器学习方法在质子转移反应质谱中的应用[J]. 质谱学报, 2018, 39(5): 513-523. DOI: 10.7538/zpxb.2017.0181
作者姓名:孙运  陈一冰  褚美娟  蒋学慧  汪曣  郭冰清
作者单位:1.天津大学精密仪器与光电子工程学院,天津300072;2.中国人民解放军总医院呼吸内科,北京100853
摘    要:质子转移反应质谱(PTR-MS)法是一种用于检测挥发性有机物(VOCs)的分析技术。它具有检测限低、响应速度快、无需样品前处理、实时分析等特点,在大气化学、环境化学、食品、生物医学等领域得到广泛应用。随着PTR-MS应用的扩展和样品种类的增加,如何从复杂的质谱数据中提取特征,并寻找内在规律,对分析算法的处理能力提出了更高的要求。本工作从数据预处理技术和机器学习方法两方面展开论述,归纳了具有PTR-MS特点的数据预处理技术,总结了不同机器学习算法在PTR-MS数据分析中的应用,并讨论了它们的优点和不足。

关 键 词:质子转移反应质谱(PTR-MS)  挥发性有机物(VOCs)  数据预处理  机器学习  

Review of Data Pre-processing Techniques and Machine Learning in PTR-MS
SUN Yun,CHEN Yi-bing,CHU Mei-juan,JIANG Xue-hui,WANG Yan,GUO Bing-qing. Review of Data Pre-processing Techniques and Machine Learning in PTR-MS[J]. Journal of Chinese Mass Spectrometry Society, 2018, 39(5): 513-523. DOI: 10.7538/zpxb.2017.0181
Authors:SUN Yun  CHEN Yi-bing  CHU Mei-juan  JIANG Xue-hui  WANG Yan  GUO Bing-qing
Affiliation:1.School of Precision Instrument and Optoelectronics Engineering, Tianjin University, Tianjin 300072, China;2.Respiratory Medicine, Chinese PLA General Hospital, Beijing 100853, China
Abstract:Proton transfer reaction mass spectrometry (PTR-MS) is an analytical technique developed for the detection of volatile organic compounds (VOCs). It offers many advantages for VOCs analysis,such us ultra-low detection limits, very short response, no sample preparation, real-time analysis, etc. It has been applied in atmospheric chemistry environmental chemistry, food and biomedical. With the expansion of applications of PTR-MS and the increase of sample types, how to analyze the features from complex data and find out the inherent rules have put forward higher requirements on the processing ability of the algorithm. Therefore, this paper discussed the data preprocessing techniques and machine learning methods. Firstly, we summarized the data preprocessing methods with PTR-MS features. The data generated by the instrument cannot be directly used for statistical analysis, otherwise it will bring great error. Therefore, data pre-processing is an essential step. It includes several steps,such as denoising, normalization, and concentration calculation. The purpose of preprocessing is to get data matrix for subsequent analysis. Next, we focused on the use of machine learning methods for data analysis in PTR-MS, and the advantages of this techniques would be demonstrated as well as the drawbacks. The machine learning method can be divided into two parts. Usually unsupervised methods are common choices for initial data analysis. For further analysis and a priori knowledge, a supervised analysis would be a better way. These methods use this knowledge to learn rules and patterns related to classes in the data, and then use these rules and patterns to predict classes in newly acquired data sets. The main goal of all surveillance techniques is to find the relationship between the predictor (VOC) matrix and the response vector. In general, the combination of the unsupervised and supervised methods is a good idea. PTR-MS is a soft ionization technique, however, the presence of a few fragments will still cause great difficulties in spectral analysis, especially for unknown mixtures, which is the main reason why spectral analysis of PTR-MS differs from other mass spectrometry methods. Perhaps, the data fusion of different platform instruments and different samples will be a good way to solve this problem.
Keywords:proton transfer reaction mass spectrometry (PTR-MS)  volatile organic compounds (VOCs)  data pre-processing  machine learning  
本文献已被 CNKI 等数据库收录!
点击此处可从《质谱学报》浏览原始摘要信息
点击此处可从《质谱学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号