首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
研究溶解氧含量预测的精确度与鲁棒性问题,为水质评价和水处理控制提供科学依据.探讨了水中溶解氧的影响因素及其作用规律,分析了现有预测算法的不足,在此基础上提出了一种新的溶解氧含量预测方法.从样本集中随机抽取数据构成训练集和测试集,以网格搜索法确定WLS-SVM的参数寻优范围,再用QPSO与留一交叉验证组合算法找出其最优值,以此建立WLS-SVM回归模型进行水中溶解氧含量的预测.应用该方法与LS-SVMlab工具箱函数分别建模进行对比测试,结果表明其预测精确度和鲁棒性都更好.  相似文献   

2.
In recent years, regulators and environmental groups have identified the large volumes of wastewater discharged through offshore petroleum production activities as an issue of concern. In this paper, fuzzy set theory coupled with Monte Carlo analysis have provided a stochastic simulation of pollutant dispersion for the prediction of the environmental risks associated with produced water discharges. With the application of the fuzzy set to data drawn from previous risk-assessment studies, the model allowed the evaluation of various existing pollution standards for marine environments. The present modeling method was validated against data for lead (Pb) levels in the area adjacent to an offshore petroleum facility located on the Grand Banks of Newfoundland, Canada through a multi-year field expedition. The proposed risk-assessment approach contributes to the implementation of effective assessment and management of produced water discharges in the offshore water environment.  相似文献   

3.
Since fresh water is limited while agricultural and human water demands are continuously increasing, optimal prediction and management of streamflows as a source of fresh water is crucially important. This study investigates and demonstrates how data preprocessing and data mining techniques would improve the accuracy of streamflow predictive models. Based on easily accessible Snow Telemetry data (SNOTEL), four streamflow prediction models – autoregressive integrated moving average (ARIMA), artificial neural networks (ANNs), a hybrid-model of ANN and ARIMA (ANN-ARIMA), and an adaptive neuro fuzzy inference system (ANFIS) – were developed and utilized in a streamflow prediction process on Elephant Butte Reservoir. Utilizing the statistical correlation analysis and the extracting importance degrees of predictors led to efficiently select the most effective predictors for daily and monthly streamflow to Elephant Butte Reservoir. For the daily prediction time step, by preprocessing the historical data and extracting and utilizing the extracted climate variability indices through data mining techniques, the ANFIS model achieved a superior streamflow prediction performance for Elephant Butte Reservoir compared to the other three evaluated prediction models. Additionally, for predicting monthly streamflow to the Elephant Butte Reservoir, ANFIS showed significantly higher accuracy than the ANNs. As an optimal application of the developed predictive expert systems, successful integrating the prediction models in integrated reservoir operations balanced the need for a reliable supply of irrigation water against losses through evaporation. The optimal operation plan significantly minimizes the total evaporation loss from both reservoirs by providing the optimal storage levels in both reservoirs. This study provides the conceptual procedures of non-seasonal (ARIMA) model, and since the model is univariate, it demonstrates a strongly-reliable inflow prediction when existing information is limited to streamflow data as a predictor.  相似文献   

4.
软件工程领域的一个重要问题是预测软件开发项目的规模、工作量和成本,即软件项目估算问题。基于机器学习的方法在软件项目估算领域具有优势地位,本文提出了基于决策树的聚类分析预测方法,通过对目标项目的目标属性进行正确分类,预测目标属性的取值范围。通过对502个ISBSG v9项目数据集中的项目进行基于C4.5算法的分类预测,正确率达到82.4701%,满足了软件项目估算的指标要求。  相似文献   

5.
Traffic engineering with traffic prediction is a promising approach to accommodate time-varying traffic without frequent route changes. In this approach, the routes are decided so as to avoid congestion on the basis of the predicted traffic. However, if the range of variation including temporal traffic changes within the next control interval is not appropriately decided, the route cannot accommodate the shorter-term variation and congestion still occurs. To solve this problem, we propose a prediction procedure to consider the short-term and longer-term future traffic demands. Our method predicts the longer-term traffic variation from the monitored traffic data. We then take account of the short-term traffic variation in order to accommodate prediction uncertainty incurred by temporal traffic changes and prediction errors. We use the standard deviation to estimate the range of short-term fluctuation. Through the simulation using actual traffic traces on a backbone network of Internet2, we show that traffic engineering using the traffic information predicted by our method can set up routes that accommodate traffic variation for several or more hours with efficient load balancing. As a result, we can reduce the required bandwidth by 18.9% using SARIMA with trend component compared with that of the existing traffic engineering methods.  相似文献   

6.
WSN中基于GM-LSSVM的数据融合方法   总被引:2,自引:0,他引:2  
针对传统时间序列预测融合算法对于具有非线性、随机性和突发性的数据拟合度不佳的问题,提出了一种基于灰色最小二乘支持向量机(GM-LSSVM)预测的时序数据融合方法.利用少量监测数据对模型进行训练,以灰色回归预测数据作为最小二乘支持向量机的输入数据,并对下一步未知信息进行预测,以达到减少通信开销的目的.实际测量结果表明,该方法所需样本数量较少,预测准确率较高,能有效降低数据传输开销.  相似文献   

7.
王静  张建伟  梁海军 《计算机工程与设计》2012,33(4):1514-1517,1552
通过对空中交通运输管理中目前常用的轨迹预测算法的研究比较和分析,提出了利用遗传算法的从历史数据中进行函数挖掘的思想.针对四维轨迹数据特征的分析和传统的单一函数挖掘的局限性,提出了基于基因表达式编程的频繁函数集挖掘的建模方法.该模型方法通过对历史飞行数据进行遗传算法的操作挖掘出数据集中对应的函数关系集合,用较好的函数模型预测未来航迹.以某一航班雷达数据为训练集做实验,结果表明了应用该方法的准确性和可用性.  相似文献   

8.
全国取用水平台整合是实现取用水业务“一网统管”的重要途径,但因拟整合的取用水业务系统繁多,信息资源分散,平台整合中存在一数多源、标准不统一、户证点关系不清晰等现实问题,由此尝试提出了利用知识图谱技术应用于平台整合过程,建设一套全国统一的数据库表结构标准,形成一套真实且唯一的户证点关系,构建一套智能且有效的监管数据产品。本文采用自顶向下的知识图谱构建方式,根据各系统的结构化数据,构造模式层中的本体及其相互关系,形成对应的概念模型和规则关系,然后依照此模式从数据中抽取实体及关系,进行数据融合,构造数据层,并及时进行知识更新,完成全国取用水平台知识图谱的构建。最后基于统一规范的数据库表结构标准,依据对取用水户取用水行为监管需求,开展时空尺度的数据信息统计分析,构建六类监管产品,形成取用水管控“一张图”。  相似文献   

9.
Nowadays, thanks to the rapid evolvement of information technology, an explosively large amount of information with very high-dimensional features for customers is being accumulated in companies. These companies, in turn, are exerting every effort to develop more efficient churn prediction models for managing customer relationships effectively. In this paper, a novel method is proposed to deal with a high-dimensional large data set for constructing better churn prediction models. The proposed method starts by partitioning a data set into small-sized data subsets, and applies sequential manifold learning to reduce high-dimensional features and give consistent results for combined data subsets. The performance of the constructed churn prediction model using the proposed method is tested using an E-commerce data set by comparing it with other existing methods. The proposed method works better and is much faster for high-dimensional large data sets without the need for retraining the original data set to reduce the dimensions of new test samples.  相似文献   

10.
跨项目缺陷预测旨在解决传统的项目内缺陷预测的历史数据缺失,新项目初期缺乏训练数据等实际问题。然而,在跨项目缺陷预测中,不同项目之间以及实例之间的数据分布差异降低了其预测性能。针对这一问题,提出了基于分层数据筛选的跨项目缺陷预测方法。该方法将训练数据的筛选过程分为项目层筛选和实例层筛选,从源数据集中选出与目标项目数据分布最接近的候选项目集,在候选项目集中选出与目标项目中实例相似度较高的训练数据集,最后在训练数据集上训练朴素贝叶斯模型。在PROMISE数据集进行实验对比。结果表明,与项目内缺陷预测比较,提出的分层数据筛选方法优于项目内缺陷预测,并且有效降低了训练数据和目标项目数据之间的差异性。  相似文献   

11.
针对实际系统中采集的数据流的不确定性,给异常点检测与修正带来了现实挑战。因此,根据滑动基本窗口采样算法(sliding basic windows sampling,SBWB)与高斯过程回归(Gaussian process regression,GPR)模型的特性,提出了基于SBWS_GPR预测模型的不确定性多数据流的异常检测方法。在基于时间序列采集的历史数据集中,引入索引号,对历史数据集进行聚类,分析数据集与索引号的映射关系,将实时获得的输入数据流通过滑动窗口匹配,实现对单数据流的异常点检测与修正。再利用输入、输出数据间的相关性,基于GPR建立预测模型,比较实时观察的输出数据流与预测模型的输出数据流,最终从输入、输出两种不同通道实现多数据流的异常检测与修正。  相似文献   

12.
时间序列预测方法综述   总被引:1,自引:0,他引:1  
时间序列是按照时间排序的一组随机变量,它通常是在相等间隔的时间段内依照给定的采样率对某种潜在过程进行观测的结果。时间序列数据本质上反映的是某个或者某些随机变量随时间不断变化的趋势,而时间序列预测方法的核心就是从数据中挖掘出这种规律,并利用其对将来的数据做出估计。针对时间序列预测方法,着重介绍了传统的时间序列预测方法、基于机器学习的时间序列预测方法和基于参数模型的在线时间序列预测方法,并对未来的研究方向进行了进一步的展望。  相似文献   

13.
交通拥堵预测是智慧交通一个重要组成部分,但是大量的交通数据无法以公开的方式获取。在不完全数据下,提出了一种基于时空相关性的交通拥堵预测方法。该方法采用改进的核密度估计法,使得预测过程中不依赖大量历史数据进行训练,直接利用部分采集到的数据精准地实时地对交通拥堵进行预测。在真实数据集上对提出的交通拥堵预测方法进行验证,实验结果表明了该方法在实时交通预测上的可行性。  相似文献   

14.
黄晓璐  闵应骅 《计算机工程》2006,32(14):85-86,1
引入了半马尔柯夫模型描述网络流量特性,并在该模型的基础上分析推导了相应的流量预测方法。分别对广域网和局域网不同时间尺度统计的实际流量数据进行分析和短期、长期预测,所有数据的实际预测精确度均小于预先设定的置信度。说明引入的模型能真实反映网络流量特性,基于该模型的流量预测方法具有良好的预测性能且适用于不同长度的预测。  相似文献   

15.
16.
快速准确有效的数据处理方法是日益增长的地学数据为成矿预测工作提出的迫切要求。提出了一种基于GIS的地球化学数据处理新方法,克服了传统统计分析方法对样本数据要求服从正态或对数正态分布,处理结果受少数特高品位数据影响的缺点。通过凤凰山铜矿地球化学数据处理的对比分析,认为该方法具有较好的抗干扰和致矿异常分辨能力。  相似文献   

17.
计算机技术和网络的发展使得数据呈爆炸式的涌现,社交媒体不断融入到人们的生活中,社会网络分析已成为研究的热点。随着大数据时代的到来,对社交网络链接算法研究产生巨大影响,原有的基于网络结构的预测方法已经渐渐不适应现状。因此,提出了一种基于主题模型的社交网络链接预测方法。首先以微博社交网络为数据源,将实验网络分为测试集和训练集;其次利用主题模型得到用户的主题特征,结合命名实体集和用户联系特征集合得到用户的兴趣特征相似性度量,加上网络结构相似性从而得到用户节点相似度,进而对社交网络链接进行预测;最终使用链接预测最常用的评价体系AUC来评价链接预测方法的效果。通过实验验证,该方法的预测准确率更高。  相似文献   

18.
Protein-protein interaction (PPI) prediction is one of the main goals in the current Proteomics. This work presents a method for prediction of protein-protein interactions through a classification technique known as support vector machines. The dataset considered is a set of positive and negative examples taken from a high reliability source, from which we extracted a set of genomic features, proposing a similarity measure. From this dataset we extracted 26 proteomics/genomics features using well-known databases and datasets. Feature selection was performed to obtain the most relevant variables through a modified method derived from other feature selection methods for classification. Using the selected subset of features, we constructed a support vector classifier that obtains values of specificity and sensitivity higher than 90% in prediction of PPIs, and also providing a confidence score in interaction prediction of each pair of proteins.  相似文献   

19.
The usual huge fluctuations in the blast furnace gas (BFG) generation make the scheduling of the gas system become a difficult problem. Considering that there are high level noises and outliers mixed in original industrial data, a quantile regression-based echo state network ensemble (QR-ESNE) is modeled to construct the prediction intervals (PIs) of the BFG generation. In the process of network training, a linear regression model of the output matrix is reported by the proposed quantile regression to improve the generalization ability. Then, in view of the practical demands on reliability and further improving the prediction accuracy, a bootstrap strategy based on QR-ESN is designed to construct the confidence intervals and the prediction ones via combining with the regression models of various quantiles. To verify the performance of the proposed method, the practical data coming from a steel plant are employed, and the results indicate that the proposed method exhibits high accuracy and reliability for the industrial data. Furthermore, an application software system based on the proposed method is developed and applied to the practice of this plant.  相似文献   

20.
对航班备降问题的小样本特点进行了分析,提出了基于观察学习的航班备降概率分布预测模型。该模型利用松弛属性约束思想抽取数据子集,三次样条插值方法构建基学习器,并结合虚拟数据生成策略促使各基学习器达成一致。并在此基础上,对信任度参数进行优化,进一步完善了预测模型。在航班备降数据集的实验表明,在大样本下,该预测模型的预测精度高于朴素贝叶斯方法和贝叶斯网方法;在小样本数据集上分析了航班不同备降次数下的置信度,为相关部门提供决策支持。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号