首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
针对不同工况下训练样本与测试样本分布差异导致滚动轴承寿命阶段无法被有效识别的问题,提出改进均衡分布适配的滚动轴承寿命阶段识别方法。采用无重复均匀随机抽样对源域类间样本进行多次均匀随机抽样,得到源域多样本训练集,以减小源域内部样本选择对目标域预测标签的影响;在再生核希尔伯特空间上利用平衡因子μ动态调节边缘分布和条件分布所占权值,并通过迭代的方式不断优化目标域伪标签以减小两域的最大均值差异;利用源域多样本数据集各自的映射矩阵构造多个分类器,经过一致性判别得到目标域样本最终识别结果。在两组滚动轴承寿命阶段数据集上进行实验验证,证明了所提方法的可行性和有效性。  相似文献   

2.
《工程(英文)》2021,7(9):1262-1273
Data-driven process-monitoring methods have been the mainstream for complex industrial systems due to their universality and the reduced need for reaction mechanisms and first-principles knowledge. However, most data-driven process-monitoring methods assume that historical training data and online testing data follow the same distribution. In fact, due to the harsh environment of industrial systems, the collected data from real industrial processes are always affected by many factors, such as the changeable operating environment, variation in the raw materials, and production indexes. These factors often cause the distributions of online monitoring data and historical training data to differ, which induces a model mismatch in the process-monitoring task. Thus, it is difficult to achieve accurate process monitoring when a model learned from training data is applied to actual online monitoring. In order to resolve the problem of the distribution divergence between historical training data and online testing data that is induced by changeable operation environments, a robust transfer dictionary learning (RTDL) algorithm is proposed in this paper for industrial process monitoring. The RTDL is a synergy of representative learning and domain adaptive transfer learning. The proposed method regards historical training data and online testing data as the source domain and the target domain, respectively, in the transfer learning problem. Maximum mean discrepancy regularization and linear discriminant analysis-like regularization are then incorporated into the dictionary learning framework, which can reduce the distribution divergence between the source domain and target domain. In this way, a robust dictionary can be learned even if the characteristics of the source domain and target domain are evidently different under the interference of a realistic and changeable operation environment. Such a dictionary can effectively improve the performance of process monitoring and mode classification. Extensive experiments including a numerical simulation and two industrial systems are conducted to verify the efficiency and superiority of the proposed method.  相似文献   

3.
Sentiment analysis is a research hot spot in the field of natural language processing and content security. Traditional methods are often difficult to handle the problems of large difference in sample distribution and the data in the target domain is transmitted in a streaming fashion. This paper proposes a sentiment analysis method based on Kmeans and online transfer learning in the view of fact that most existing sentiment analysis methods are based on transfer learning and offline transfer learning. We first use the Kmeans clustering algorithm to process data from one or multiple source domains and select the data similar to target domain data to establish the classifier, so that the processed data does not negatively transfer the data in the target domain. And then create a new classifier based on the new target domain. The source domain classifier and target domain classifier are combined with certain weights by using the homogeneous online transfer learning method to achieve sentiment analysis. The experimental results show that this method has achieved better performance in terms of error rate and classification accuracy.  相似文献   

4.
The application of deep learning in the field of object detection has experienced much progress. However, due to the domain shift problem, applying an off-the-shelf detector to another domain leads to a significant performance drop. A large number of ground truth labels are required when using another domain to train models, demanding a large amount of human and financial resources. In order to avoid excessive resource requirements and performance drop caused by domain shift, this paper proposes a new domain adaptive approach to cross-domain vehicle detection. Our approach improves the cross-domain vehicle detection model from image space and feature space. We employ objectives of the generative adversarial network and cycle consistency loss for image style transfer in image space. For feature space, we align feature distributions between the source domain and the target domain to improve the detection accuracy. Experiments are carried out using the method with two different datasets, proving that this technique effectively improves the accuracy of vehicle detection in the target domain.  相似文献   

5.
针对变工况下的滚动轴承无法获得大量带标签样本数据以及传统深度学习诊断方法识别率低的问题,提出一种基于迁移学习的卷积神经网络模型滚动轴承故障诊断方法.首先,采用短时傅里叶变换处理滚动轴承振动信号获得源域、目标域样本集;其次,利用源域样本预训练卷积神经网络模型;最后,通过目标域样本微调卷积神经网络模型实现滚动轴承故障诊断....  相似文献   

6.
The accurate and stable prediction of protein domain boundaries is an important avenue for the prediction of protein structure, function, evolution, and design. Recent research on protein domain boundary prediction has been mainly based on widely known machine learning techniques. In this paper, we propose a new machine learning based domain predictor namely, DomNet that can show a more accurate and stable predictive performance than the existing state-of-the-art models. The DomNet is trained using a novel compact domain profile, secondary structure, solvent accessibility information, and interdomain linker index to detect possible domain boundaries for a target sequence. The performance of the proposed model was compared to nine different machine learning models on the Benchmark_2 dataset in terms of accuracy, sensitivity, specificity, and correlation coefficient. The DomNet achieved the best performance with 71% accuracy for domain boundary identification in multidomains proteins. With the CASP7 benchmark dataset, it again demonstrated superior performance to contemporary domain boundary predictors such as DOMpro, DomPred, DomSSEA, DomCut, and DomainDiscovery.  相似文献   

7.
考虑到传统物理分析方法无法解决导线舞动的预测问题,综合运用机器学习算法,对已有的舞动历史数据进行筛选和预处理,并挖掘有效信息,利用one class SVM算法解决舞动数据中负样本缺失问题,采用集成学习算法中Bagging算法建立分类器学习方法,实现了数据的随机抽样,分成不同组数据集进行相互独立的训练,避免对舞动数据过拟合,提升机器学习算法的抗噪声能力以及泛化能力,采用k折交叉验证算法进行模型的验证,并利用F1-score描述导线舞动预警模型的性能,验证了该方法在舞动预测方面的有效性。  相似文献   

8.
针对未知环境条件下的高光谱图像目标检测问题进行了研究,提出了一种基于投影的自动目标检测算法。该算法通过构造正交投影算子预先对部分干扰物信息进行削弱,再以无监督的自动目标搜寻方法找到场景中可能的目标物,将图像数据向可能目标物所张成的子空间投影以增强目标物的信息,然后用匹配的方法完成检测。有效减弱了干扰物对目标检测的影响,缩小了目标搜索的范围。应用此算法对实验采集数据进行处理,取得了较好的结果。  相似文献   

9.
Software defect prediction plays an important role in software quality assurance. However, the performance of the prediction model is susceptible to the irrelevant and redundant features. In addition, previous studies mostly regard software defect prediction as a single objective optimization problem, and multi-objective software defect prediction has not been thoroughly investigated. For the above two reasons, we propose the following solutions in this paper: (1) we leverage an advanced deep neural network—Stacked Contractive AutoEncoder (SCAE) to extract the robust deep semantic features from the original defect features, which has stronger discrimination capacity for different classes (defective or non-defective). (2) we propose a novel multi-objective defect prediction model named SMONGE that utilizes the Multi-Objective NSGAII algorithm to optimize the advanced neural network—Extreme learning machine (ELM) based on state-of-the-art Pareto optimal solutions according to the features extracted by SCAE. We mainly consider two objectives. One objective is to maximize the performance of ELM, which refers to the benefit of the SMONGE model. Another objective is to minimize the output weight norm of ELM, which is related to the cost of the SMONGE model. We compare the SCAE with six state-of-the-art feature extraction methods and compare the SMONGE model with multiple baseline models that contain four classic defect predictors and the MONGE model without SCAE across 20 open source software projects. The experimental results verify that the superiority of SCAE and SMONGE on seven evaluation metrics.  相似文献   

10.
利用统计能量分析进行高频动力学环境响应预示的关键步骤之一是确定可靠的统计能量分析参数。该文研究了基于子空间的统计能量分析一阶功率流模型辨识和模型修正理论,基于多变量输出误差状态空间(MOESP)算法和约束优化思想,提出一种利用子空间法识别统计能量分析参数的新方法。通过算例对该方法进行了仿真验证,结果表明方法可行并具有较好的抗噪性能。最后,对L 型板结构进行了实验统计能量分析,比较了子空间方法和功率输入法结果,两者吻合很好,从而进一步验证了子空间方法的正确性。该文验证了利用瞬态时域数据进行统计能量分析模型修正和参数识别的可行性,也是对实验统计能量分析的补充和发展。  相似文献   

11.
Fusion of multimodal imaging data supports medical experts with ample information for better disease diagnosis and further clinical investigations. Recently, sparse representation (SR)‐based fusion algorithms has been gaining importance for their high performance. Building a compact, discriminative dictionary with reduced computational effort is a major challenge to these algorithms. Addressing this key issue, we propose an adaptive dictionary learning approach for fusion of multimodal medical images. The proposed approach consists of three steps. First, zero informative patches of source images are discarded by variance computation. Second, the structural information of remaining image patches is evaluated using modified spatial frequency (MSF). Finally, a selection rule is employed to separate the useful informative patches of source images for dictionary learning. At the fusion step, batch‐OMP algorithm is utilized to estimate the sparse coefficients. A novel fusion rule which measures the activity level in both spatial domain and transform domain is adopted to reconstruct the fused image with the sparse vectors and trained dictionary. Experimental results of various medical image pairs and clinical data sets reveal that the proposed fusion algorithm gives better visual quality and competes with existing methodologies both visually and quantitatively.  相似文献   

12.
The remaining useful life (RUL) of the machine is one of the key information for predictive maintenance. If there is a lack of predictive maintenance strategy, it will increase the maintenance and breakdown costs of the machine. We apply transfer learning techniques to develop a new method that predicts the RUL of target data using degradation trends learned from complete bearing test data called source data. The training length of the model plays a crucial role in RUL prediction. First, the exponentially weighted moving average (EWMA) chart is used to identify the abnormal points of the bearing to determine the starting point of the model's training. Secondly, we propose transfer learning based on a bidirectional long and short-term memory with attention mechanism (BiLSTMAM) model to estimate the RUL of the ball bearing. At the same time, the public data set is used to compare the estimation effect of the BiLSTMAM model with some published models. The BiLSTMAM model with the EWMA chart can achieve a score of 0.6702 for 11 target bearings. The accuracy of the RUL estimation ensures a reliable maintenance strategy to reduce unpredictable failures.  相似文献   

13.
材料数据由于小样本、高维度、噪音大等特性, 用于机器学习建模时常常会产生与领域专家认知不一致的结果。面向机器学习全流程, 开发材料领域知识嵌入的机器学习模型是解决这一问题的有效途径。材料数据的准确性直接影响了数据驱动的材料性能预测的可靠性。本研究针对机器学习应用过程中的数据预处理阶段, 提出了融合材料领域知识的数据准确性检测方法。该方法首先结合材料专家认知构建了材料领域知识库。然后, 将其与数据驱动的数据准确性检测方法结合, 从数据和领域知识两个角度对材料数据集进行基于描述符取值规则的单维度数据正确性检测、基于描述符相关性规则的多维度数据相关性检测以及基于多维相似样本识别策略的全维度数据可靠性检测。对于每一阶段识别出的异常数据, 结合材料领域知识进行修正, 并将领域知识融入到数据准确性检测方法的全过程以确保数据集从初始阶段就具有较高准确性。最后该方法在NASICON型固态电解质激活能预测数据集上的实验结果表明: 本研究提出的方法可以有效识别异常数据并进行合理修正。与原始数据集相比, 基于修正数据集的6种机器学习模型的预测精度都有不同程度的提升。其中, 在最优模型上R2提升了33%。  相似文献   

14.
基于CEM的高光谱图像小目标检测算法   总被引:1,自引:0,他引:1  
针对高光谱图像中小目标检测问题,提出了一种基于约束能量最小化(Constrained Energy Minimization,CEM)的目标检测算法.该算法首先对原始图像进行背景信息抑制从而抑制背景地物、突出低概率的小目标,用迭代误差分析的自动端元提取算法找出目标的端元光谱,然后把目标端元光谱代入CEM滤波器得到该目标的检测结果图.用高光谱数据进行了实验研究,并与CEM滤波器进行了比较.结果表明,其检测性能与直接采用CEM方法的检测性能相当,但是相对于CEM方法,该算法不需要目标的先验光谱信息,更具有实用性.  相似文献   

15.
夏懿  徐文学 《振动与冲击》2022,(3):45-53+81
针对轴承在不同工况条件下的振动数据存在分布差异,导致诊断精度低的问题,提出一种新的深度自适应网络用于跨域条件下的轴承故障诊断。采用傅里叶变换将原始时域振动信号转换为频域信号并通过深度特征提取器提取其分类特征;利用最大均值差异(maximize mean discrepancy,MMD)来进行深度特征的边缘分布对齐;利用Wasserstein度量网络将源域中有标签数据的类别结构与目标域中无标签数据的类别结构进行匹配,即对齐不同域的类别条件分布,使得故障数据在不同域的分布能够更好的对齐,从而提高模型在目标域未标签数据集上的分类准确率。试验利用凯斯西储大学公开的故障轴承数据集进行了两种跨域条件的模型迁移,验证了该网络在不同迁移场景中都具有较高的准确率,且优于其他深度自适应网络。  相似文献   

16.
为提升迁移学习在运动想象脑机接口应用过程中的迁移高效性及普适性,综合实例迁移和特征迁移学习方法的优势进而构建了混合迁移学习模型。首先,依据样本权重极化原理改进TrAdaBoost算法以实现实例层面的迁移,优化源域训练样本;其次,基于大间隔投射迁移支持向量机进一步缩短源域与目标域间的分布距离以完成特征层面的迁移,实现迁移效率最大化。进一步,将该方法应用于脑机接口竞赛Dataset IIb数据集进行离线测试及分析,研究结果表明混合迁移学习模型的迁移效率明显高于单一迁移学习模型,并且对于不同迁移对象识别准确率相对提升均值在70%以上,验证了所述方法的有效性与普适性。此外,基于已搭建的运动想象识别系统进行在线测试,验证了模型的实用性。  相似文献   

17.
Abstract

Through its Department of Defense (DoD) agencies, and outside contractors, the USA invests billions of dollars each year in military construction (MILCON) projects. Although construction management expertise is gained and significant amount of data are collected from past projects, completing projects on time remains a challenge. This article uses data from 466 MILCON projects to identify key factors that influence project duration and provide a new model to predict project time outcomes. The model generates accurate results and serves as a useful tool in the early phases of a project life cycle. Another key contribution of this study is the employed methodology, which includes the use of available data, targeting of relevant parameters, and development of the predictive model. The contributed methodology is applicable outside of the MILCON domain with the appropriate data set and by targeting the relevant influential factors to create models to predict time outcomes of future projects.  相似文献   

18.
Recently, sparse representation classification (SRC) and fisher discrimination dictionary learning (FDDL) methods have emerged as important methods for vehicle classification. In this paper, inspired by recent breakthroughs of discrimination dictionary learning approach and multi-task joint covariate selection, we focus on the problem of vehicle classification in real-world applications by formulating it as a multi-task joint sparse representation model based on fisher discrimination dictionary learning to merge the strength of multiple features among multiple sensors. To improve the classification accuracy in complex scenes, we develop a new method, called multi-task joint sparse representation classification based on fisher discrimination dictionary learning, for vehicle classification. In our proposed method, the acoustic and seismic sensor data sets are captured to measure the same physical event simultaneously by multiple heterogeneous sensors and the multi-dimensional frequency spectrum features of sensors data are extracted using Mel frequency cepstral coefficients (MFCC). Moreover, we extend our model to handle sparse environmental noise. We experimentally demonstrate the benefits of joint information fusion based on fisher discrimination dictionary learning from different sensors in vehicle classification tasks.  相似文献   

19.
With the continuous expansion of software scale, software update and maintenance have become more and more important. However, frequent software code updates will make the software more likely to introduce new defects. So how to predict the defects quickly and accurately on the software change has become an important problem for software developers. Current defect prediction methods often cannot reflect the feature information of the defect comprehensively, and the detection effect is not ideal enough. Therefore, we propose a novel defect prediction model named ITNB (Improved Transfer Naive Bayes) based on improved transfer Naive Bayesian algorithm in this paper, which mainly considers the following two aspects: (1) Considering that the edge data of the test set may affect the similarity calculation and final prediction result, we remove the edge data of the test set when calculating the data similarity between the training set and the test set; (2) Considering that each feature dimension has different effects on defect prediction, we construct the calculation formula of training data weight based on feature dimension weight and data gravity, and then calculate the prior probability and the conditional probability of training data from the weight information, so as to construct the weighted bayesian classifier for software defect prediction. To evaluate the performance of the ITNB model, we use six datasets from large open source projects, namely Bugzilla, Columba, Mozilla, JDT, Platform and PostgreSQL. We compare the ITNB model with the transfer Naive Bayesian (TNB) model. The experimental results show that our ITNB model can achieve better results than the TNB model in terms of accurary, precision and pd for within-project and cross-project defect prediction.  相似文献   

20.
The majority of big data analytics applied to transportation datasets suffer from being too domain-specific, that is, they draw conclusions for a dataset based on analytics on the same dataset. This makes models trained from one domain (e.g. taxi data) applies badly to a different domain (e.g. Uber data). To achieve accurate analyses on a new domain, substantial amounts of data must be available, which limits practical applications. To remedy this, we propose to use semi-supervised and active learning of big data to accomplish the domain adaptation task: Selectively choosing a small amount of datapoints from a new domain while achieving comparable performances to using all the datapoints. We choose the New York City (NYC) transportation data of taxi and Uber as our dataset, simulating different domains with 90% as the source data domain for training and the remaining 10% as the target data domain for evaluation. We propose semi-supervised and active learning strategies and apply it to the source domain for selecting datapoints. Experimental results show that our adaptation achieves a comparable performance of using all datapoints while using only a fraction of them, substantially reducing the amount of data required. Our approach has two major advantages: It can make accurate analytics and predictions when big datasets are not available, and even if big datasets are available, our approach chooses the most informative datapoints out of the dataset, making the process much more efficient without having to process huge amounts of data.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号