Similar Literature (20 results)
1.
A non-zero-approaching adaptive learning rate is proposed to guarantee the global convergence of Oja's principal component analysis (PCA) learning algorithm. Most existing adaptive learning rates for Oja's PCA algorithm are required to approach zero as the learning step increases, which is impractical in many applications due to computational round-off limitations and tracking requirements. The proposed adaptive learning rate overcomes this shortcoming: it converges to a positive constant, so the evolution rate does not vanish as the learning step increases. This contrasts with zero-approaching learning rates, which slow convergence considerably, and increasingly so over time. Rigorous mathematical proofs of the global convergence of Oja's algorithm with the proposed learning rate are given in detail by studying the convergence of an equivalent deterministic discrete-time (DDT) system. Extensive simulations illustrate and verify the derived theory, and the results show that this adaptive learning rate makes Oja's PCA algorithm well suited to online learning.
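To make the idea concrete, below is a minimal sketch of Oja's single-unit PCA rule driven by a learning rate that decays toward a positive constant instead of zero. The schedule eta_k = eta_inf + c/(k+1) is an illustrative assumption, not the paper's formula.

```python
import numpy as np

def oja_pca(X, eta_inf=0.01, c=0.5, seed=0):
    """Oja's single-unit PCA rule with a learning rate that converges
    to the positive constant eta_inf instead of zero.  The schedule
    eta_k = eta_inf + c/(k+1) is illustrative only."""
    rng = np.random.default_rng(seed)
    w = rng.standard_normal(X.shape[1])
    w /= np.linalg.norm(w)
    for k, x in enumerate(X):
        eta = eta_inf + c / (k + 1)      # non-zero-approaching rate
        y = x @ w
        w += eta * y * (x - y * w)       # Oja's update
    return w

# Usage: leading principal direction of correlated Gaussian data
X = np.random.default_rng(1).multivariate_normal(
    [0.0, 0.0], [[3.0, 1.0], [1.0, 1.0]], size=5000)
print(oja_pca(X))   # approximately the top eigenvector of the covariance
```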

2.
An adaptive learning algorithm for principal component analysis
Principal component analysis (PCA) is one of the most general-purpose feature extraction methods, and a variety of learning algorithms for PCA have been proposed. Many conventional algorithms, however, either diverge or converge very slowly if the learning rate parameters are not chosen properly. In this paper, an adaptive learning algorithm (ALA) for PCA is proposed. By adaptively selecting the learning rate parameters, the m weight vectors in the ALA are shown to converge to the first m principal component vectors at almost the same rate. Compared with Sanger's generalized Hebbian algorithm (GHA), the ALA can quickly find the desired principal component vectors in cases where the GHA fails to do so. Finally, simulation results are included to illustrate the effectiveness of the ALA.
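For reference, here is a minimal sketch of Sanger's GHA, the baseline the ALA is compared against. The constant learning rate below is illustrative, whereas the ALA's point is to select the rates adaptively.

```python
import numpy as np

def gha(X, m, eta=0.005, epochs=5, seed=0):
    """Sanger's generalized Hebbian algorithm (GHA) for the first m
    principal components, with a fixed illustrative learning rate."""
    n = X.shape[1]
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((m, n)) * 0.1   # rows -> component estimates
    for _ in range(epochs):
        for x in X:
            y = W @ x
            # lower-triangular term implements Gram-Schmidt-like deflation
            W += eta * (np.outer(y, x) - np.tril(np.outer(y, y)) @ W)
    return W

# Usage: first two principal directions of 3-D data with variances 9, 4, 0.25
X = np.random.default_rng(1).standard_normal((4000, 3)) @ np.diag([3.0, 2.0, 0.5])
print(gha(X, m=2))
```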

3.
Dynamics of Generalized PCA and MCA Learning Algorithms
Principal component analysis (PCA) and minor component analysis (MCA) are two important statistical tools with many applications in signal processing and data analysis. PCA and MCA neural networks (NNs) can be used to extract principal and minor components from input data online, and it is of interest to develop generalized learning algorithms for such networks. Some novel generalized PCA and MCA learning algorithms are proposed in this paper. Convergence of PCA and MCA learning algorithms is an essential issue in practical applications. Traditionally, convergence is studied via the deterministic continuous-time (DCT) method, which requires the learning rate to approach zero, a condition that is unrealistic in many practical applications. In this paper, the deterministic discrete-time (DDT) method is used instead to study the dynamical behavior of the proposed algorithms; it is more suitable for convergence analysis because it does not impose the DCT method's vanishing-rate constraint. It is proven that, under some mild conditions, the weight vector in the proposed algorithms converges exponentially to the principal or minor component. Simulation results further illustrate the theoretical findings.
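The Oja-type DDT recursion below illustrates the objects being analyzed: a fixed, non-vanishing learning rate and a sign that switches between principal and minor extraction. It is a classic stand-in, not the paper's generalized algorithms; the per-step renormalization is added because the bare minus-sign recursion is known to be norm-unstable.

```python
import numpy as np

def ddt_component(C, mode="pca", eta=0.05, steps=500, seed=0):
    """Deterministic discrete-time (DDT) recursion of an Oja-type rule,
    w <- w +/- eta*(C w - (w'Cw) w): '+' extracts the principal
    component, '-' the minor component.  The renormalization step is a
    simplification added here; the paper's algorithms differ."""
    rng = np.random.default_rng(seed)
    w = rng.standard_normal(C.shape[0])
    w /= np.linalg.norm(w)
    sign = 1.0 if mode == "pca" else -1.0
    for _ in range(steps):
        w = w + sign * eta * (C @ w - (w @ C @ w) * w)
        w /= np.linalg.norm(w)        # keep the iterate on the unit sphere
    return w

C = np.array([[4.0, 1.0], [1.0, 2.0]])   # correlation matrix of the input
print(ddt_component(C, "pca"))           # ~ eigenvector of the largest eigenvalue
print(ddt_component(C, "mca"))           # ~ eigenvector of the smallest eigenvalue
```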

4.
Minor component analysis (MCA) is a statistical method for extracting the eigenvector associated with the smallest eigenvalue of the covariance matrix of the input signals. Convergence is essential if MCA algorithms are to be used in practical applications. Traditionally, the convergence of MCA algorithms is analyzed indirectly via their corresponding deterministic continuous-time (DCT) systems. However, the DCT method requires the learning rate to approach zero, which is unreasonable in many applications due to round-off limitations and tracking requirements. This paper studies the convergence of the deterministic discrete-time (DDT) system associated with the OJAn MCA learning algorithm. Unlike the DCT method, the DDT method does not require the learning rate to approach zero. Some important convergence results are obtained for the OJAn MCA learning algorithm via the DDT method, and simulations are carried out to illustrate them.

5.
The convergence of Oja's principal component analysis (PCA) learning algorithms is difficult to study directly. Traditionally, their convergence is analyzed indirectly via certain deterministic continuous-time (DCT) systems, a method that requires the learning rate to converge to zero, which is not a reasonable requirement in many practical applications. Recently, deterministic discrete-time (DDT) systems have been proposed instead to interpret the dynamics of the learning algorithms. Unlike DCT systems, DDT systems allow the learning rate to be a (possibly nonzero) constant. This paper provides several important results on the convergence of a DDT system of Oja's PCA learning algorithm: 1) A number of invariant sets are obtained; any trajectory starting from a point in an invariant set remains in the set forever, so nondivergence of the trajectories is guaranteed. 2) The convergence of the DDT system is analyzed rigorously; it is proven that almost all trajectories starting in an invariant set converge exponentially to the unit eigenvector associated with the largest eigenvalue of the correlation matrix. Exponential convergence rates are also obtained, providing useful guidelines for selecting fast-converging learning rates. 3) Since trajectories may diverge, initial vectors must be chosen carefully; the paper suggests taking initial vectors on the unit hypersphere to guarantee convergence. 4) Simulation results illustrate the theoretical findings.

6.
The generalized eigenvector plays an essential role in signal processing. In this paper, we present a novel neural network learning algorithm for estimating the generalized eigenvector of a Hermitian matrix pencil. Unlike some traditional algorithms, which require proper learning-rate values to be selected before use, the proposed algorithm needs no learning rate and is therefore well suited to real applications. By analyzing all of the equilibrium points, it is proven that the algorithm converges if and only if the weight vector of the neural network equals the generalized eigenvector corresponding to the largest generalized eigenvalue of the Hermitian matrix pencil. Using the deterministic discrete-time (DDT) method, convergence conditions that hold with probability 1 are also obtained. Simulation results show that the proposed algorithm has a fast convergence speed and good numerical stability, and a real application demonstrates its effectiveness in tracking the optimal beamforming vector.
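As a reference point for what the network estimates, the sketch below computes the largest generalized eigenvector of a pencil (A, B) by plain power iteration on B^{-1}A, assuming B is positive definite. It is a conventional baseline for checking results, not the paper's learning-rate-free neural algorithm.

```python
import numpy as np

def largest_generalized_eigvec(A, B, steps=200, seed=0):
    """Power iteration on B^{-1}A: a generic baseline for the largest
    generalized eigenvector of the Hermitian pencil (A, B), with B
    assumed positive definite.  NOT the paper's neural algorithm."""
    rng = np.random.default_rng(seed)
    w = rng.standard_normal(A.shape[0]) + 1j * rng.standard_normal(A.shape[0])
    for _ in range(steps):
        w = np.linalg.solve(B, A @ w)    # w <- B^{-1} A w
        w /= np.linalg.norm(w)
    lam = (w.conj() @ A @ w) / (w.conj() @ B @ w)   # Rayleigh quotient
    return w, lam.real

A = np.array([[3.0, 1.0], [1.0, 2.0]])
B = np.diag([2.0, 1.0])
w, lam = largest_generalized_eigvec(A, B)
print(lam)   # largest generalized eigenvalue of (A, B), here 2.5
```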

7.
A principal component analysis (PCA) neural network is developed for the online extraction of multiple minor directions of an input signal. The network extracts the multiple minor directions in parallel by computing the principal directions of a transformed input signal, so the stability-speed problem of computing the minor directions directly is avoided to a certain extent. Moreover, the learning algorithms for updating the network weights use constant learning rates, which overcomes the shortcoming of learning rates that approach zero. In addition, the proposed algorithms are globally convergent, so choosing the initial values of the learning parameters is simple. This paper presents the convergence analysis of the proposed algorithms by studying the corresponding deterministic discrete-time (DDT) equations; rigorous mathematical proof of global convergence is given, and the theoretical results are further confirmed by simulations.
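The transformation trick this relies on can be sketched offline: shift the covariance so that minor directions of C become principal directions of C' = sigma*I - C. The batch eigendecomposition below only illustrates the swap; the paper's online constant-learning-rate updates are not reproduced.

```python
import numpy as np

def minor_directions_via_pca(X, m, sigma=None):
    """Extract m minor directions of X as the principal directions of a
    transformed covariance C' = sigma*I - C (sigma > lambda_max), so a
    stable PCA routine can do MCA's job.  A batch sketch of the idea
    only; the paper's own transform and updates are not reproduced."""
    C = np.cov(X, rowvar=False)
    if sigma is None:
        sigma = np.trace(C) + 1.0           # safe upper bound on lambda_max
    C_t = sigma * np.eye(C.shape[0]) - C    # minor <-> principal swap
    vals, vecs = np.linalg.eigh(C_t)
    return vecs[:, -m:]                     # top directions of C_t = minor of C
```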

8.
This paper proposes a linear neural network for principal component analysis whose weight vector lengths converge to the variances of the principal components of the input data. The network breaks the symmetry of its learning process through the differences in weight vector lengths and, unlike other linear neural networks described in the literature, does not need to assume any asymmetries in its structure to extract the principal components. We prove the asymptotic stability of a stationary solution of the network's learning equation, and simulations show that the weight vectors converge to this solution. Comparisons of convergence speed show that in the simulations the proposed network is about as fast as Sanger's generalized Hebbian algorithm (GHA) network, the weighted subspace rule network of Oja et al., and Xu's LMSER network (weighted linear version).

9.
Design and application of a deep belief network with an adaptive learning rate
To address the long pre-training time of deep belief networks (DBN), a DBN with an adaptive learning rate (adaptive learning rate DBN, ALRDBN) is proposed. ALRDBN introduces an adaptive learning rate into the contrastive divergence (CD) algorithm, improving the convergence speed of CD by automatically adjusting the learning step size. A weight-training method based on the adaptive learning rate is then designed, and the range of the learning-rate adjustment coefficient is obtained through an analysis of network performance. Finally, the designed ALRDBN is tested in a series of experiments; the simulation results show that ALRDBN converges faster and its prediction accuracy is also improved.
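A minimal sketch of the idea follows: CD-1 training of a single RBM layer where the learning rate grows while successive gradients agree in direction and shrinks when they flip. The sign-agreement heuristic and the omission of bias terms are simplifying assumptions; ALRDBN's exact adjustment rule and coefficient range are not reproduced.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cd1_adaptive(V, n_hidden, epochs=20, eta=0.1, up=1.2, down=0.5, seed=0):
    """CD-1 training of one RBM layer (biases omitted for brevity) with a
    heuristic adaptive learning rate: grow eta while successive weight
    gradients agree in overall direction, shrink it when they flip."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((V.shape[1], n_hidden)) * 0.01
    prev = None
    for _ in range(epochs):
        ph = sigmoid(V @ W)                        # positive phase
        h = (rng.random(ph.shape) < ph).astype(float)
        v1 = sigmoid(h @ W.T)                      # reconstruction
        ph1 = sigmoid(v1 @ W)
        grad = (V.T @ ph - v1.T @ ph1) / len(V)    # CD-1 gradient estimate
        if prev is not None:                       # adapt the step size
            eta = min(eta * up, 0.5) if np.sum(grad * prev) > 0 else max(eta * down, 1e-3)
        W += eta * grad
        prev = grad
    return W

V = (np.random.default_rng(1).random((100, 6)) > 0.5).astype(float)
W = cd1_adaptive(V, n_hidden=4)
```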

10.
The slow convergence of the back-propagation neural network (BPNN) has become a challenge in data mining and knowledge discovery applications, owing to the drawbacks of the gradient descent (GD) optimization method widely adopted in BPNN learning. To address this problem, standard optimization techniques such as the conjugate-gradient and Newton methods have been proposed to improve the convergence rate of the BP learning algorithm. This paper presents a heuristic method that adds an adaptive smoothing momentum term to the original BP learning algorithm to speed up convergence. In this improved BP learning algorithm, an adaptive smoothing technique automatically adjusts the momentum of the weight-update formula in terms of the "3σ limits" theory. With the adaptive smoothing momentum term, the improved algorithm trains and converges faster, and generalizes better, than the standard BP learning algorithm. To verify its effectiveness, three typical foreign exchange rates, the British pound (GBP), euro (EUR), and Japanese yen (JPY), are chosen as forecasting targets. Experimental results from homogeneous algorithm comparisons show that the proposed BP learning algorithm outperforms comparable BP algorithms in performance and convergence rate, and empirical results from heterogeneous model comparisons further confirm its effectiveness.
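A hedged sketch of the mechanism: gradient descent with a momentum coefficient that is cut whenever the current gradient norm falls outside the 3σ band of its running history. This is a loose reading of the paper's smoothing rule, for illustration only; the exact adjustment formula is not reproduced.

```python
import numpy as np

def gd_adaptive_momentum(grad_fn, w, eta=0.05, steps=500, alpha=0.9):
    """Gradient descent whose momentum coefficient is damped whenever the
    current gradient norm leaves the 3-sigma band of its history, a loose
    reading of the paper's '3-sigma limits' smoothing idea."""
    v = np.zeros_like(w)
    norms = []
    for _ in range(steps):
        g = grad_fn(w)
        gn = np.linalg.norm(g)
        if len(norms) > 10:
            mu, sd = np.mean(norms), np.std(norms) + 1e-12
            beta = alpha if abs(gn - mu) <= 3 * sd else 0.0  # cut momentum on outliers
        else:
            beta = alpha
        norms.append(gn)
        v = beta * v - eta * g
        w = w + v
    return w

# Usage on a toy quadratic: minimize ||Aw - b||^2
A = np.array([[3.0, 1.0], [1.0, 2.0]]); b = np.array([1.0, -1.0])
w_opt = gd_adaptive_momentum(lambda w: 2 * A.T @ (A @ w - b), np.zeros(2))
print(w_opt, np.linalg.solve(A, b))   # the two should agree
```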

11.
In this paper, a general method for the numerical solution of maximum-likelihood estimation (MLE) problems is presented. It adopts the deterministic learning (DL) approach to find close approximations to the ML estimator functions for the unknown parameters of any given density. The method relies on the choice of a proper neural network and on the deterministic generation of samples of observations of the likelihood function, thus avoiding the problem of generating samples from the unknown density. Under mild assumptions, consistency and convergence to the true ML estimator function at favorable rates can be proved. Simulation results show that the algorithm behaves well compared with the corresponding exact solutions.
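To illustrate the flavor of the approach, the sketch below fits a simple polynomial map from a sample summary statistic to the parameter, trained on data simulated over a parameter grid. It is a crude stand-in for the paper's neural approximation of the ML estimator function; all names and the choice of summary statistic are illustrative.

```python
import numpy as np

def fit_ml_estimator(theta_grid, simulate, summary, degree=3, n_rep=200, seed=0):
    """Fit a polynomial map from a scalar sample summary to the parameter,
    a crude stand-in for the paper's neural approximation of the ML
    estimator function.  'simulate' draws a sample for a given theta;
    'summary' compresses it to one feature.  All names are illustrative."""
    rng = np.random.default_rng(seed)
    xs, ys = [], []
    for theta in theta_grid:
        for _ in range(n_rep):
            xs.append(summary(simulate(theta, rng)))
            ys.append(theta)
    coeffs = np.polyfit(xs, ys, degree)      # least-squares fit of the map
    return lambda sample: np.polyval(coeffs, summary(sample))

# Usage: learn the ML estimator of an exponential rate (true MLE: 1/mean)
est = fit_ml_estimator(
    theta_grid=np.linspace(0.5, 3.0, 20),
    simulate=lambda th, rng: rng.exponential(1.0 / th, size=50),
    summary=lambda s: 1.0 / np.mean(s))
sample = np.random.default_rng(1).exponential(1.0 / 2.0, size=50)
print(est(sample), 1.0 / np.mean(sample))   # both near the true rate 2.0
```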

12.
For joint state-parameter estimation in linear time-varying (LTV) multiple-input multiple-output (MIMO) systems, an approach to the design of adaptive observers is proposed that is conceptually simple and computationally efficient. Its global exponential convergence is established for noise-free systems. In the presence of noise, it is proved that the estimation errors are bounded and converge in the mean to zero if the noises are bounded and have zero mean. Potential applications include fault detection and isolation, and adaptive control.

13.
To address the difficulty that a general-purpose BP network converges poorly when trained on high-dimensional, large-volume data, constrained clustering is introduced on top of momentum terms and adaptive learning-rate adjustment to build an ensemble neural network, improving training speed and diagnostic performance. First, a constrained clustering algorithm partitions the training set into several sub-sample sets of comparable size, and a corresponding sub-network is trained on each. In addition, during diagnosis, a membership factor of the test data with respect to each sub-training set is used alongside the outputs of the sub-networks. Finally, the feasibility of the algorithm is verified on a real circuit-board BP-network diagnosis case with 25-dimensional sampled data and 38 fault classes.
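A minimal sketch of the membership-weighted fusion step follows: each sub-network's output is weighted by the sample's membership in that sub-network's training subset, here computed as a softmax over distances to subset centroids. Both the softmax form and the stub sub-networks are illustrative assumptions; the paper's clustering and training details are not reproduced.

```python
import numpy as np

def membership(x, centroids, tau=1.0):
    """Membership of a sample in each sub-training set, via a softmax
    over negative distances to subset centroids (an illustrative reading
    of the paper's membership factor, not its exact formula)."""
    d = np.linalg.norm(centroids - x, axis=1)
    e = np.exp(-d / tau)
    return e / e.sum()

def ensemble_diagnose(x, subnets, centroids):
    """Fuse sub-network outputs, weighting each by the sample's
    membership in that sub-network's training subset."""
    m = membership(x, centroids)
    outputs = np.stack([net(x) for net in subnets])   # one row per subnet
    return m @ outputs                                # weighted class scores

# Usage with two stub 'subnets' (stand-ins for trained BP networks)
centroids = np.array([[0.0, 0.0], [5.0, 5.0]])
subnets = [lambda x: np.array([0.9, 0.1]), lambda x: np.array([0.2, 0.8])]
print(ensemble_diagnose(np.array([0.5, 0.2]), subnets, centroids))
```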

14.
The conventional back-propagation algorithm is basically a gradient-descent method and suffers from local minima and slow convergence. A new generalized back-propagation algorithm is introduced that can effectively speed up the convergence rate and reduce the chance of being trapped in local minima. The new algorithm changes the derivative of the activation function so as to magnify the backward-propagated error signal; thus the convergence rate is accelerated and local minima can be escaped. In this letter, we also investigate the convergence of the generalized back-propagation algorithm with a constant learning rate. The weight sequences of the algorithm can be approximated by a certain ordinary differential equation (ODE): as the learning rate tends to zero, the interpolated weight sequences of generalized back-propagation converge weakly to the solution of the associated ODE.
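One simple instance of "changing the derivative of the activation function" is to add a constant offset to the sigmoid derivative in the backward pass while leaving the forward pass unchanged, as in the XOR sketch below. The offset trick is an assumption for illustration; the paper's exact modification may differ.

```python
import numpy as np

def train_xor(offset=0.1, eta=0.5, epochs=8000, seed=0):
    """One-hidden-layer BP where each sigmoid derivative gets a constant
    offset in the backward pass (forward pass unchanged), one simple way
    to magnify the back-propagated error signal."""
    rng = np.random.default_rng(seed)
    X = np.array([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
    T = np.array([[0.], [1.], [1.], [0.]])
    W1, b1 = rng.standard_normal((2, 4)), np.zeros(4)
    W2, b2 = rng.standard_normal((4, 1)), np.zeros(1)
    sig = lambda z: 1 / (1 + np.exp(-z))
    for _ in range(epochs):
        H = sig(X @ W1 + b1)
        Y = sig(H @ W2 + b2)
        dY = (T - Y) * (Y * (1 - Y) + offset)     # magnified output derivative
        dH = (dY @ W2.T) * (H * (1 - H) + offset) # magnified hidden derivative
        W2 += eta * H.T @ dY; b2 += eta * dY.sum(0)
        W1 += eta * X.T @ dH; b1 += eta * dH.sum(0)
    return Y

print(train_xor().round(2))   # approaches [[0], [1], [1], [0]]
```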

15.
For industrial-robot trajectory-tracking control with arbitrary initial iteration values, a nonlinear iterative learning control algorithm based on a sliding-mode surface is proposed so that the robot trajectory tracks the desired trajectory quickly and accurately. Based on the principle of finite-time convergence, an iterative sliding-mode surface of the robot's tracking error is constructed; within this surface, the tracking error converges to zero in a prescribed time. An iterative learning control law based on the sliding-mode surface is then designed, and it is proved that, as the number of iterations increases, trajectories starting from arbitrary initial states converge uniformly to the sliding-mode surface, which solves the arbitrary-initial-value problem in iterative learning. Numerical simulations verify the effectiveness and disturbance-rejection capability of the algorithm.
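A toy sketch of the scheme on a first-order plant follows: each trial's control is updated in the iteration domain by the sliding variable s = de/dt + lambda*e of the previous trial's tracking error. The plant, gains, and discretization are illustrative assumptions; the robot-arm dynamics and the finite-time analysis are not reproduced.

```python
import numpy as np

def ilc_sliding(iters=30, T=200, dt=0.01, a=1.0, b=1.0, lam=5.0, beta=0.8):
    """Iterative learning control with a sliding-variable update,
    u_{j+1} = u_j + beta * s_j with s = de/dt + lam*e, on the toy plant
    xdot = -a x + b u.  A stand-in for the paper's robot-arm scheme."""
    t = np.arange(T) * dt
    xd = np.sin(2 * np.pi * t)                 # desired trajectory
    u = np.zeros(T)
    x = np.zeros(T)
    for _ in range(iters):
        x = np.zeros(T)
        for k in range(T - 1):                 # simulate one trial
            x[k + 1] = x[k] + dt * (-a * x[k] + b * u[k])
        e = xd - x
        s = np.gradient(e, dt) + lam * e       # sliding variable of the error
        u = u + beta * s                       # iteration-domain learning law
    return np.max(np.abs(xd - x))

print(ilc_sliding())   # tracking error shrinks over the iterations
```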

16.

In this paper, an adaptive iterative learning controller (AILC) with an input learning technique is presented for uncertain multi-input multi-output (MIMO) nonlinear systems in the normal form. The proposed AILC learns the internal parameters of the state equation as well as the input gain parameter, and also estimates the desired input using an input learning rule in order to track the whole history of the command trajectory. The features of the proposed control scheme can be summarized as follows: 1) to the best of the authors' knowledge, an AILC with input learning is developed for the first time for uncertain MIMO nonlinear systems in the normal form; 2) the convergence of the learning input error is ensured; 3) the input learning rule is simple and can therefore be easily implemented in industrial applications. With the proposed AILC scheme, the tracking error and the desired input error converge to zero as the learning operation is repeated. Single-link and two-link manipulators are presented as simulation examples to confirm the feasibility and performance of the proposed AILC.


17.
18.
This paper investigates new learning algorithms (LF I and LF II) based on Lyapunov functions for the training of feedforward neural networks. Such algorithms have an interesting parallel with the popular backpropagation (BP) algorithm, with the fixed learning rate replaced by an adaptive learning rate computed using a convergence theorem based on Lyapunov stability theory. LF II, a modified version of LF I, is introduced with the aim of avoiding local minima; this modification also improves the convergence speed in some cases. Conditions for achieving the global minimum with this kind of algorithm are studied in detail. The performance of the proposed algorithms is compared with the BP algorithm and extended Kalman filtering (EKF) on three benchmark function-approximation problems: XOR, 3-bit parity, and the 8-3 encoder. The comparisons are made in terms of the number of learning iterations and the computational time required for convergence; the proposed algorithms (LF I and II) are found to converge much faster than the other two algorithms at the same accuracy. Finally, a comparison is made on a complex two-dimensional (2-D) Gabor function, and the effect of the adaptive learning rate on faster convergence is verified. In a nutshell, these investigations help us better understand the learning procedure of feedforward neural networks in terms of adaptive learning rates, convergence speed, and local minima.
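The spirit of a Lyapunov-derived adaptive rate can be sketched as follows: with V = e^2/2 and the update dw = eta*e*grad_y, choosing eta = mu/(eps + ||grad_y||^2) makes the error shrink by roughly the factor (1 - mu) per step to first order, so V decreases. This normalized form is an assumption for illustration; the exact LF I/II formulas differ.

```python
import numpy as np

def lf_train(predict_grad, w, data, mu=0.5, eps=1e-8, epochs=50):
    """Training with a Lyapunov-motivated adaptive rate: for V = e^2/2
    and update dw = eta*e*grad_y, eta = mu/(eps + ||grad_y||^2) gives
    e_new ~ (1 - mu)*e to first order, so V shrinks every step.  Only
    the spirit of the LF algorithms; their exact formulas differ."""
    for _ in range(epochs):
        for x, d in data:
            y, g = predict_grad(w, x)    # model output and dy/dw
            e = d - y
            eta = mu / (eps + g @ g)     # adaptive, stability-preserving rate
            w = w + eta * e * g          # descent direction for V
    return w

# Usage on a linear model y = w.x (gradient dy/dw = x)
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 3)); w_true = np.array([1.0, -2.0, 0.5])
data = [(x, x @ w_true) for x in X]
print(lf_train(lambda w, x: (w @ x, x), np.zeros(3), data))   # close to w_true
```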

19.
Kernel-based algorithms have proven successful in many nonlinear modeling applications. However, the computational complexity of classical kernel-based methods grows superlinearly with the number of training samples, which is too expensive for online applications. To solve this problem, the paper presents an information-theoretic method for training a sparse version of a kernel learning algorithm. A quantity named the instantaneous mutual information is investigated to measure the reliability of the estimated output. This measure serves as a criterion for judging the novelty of each training sample, and the informative samples are selected to form a compact dictionary that represents the whole data set. Furthermore, we propose a robust learning scheme with an adaptive learning rate, which ensures convergence of the learning algorithm and makes it reach steady state faster. We illustrate the performance of the proposed algorithm and compare it with some recent kernel algorithms in several experiments.
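A minimal sketch of the sparsification idea follows, using online kernel LMS with a dictionary that grows only when a sample is judged novel. Novelty is gauged here by the prediction-error magnitude, a simple surrogate for the paper's instantaneous-mutual-information criterion, which is not reproduced.

```python
import numpy as np

def klms_sparse(X, y, eta=0.5, gamma=1.0, delta=0.15):
    """Online kernel LMS with a sparse dictionary: a sample joins the
    dictionary only if it is sufficiently novel.  Error magnitude is a
    simple surrogate for the paper's mutual-information criterion."""
    k = lambda a, b: np.exp(-gamma * np.sum((a - b) ** 2))  # Gaussian kernel
    dict_x, alpha = [], []
    for x, d in zip(X, y):
        f = sum(a * k(x, c) for a, c in zip(alpha, dict_x))  # current estimate
        err = d - f
        if abs(err) > delta:           # novel/informative -> expand dictionary
            dict_x.append(x)
            alpha.append(eta * err)
    return dict_x, alpha

# Usage: learn y = sin(3x) online, keeping only the informative samples
rng = np.random.default_rng(0)
X = rng.uniform(-2, 2, size=(400, 1))
D, A = klms_sparse(X, np.sin(3 * X[:, 0]))
print(len(D), "dictionary atoms out of", len(X), "samples")
```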

20.
Cho, Siu-Yeung; Chi, Zheru; Wang, Zhiyong; Siu, Wan-Chi. Neural Processing Letters, 2003, 17(2): 175-190.
Many researchers have explored the use of neural network models for the adaptive processing of data structures. The learning formulation for one such model is known as the backpropagation through structure (BPTS) algorithm. The main limitations of the BPTS algorithm are slow convergence and the long-term dependency problem. In this Letter, a novel heuristic algorithm is proposed; its idea is to optimize the free parameters of the node representation in the data structure using a hybrid learning algorithm. The encouraging results achieved demonstrate that the proposed algorithm outperforms the BPTS algorithm.
