Similar documents
20 similar documents retrieved (search time: 15 ms)
1.
A novel supervised learning method is proposed by combining linear discriminant functions with neural networks. The proposed method results in a tree-structured hybrid architecture. Owing to constructive learning, the binary-tree hierarchical architecture is generated automatically by a controlled growing process for a specific supervised learning task. Unlike the classic decision tree, the linear discriminant functions are employed only at the intermediate levels of the tree, heuristically partitioning a large and complicated task into several smaller and simpler subtasks. These subtasks are handled by component neural networks at the leaves of the tree. For constructive learning, growing and credit-assignment algorithms are developed to support the hybrid architecture. The proposed architecture provides an efficient way to apply existing neural networks (e.g. the multilayer perceptron) to large-scale problems. We applied the proposed method to a universal approximation problem and several benchmark classification problems in order to evaluate its performance. Simulation results show that the proposed method yields better results and faster training than the multilayer perceptron.
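The divide-and-conquer principle behind this hybrid tree can be sketched in a few lines. The example below is a minimal illustration rather than the paper's algorithm: the root "discriminant" is just a threshold on one coordinate, and the component networks at the leaves are replaced by regularized linear units. It shows that one heuristic split turns an XOR-like task, which defeats any single linear model, into two linearly separable subtasks.

```python
import numpy as np

rng = np.random.default_rng(0)

# XOR-like toy task: each class occupies two opposite corners, so no single
# linear model solves it, but one linear split plus two linear leaves does.
centers = [(0, 0), (4, 4), (0, 4), (4, 0)]
X = np.vstack([rng.normal(c, 0.3, (50, 2)) for c in centers])
y = np.array([0] * 100 + [1] * 100)

# Root node: a heuristic linear discriminant used only to partition the
# task (here a simple threshold on the first coordinate), not to classify.
thresh = X[:, 0].mean()
side = X[:, 0] > thresh

def fit_leaf(Xs, ys):
    """Stand-in for a component neural network: a regularized linear unit."""
    A = np.hstack([Xs, np.ones((len(Xs), 1))])
    return np.linalg.solve(A.T @ A + 1e-6 * np.eye(3), A.T @ (2 * ys - 1))

leaves = {s: fit_leaf(X[side == s], y[side == s]) for s in (False, True)}

def predict(x):
    leaf = leaves[bool(x[0] > thresh)]          # route down the tree
    return int(np.hstack([x, 1.0]) @ leaf > 0)

acc = np.mean([predict(x) == t for x, t in zip(X, y)])

# A single global linear unit fails on the same task:
w_all = fit_leaf(X, y)
acc_global = np.mean((np.hstack([X, np.ones((len(X), 1))]) @ w_all > 0).astype(int) == y)
```

The leaf models here are least-squares stand-ins only; the paper's point is that any trainable component network (e.g. a small MLP) can be dropped in at the leaves.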

2.
As users navigate through online document collections on high-volume Web servers, they depend on good recommendations. We present a novel maximum-entropy algorithm for generating accurate recommendations and a data-clustering approach for speeding up model training. Recommender systems attempt to automate the process of "word of mouth" recommendations within a community. Typical application environments such as online shops and search engines have many dynamic aspects.
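With indicator features, a conditional maximum-entropy model of "next page given current page" reduces to multinomial logistic regression and can be fit by plain gradient ascent on the log-likelihood. The transition data, page count, learning rate, and iteration count below are made-up illustrations, not the paper's model:

```python
import numpy as np

# Toy click transitions (current page -> next page) on a 3-page site.
transitions = [(0, 1), (0, 1), (0, 2), (1, 2), (1, 2), (1, 2), (2, 0)]
n_pages = 3

# Conditional maximum-entropy model p(next|cur) proportional to exp(W[cur, next]),
# trained by gradient ascent on the log-likelihood (observed - expected counts).
W = np.zeros((n_pages, n_pages))
X = np.array([c for c, _ in transitions])
Y = np.array([n for _, n in transitions])
for _ in range(500):
    logits = W[X]                                    # (n_obs, n_pages)
    p = np.exp(logits - logits.max(1, keepdims=True))
    p /= p.sum(1, keepdims=True)
    grad = np.zeros_like(W)
    np.add.at(grad, (X, Y), 1.0)                     # observed counts
    for c in range(n_pages):
        grad[c] -= p[X == c].sum(0)                  # expected counts
    W += 0.5 * grad / len(transitions)

probs = np.exp(W - W.max(1, keepdims=True))
probs /= probs.sum(1, keepdims=True)
recommend = probs.argmax(1)      # top recommendation from each page
```

With one-hot features the maximum-entropy fit converges to the empirical conditional transition frequencies, so the top recommendation from each page is simply its most frequent successor.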

3.
In this letter, two batch-map extensions are described for the kernel-based maximum entropy learning rule (kMER). In the first, the weights are iteratively set to weighted component-wise medians, while in the second the generalized median is used, enabling kMER to process symbolic data. Simulations are performed to illustrate the extensions.
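The core operation of the first extension is the weighted component-wise median, which is what makes the batch update robust to outliers. A minimal sketch follows, with uniform neighborhood weights standing in for kMER's activation-derived weights (an assumption; the letter defines the actual weighting):

```python
import numpy as np

def weighted_median(values, weights):
    """Weighted median of a 1-D array: smallest value whose cumulative
    weight reaches half of the total weight."""
    order = np.argsort(values)
    v, w = values[order], weights[order]
    cdf = np.cumsum(w) / w.sum()
    return v[np.searchsorted(cdf, 0.5)]

rng = np.random.default_rng(1)
data = rng.normal(0.0, 1.0, (200, 2))
data[:10] += 50.0                        # gross outliers

# Batch-style update: a unit's weight vector is set to the component-wise
# weighted median of the data (uniform weights here for illustration).
nbh = np.ones(len(data))
unit = np.array([weighted_median(data[:, d], nbh) for d in range(2)])
```

Unlike a mean-based batch update, the median-based unit stays near the bulk of the data despite the shifted outliers.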

4.
Email classification is the process of assigning a category label to an email, under a given classification scheme, according to the email's content and attributes. The maximum entropy model is applied to email classification. The email preprocessing procedure is presented, header features are introduced, and the effects on classification results of the number of features, the number of iterations, and the email fields used as features are analyzed and compared; hierarchical and flat classification are also compared. Experiments show that 2,000 features and 250 iterations are appropriate choices. Making full use of the information in all email fields yields the best overall classification, but for legitimate email the best results are obtained from the header and subject alone, a finding confirmed under hierarchical classification; hierarchical classification outperforms flat classification. A summary and outlook conclude the paper.

5.
A component-based approach to visual object recognition rooted in supervised learning allows for a vision system that is more robust against changes in an object's pose or illumination. Learning figures prominently in the study of visual systems from the viewpoints of visual neuroscience and computer vision. Whereas visual neuroscience concentrates on mechanisms that let the cortex adapt its circuitry and learn a new task, computer vision aims at devising effectively trainable systems. Vision systems that learn and adapt are one of the most important trends in computer vision research. They might offer the only solution to developing robust, reusable vision systems.

6.
A generalization of the nonparametric linear rank statistics is presented to handle the two-group comparison with multiple events. For a sample divided into two groups, in which each subject may experience two or more distinct failures, the logrank tests are extended to test the null hypothesis that the vector of the marginal survival distributions of the first group equals that of the second group. Two cases are distinguished depending on whether the null hypothesis does or does not imply the equality of the joint survival functions. In both cases, under the null hypothesis, the asymptotic joint distribution of the vector of the marginal statistics is shown to be Gaussian with a covariance matrix consistently estimated using martingale properties. These theoretical results are illustrated by a simulation study and an application to the German Breast Cancer data. An extension to multiple hypotheses testing in multivariate proportional hazards models is also developed.
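For a single event type, the building block being generalized is the classical two-group logrank statistic; the multiple-event extension stacks one such marginal statistic per failure type and estimates their joint covariance. A sketch of the single-event building block on simulated data (the group sizes, hazards, and absence of censoring are illustrative assumptions):

```python
import numpy as np

def logrank(time, event, group):
    """Two-group logrank: sum over event times of (observed - expected)
    events in group 1, standardized by the hypergeometric variance."""
    o_minus_e, var = 0.0, 0.0
    for t in np.unique(time[event == 1]):
        at_risk = time >= t
        d = int(np.sum((time == t) & (event == 1)))                 # events at t
        d1 = int(np.sum((time == t) & (event == 1) & (group == 1)))
        n, n1 = int(at_risk.sum()), int((at_risk & (group == 1)).sum())
        o_minus_e += d1 - d * n1 / n
        if n > 1:
            var += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    return o_minus_e / np.sqrt(var)   # approximately N(0, 1) under H0

rng = np.random.default_rng(2)
g = np.repeat([0, 1], 100)
# Group 1 has systematically longer survival times (hazard ratio 0.5).
t = np.where(g == 1, rng.exponential(2.0, 200), rng.exponential(1.0, 200))
e = np.ones(200, dtype=int)           # no censoring in this toy example
z = logrank(t, e, g)
```

With longer survival in group 1, fewer events than expected occur in that group at early times, so the standardized statistic is strongly negative.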

7.
Consideration was given to the problem of interpolation (smoothing) of the nonobservable component of a composite Markov process within the framework of the conditional Markov scheme. For dynamic observation models such as autoregression, equations were derived for the a posteriori interpolation probability density of the state of the nonobservable component. The aim of the present paper was to construct a smoothing algorithm for an unknown family of distributions of the nonobservable component of a partially observable random Markov sequence. The result was obtained for strictly stationary random Markov processes with mixing, with the conditional densities in the observation model belonging to the exponential family of distributions. Computer-aided modeling within the framework of the Kalman scheme demonstrated that the sample root-mean-square error of the nonparametric smoothing algorithm, constructed for an unknown state equation, lies between the errors of optimal linear filtering and optimal linear interpolation.

8.
Multimedia Tools and Applications - Recognizing a person’s affective state from audio-visual signals is an essential capability for intelligent interaction. Insufficient training data and the...

9.
When modeling the parameters of many intelligent systems, users often face a scarcity of modeling samples. To address Bayesian network (BN) parameter modeling under small data sets, a constrained-data maximum entropy BN parameter learning algorithm (CDME) is proposed. The BN parameters are first estimated from the small data set; qualitative expert experience is then converted into inequality constraints, and the Bootstrap algorithm generates a set of candidate parameter sets satisfying the constraints, which are finally combined with maximum-entropy-based weights to compute the BN parameters. Experimental results show that when data are sufficient, the learning accuracy of CDME approximates that of the classic MLE algorithm, confirming the algorithm's correctness; under small data sets, CDME can model BN parameters with accuracy superior to both MLE and QMAP. Using CDME, diagnostic BN model parameters were obtained under the relatively scarce sample data of a real fault-diagnosis task, and the diagnostic inference based on these parameters further confirms the algorithm's effectiveness, providing a new approach to parameter modeling under small data sets.
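The CDME pipeline described above (bootstrap candidate parameters, filter by expert inequality constraints, combine with entropy-based weights) can be sketched for a single conditional-probability entry. The constraint bounds and the particular weighting used here, namely weighting each candidate by the entropy of the Bernoulli distribution it defines, are illustrative assumptions rather than the paper's exact formulation:

```python
import numpy as np

rng = np.random.default_rng(3)

# Tiny data set for one CPT entry P(X=1): only 5 observations.
data = np.array([1, 1, 0, 1, 1])

# Expert knowledge as a hypothetical inequality constraint: 0.6 <= p <= 0.9.
lo, hi = 0.6, 0.9

# Bootstrap a candidate set and keep only constraint-satisfying estimates.
cands = []
while len(cands) < 200:
    p = rng.choice(data, size=len(data), replace=True).mean()
    if lo <= p <= hi:
        cands.append(p)
cands = np.array(cands)

# Weight each candidate by the entropy of the Bernoulli it defines, so the
# combined estimate leans toward the least-committal admissible parameters.
H = -(cands * np.log(cands) + (1 - cands) * np.log(1 - cands))
p_hat = np.sum(H * cands) / H.sum()
```

The combined estimate stays inside the expert-feasible region while being pulled toward the candidates of highest entropy, which is the qualitative behavior the abstract describes.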

10.
This paper presents the modelling possibilities of kernel-based approaches to a complex real-world problem, i.e. corporate and municipal credit rating classification. Based on a model design that includes data pre-processing, the labelling of individual parameter vectors using expert knowledge, the design of various support vector machines with supervised learning as well as kernel-based approaches with semi-supervised learning, this modelling is undertaken in order to classify objects into rating classes. The results show that the rating classes assigned to bond issuers can be classified with high classification accuracy using a limited subset of input variables. This holds true for kernel-based approaches with both supervised and semi-supervised learning.

11.
A model is introduced for continuous-time dynamic feedback neural networks with supervised learning ability. Modifications are introduced to conventional models to guarantee precisely that a given desired vector, and its negative, are indeed stored in the network as asymptotically stable equilibrium points. The modifications entail that the output signal of a neuron is multiplied by the square of its associated weight to supply the signal to an input of another neuron. A simulation of the complete dynamics is then presented for a prototype one neuron with self-feedback and supervised learning; the simulation illustrates the (supervised) learning capability of the network.

12.
A neural fuzzy system with fuzzy supervised learning
A neural fuzzy system learning with fuzzy training data (fuzzy if-then rules) is proposed in this paper. This system is able to process and learn numerical information as well as linguistic information. First, we propose a five-layered neural network for the connectionist realization of a fuzzy inference system. The connectionist structure can house fuzzy logic rules and membership functions for fuzzy inference. We use alpha-level sets of fuzzy numbers to represent linguistic information. The inputs, outputs, and weights of the proposed network can be fuzzy numbers of any shape. Furthermore, they can be a hybrid of fuzzy numbers and crisp numerical values through the use of fuzzy singletons. Based on interval arithmetic, a fuzzy supervised learning algorithm is developed for the proposed system. It extends normal supervised learning techniques to learning problems where only linguistic teaching signals are available. The fuzzy supervised learning scheme can train the proposed system with desired fuzzy input-output pairs, which are fuzzy numbers instead of the normal numerical values. With fuzzy supervised learning, the proposed system can be used for rule base concentration to reduce the number of rules in a fuzzy rule base. Simulation results are presented to illustrate the performance and applicability of the proposed system.
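The alpha-level-set representation makes the fuzzy forward pass concrete: each fuzzy number becomes a stack of nested intervals, and a connection multiplies intervals endpoint-wise. A minimal sketch with triangular fuzzy numbers (the shapes and the set of alpha-levels are arbitrary choices for illustration):

```python
import numpy as np

def alpha_cuts(a, b, c, levels):
    """Alpha-level intervals [left, right] of a triangular fuzzy number (a, b, c)."""
    return np.array([[a + al * (b - a), c - al * (c - b)] for al in levels])

levels = np.linspace(0.0, 1.0, 5)
x = alpha_cuts(1.0, 2.0, 3.0, levels)    # fuzzy input, "about 2"
w = alpha_cuts(0.5, 1.0, 1.5, levels)    # fuzzy weight, "about 1"

# One fuzzy connection, level by level: interval multiplication takes the
# extremes over all four endpoint products.
prods = np.stack([x[:, 0] * w[:, 0], x[:, 0] * w[:, 1],
                  x[:, 1] * w[:, 0], x[:, 1] * w[:, 1]], axis=1)
out = np.stack([prods.min(axis=1), prods.max(axis=1)], axis=1)
```

At the alpha = 1 level the intervals collapse to the crisp product 2 x 1 = 2, while the alpha = 0 level carries the full spread of the linguistic uncertainty; a crisp number would simply be a fuzzy singleton whose intervals are all degenerate.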

13.
Since the introduction of the Autoregressive Conditional Heteroscedasticity (ARCH) model of Engle [R. Engle, Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation, Econometrica 50 (1982) 987–1007], the literature on modelling the conditional second moment has grown increasingly popular over the last two decades. Many extensions and alternatives to the original ARCH model have been proposed with the aim of capturing the dynamics of volatility more accurately. Interestingly, the Quasi Maximum Likelihood Estimator (QMLE) with normal density is typically used to estimate the parameters in these models. As such, the higher moments of the underlying distribution are assumed to be the same as those of the normal distribution. However, various studies reveal that higher moments of the distribution of financial returns, such as skewness and kurtosis, are unlikely to be the same as those of the normal distribution, and in some cases they are not even constant over time. This has significant implications for risk management, especially for the calculation of Value-at-Risk (VaR), which focuses on the negative quantile of the return distribution. Failure to accurately capture the shape of the negative quantile produces an inaccurate measure of risk and subsequently leads to misleading decisions in risk management. This paper proposes to model the distribution of financial returns more accurately by introducing a general framework based on the maximum entropy density (MED). The main advantage of the MED is that it estimates the distribution function directly from a given set of data, and it provides a convenient framework for modelling higher-order moments up to any arbitrary finite order k.
However, this flexibility comes at a high computational cost as k increases, so this paper also proposes an alternative model that reduces computation time substantially. Moreover, the sensitivity of the MED parameters with respect to dynamic changes in the moments is derived analytically. This result is important because it relates the dynamic structure of the moments to the parameters of the MED. The usefulness of this approach is demonstrated using 5-min intra-daily returns of the Euro/USD exchange rate.
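A small numerical sketch of the MED idea: estimate the exponential-family parameters by minimizing the convex dual, log Z(lambda) minus lambda dotted with the sample moments, on a grid, with moments up to order k = 4 so that kurtosis is matched. The Student-t "returns", the grid, and the optimizer settings are illustrative assumptions, not the paper's estimation procedure:

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(4)
# Heavy-tailed stand-in for financial returns: Student-t scaled to unit variance.
r = rng.standard_t(df=12, size=20000) / np.sqrt(12 / 10)

k = 4                                                    # highest moment order
target = np.array([np.mean(r**j) for j in range(1, k + 1)])

x = np.linspace(-6, 6, 1201); dx = x[1] - x[0]
T = np.stack([x**j for j in range(1, k + 1)])            # sufficient statistics

def dual(lam):
    """Convex dual of the MED problem: log Z(lam) - lam . sample_moments."""
    e = np.clip(lam @ T, -500.0, 500.0)                  # guard against overflow
    return np.log(np.exp(e).sum() * dx) - lam @ target

res = minimize(dual, np.array([0.0, -0.5, 0.0, 0.0]), method='Nelder-Mead',
               options={'maxiter': 8000, 'xatol': 1e-9, 'fatol': 1e-12})
w = np.exp(np.clip(res.x @ T, -500.0, 500.0))
p = w / (w.sum() * dx)                                   # fitted MED on the grid
m2 = (x**2 * p).sum() * dx
m4 = (x**4 * p).sum() * dx
```

At the dual optimum the fitted density reproduces the sample moments, so the recovered fourth moment exceeds the Gaussian value of 3, exactly the excess-kurtosis behavior the QMLE-with-normal-density assumption misses.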

14.
In speaker verification over public telephone networks, utterances can be obtained from different types of handsets. Different handsets may introduce different degrees of distortion to the speech signals. This paper attempts to combine a handset selector with (1) handset-specific transformations, (2) reinforced learning, and (3) stochastic feature transformation to reduce the effect caused by the acoustic distortion. Specifically, during training, the clean speaker models and background models are first transformed by MLLR-based handset-specific transformations using a small amount of distorted speech data. Then reinforced learning is applied to adapt the transformed models to handset-dependent speaker models and handset-dependent background models using stochastically transformed speaker patterns. During a verification session, a GMM-based handset classifier is used to identify the most likely handset used by the claimant; the corresponding handset-dependent speaker and background model pairs are then used for verification. Experimental results based on 150 speakers of the HTIMIT corpus show that environment adaptation based on the combination of MLLR, reinforced learning and feature transformation outperforms CMS, Hnorm, Tnorm, and speaker model synthesis.

15.
Pang Zhiqi, Guo Jifeng, Sun Wenbo, Xiao Yanbang, Yu Ming. Applied Intelligence, 2022, 52(3): 2987–3001

Although the single-domain person re-identification (Re-ID) method has achieved great accuracy, the dependence on labels from the same image domain severely limits the scalability of this method. Therefore, cross-domain Re-ID has received increasing attention. In this paper, a novel cross-domain Re-ID method combining supervised and unsupervised learning is proposed, which includes two models: a triple-condition generative adversarial network (TC-GAN) and a dual-task feature extraction network (DFE-Net). We first use TC-GAN to generate labeled images with the target style, and then we combine supervised and unsupervised learning to optimize DFE-Net. Specifically, we use the labeled generated data for supervised learning. In addition, we mine effective information in the target data from two perspectives for unsupervised learning. To effectively combine the two types of learning, we design a dynamic weighting function to dynamically adjust the weights of these two approaches. To verify the validity of TC-GAN, DFE-Net, and the dynamic weighting function, we conduct multiple experiments on Market-1501 and DukeMTMC-reID. The experimental results show that the dynamic weighting function can improve the performance of the models, and our method outperforms many state-of-the-art methods.

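The paper's dynamic weighting function is not specified in the abstract; the sketch below assumes a simple cosine schedule that shifts trust from the supervised loss (on labeled generated data) to the unsupervised target-domain loss as training progresses:

```python
import numpy as np

# Hypothetical dynamic weighting: early epochs trust the labeled generated
# data; later epochs shift weight to the unsupervised target-domain terms.
def dynamic_weight(epoch, total_epochs):
    w_sup = 0.5 * (1 + np.cos(np.pi * epoch / total_epochs))   # 1 -> 0
    return w_sup, 1.0 - w_sup

def total_loss(l_sup, l_unsup, epoch, total_epochs):
    w_sup, w_unsup = dynamic_weight(epoch, total_epochs)
    return w_sup * l_sup + w_unsup * l_unsup

w0 = dynamic_weight(0, 100)      # start of training: all supervised
wN = dynamic_weight(100, 100)    # end of training: all unsupervised
ws = [dynamic_weight(e, 100)[0] for e in range(101)]
```

Any monotone schedule with the same endpoints would fit the abstract's description; the cosine form is just a smooth, common choice.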

16.
Adaptive sparseness for supervised learning
The goal of supervised learning is to infer a functional mapping based on a set of training examples. To achieve good generalization, it is necessary to control the "complexity" of the learned function. In Bayesian approaches, this is done by adopting a prior for the parameters of the function being learned. We propose a Bayesian approach to supervised learning, which leads to sparse solutions; that is, in which irrelevant parameters are automatically set exactly to zero. Other ways to obtain sparse classifiers (such as Laplacian priors, support vector machines) involve (hyper)parameters which control the degree of sparseness of the resulting classifiers; these parameters have to be somehow adjusted/estimated from the training data. In contrast, our approach does not involve any (hyper)parameters to be adjusted or estimated. This is achieved by a hierarchical-Bayes interpretation of the Laplacian prior, which is then modified by the adoption of a Jeffreys' noninformative hyperprior. Implementation is carried out by an expectation-maximization (EM) algorithm. Experiments with several benchmark data sets show that the proposed approach yields state-of-the-art performance. In particular, our method outperforms SVMs and performs competitively with the best alternative techniques, although it involves no tuning or adjustment of sparseness-controlling hyperparameters.
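The flavor of the hierarchical-Bayes scheme (per-coefficient scales that let irrelevant parameters collapse exactly to zero, with no sparseness hyperparameter to tune) can be imitated with a simple reweighted-ridge iteration. This is a sketch in the spirit of the method, not the paper's EM algorithm; the data, the assumed noise level, and the iteration count are all illustrative:

```python
import numpy as np

rng = np.random.default_rng(5)
n, d = 100, 10
X = rng.normal(size=(n, d))
w_true = np.zeros(d); w_true[:2] = [3.0, -2.0]    # only two relevant features
y = X @ w_true + 0.1 * rng.normal(size=n)

# Reweighted-ridge iteration in the spirit of the hierarchical-Bayes scheme:
# each coefficient carries its own scale |w_i|; scales of irrelevant
# coefficients collapse, driving those coefficients to (numerical) zero.
w = np.linalg.lstsq(X, y, rcond=None)[0]
sigma2 = 0.05                                     # assumed noise level
for _ in range(60):
    V = np.diag(np.abs(w))
    A = V @ X.T @ X @ V + sigma2 * np.eye(d)
    w = V @ np.linalg.solve(A, V @ (X.T @ y))
```

The per-coefficient scales play the role of the hierarchical prior's hyperparameters: small coefficients shrink their own scale, which shrinks them further, while well-supported coefficients are left essentially unpenalized.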

17.
We discuss how a large class of regularization methods, collectively known as spectral regularization and originally designed for solving ill-posed inverse problems, gives rise to regularized learning algorithms. All of these algorithms are consistent kernel methods that can be easily implemented. The intuition behind their derivation is that the same principle allowing for the numerical stabilization of a matrix inversion problem is crucial to avoid overfitting. The various methods have a common derivation but different computational and theoretical properties. We describe examples of such algorithms, analyze their classification performance on several data sets and discuss their applicability to real-world problems.
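Two members of the spectral-regularization family are easy to compare side by side: Tikhonov regularization (kernel ridge), which applies the filter 1/(s + n*lambda) to the kernel spectrum, and spectral cut-off, which inverts only eigencomponents above a threshold. The kernel, data, and thresholds below are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(6)
x = np.sort(rng.uniform(-3, 3, 80))
y = np.sin(x) + 0.1 * rng.normal(size=80)

K = np.exp(-0.5 * (x[:, None] - x[None, :])**2)   # Gaussian kernel matrix

n, lam = len(x), 1e-3
# Tikhonov / kernel ridge: spectral filter g(s) = 1/(s + n*lam).
c_ridge = np.linalg.solve(K + n * lam * np.eye(n), y)

# Spectral cut-off: invert only eigencomponents above a threshold,
# discarding the ill-conditioned tail of the spectrum.
s, U = np.linalg.eigh(K)
keep = s > 1e-2
c_cut = U[:, keep] @ ((U[:, keep].T @ y) / s[keep])

rmse = {name: float(np.sqrt(np.mean((K @ c - np.sin(x))**2)))
        for name, c in (('ridge', c_ridge), ('cutoff', c_cut))}
```

Both filters stabilize the same near-singular inversion, so both recover the underlying sine to within roughly the noise level; they differ only in how they damp the small eigenvalues.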

18.
We present a mesh optimization algorithm for adaptively improving the finite element interpolation of a function of interest. The algorithm minimizes an objective function by swapping edges and moving nodes. Numerical experiments are performed on model problems. The results illustrate that the mesh optimization algorithm can reduce the W^{1,∞} semi-norm of the interpolation error. For these examples, the L^2, L^∞, and H^1 norms also decreased.

19.
A generalization of the maximum entropy method (MEM) is developed for the reconstruction of sign-alternating functions from two-dimensional tomographic measurement data. Three-dimensional algorithms for parallel-beam geometry are considered. Results of numerical simulations for a composite model are presented.

20.
Using localizing learning to improve supervised learning algorithms
Slow learning of neural-network function approximators can frequently be attributed to interference, which occurs when learning in one area of the input space causes unlearning in another area. To mitigate the effect of unlearning, this paper develops an algorithm that adjusts the weights of an arbitrary, nonlinearly parameterized network such that the potential for future interference during learning is reduced. This is accomplished by the reduction of a biobjective cost function that combines the approximation error and a term that measures interference. An analysis of the algorithm's convergence properties shows that learning with this algorithm reduces future unlearning. The algorithm can be used either during online learning or can be used to condition a network to have immunity from interference during a future learning stage. A simple example demonstrates how interference manifests itself in a network and how less interference can lead to more efficient learning. Simulations demonstrate how this new learning algorithm speeds up the training in various situations due to the extra cost function term.
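For a linearly parameterized approximator f(x) = w . phi(x), interference is directly visible: a gradient step that corrects the error at one input changes the output at another input in proportion to phi(x1) . phi(x2). The sketch below (an illustration of the phenomenon, not the paper's cost function) compares a global polynomial basis with a localized RBF basis on that overlap measure:

```python
import numpy as np

def interference(phi, x1, x2):
    """Normalized overlap of parameter gradients at two inputs. For
    f(x) = w . phi(x), a step reducing the error at x1 changes f(x2)
    in proportion to phi(x1) . phi(x2)."""
    g1, g2 = phi(x1), phi(x2)
    return abs(g1 @ g2) / (np.linalg.norm(g1) * np.linalg.norm(g2))

# Global basis: every parameter affects every input.
poly = lambda x: np.array([x**j for j in range(6)], dtype=float)

# Localized basis: each parameter only affects inputs near its center.
centers = np.linspace(-2.0, 2.0, 6)
rbf = lambda x: np.exp(-(x - centers)**2 / 0.1)

i_poly = interference(poly, -1.5, 1.5)   # large: distant points interfere
i_rbf = interference(rbf, -1.5, 1.5)     # tiny: learning stays local
```

Localized representations are one classical way to keep this overlap small; the paper's contribution is to reduce the interference term for arbitrary nonlinearly parameterized networks instead.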


Copyright © Beijing Qinyun Technology Development Co., Ltd.  京ICP备09084417号