1.
2.
We propose a fast, incremental algorithm for designing linear regression models. The proposed algorithm generates a sparse model by optimizing multiple smoothing parameters using the generalized cross-validation approach. Its performance on synthetic and real-world data sets is compared with that of other incremental algorithms, such as Tipping and Faul's fast relevance vector machine, Chen et al.'s orthogonal least squares, and Orr's regularized forward selection. The results demonstrate that the proposed algorithm is competitive.
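As a hedged illustration of the generalized cross-validation idea behind this entry, the sketch below scores a single ridge smoothing parameter by GCV; the single-parameter setting, the data, and all names are assumptions (the paper itself optimizes multiple smoothing parameters incrementally).

```python
import numpy as np

def gcv_score(X, y, lam):
    """Generalized cross-validation score for ridge regression with a
    single smoothing parameter lam (a simplification: the paper
    optimizes multiple smoothing parameters)."""
    n, d = X.shape
    # Hat matrix H = X (X^T X + lam I)^{-1} X^T
    H = X @ np.linalg.solve(X.T @ X + lam * np.eye(d), X.T)
    residuals = y - H @ y
    # GCV(lam) = n * ||(I - H) y||^2 / trace(I - H)^2
    return n * np.sum(residuals**2) / (n - np.trace(H))**2

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))
y = X[:, 0] - 2 * X[:, 1] + 0.1 * rng.normal(size=100)
lams = np.logspace(-4, 2, 25)
best = min(lams, key=lambda lam: gcv_score(X, y, lam))
print("GCV-selected lambda:", best)
```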
3.
In this paper, we are concerned with a generalized Gauss-Seidel approach to sparse linear least-squares problems. Two algorithms, related to those given by Schechter (1959), for the solution of linear systems are presented and their parallel implementation is discussed. In these procedures, which can be viewed as an alternative ordering of the variables in the SOR methods, the variables are divided into nondisjoint groups. Numerical results, obtained on a CRAY X-MP/48, are presented and discussed.
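A minimal sketch of the SOR iteration family this entry generalizes, applied to the normal equations of a least-squares problem; the nondisjoint variable groups and the parallel CRAY implementation are not reproduced, and the data are made up.

```python
import numpy as np

def sor_normal_equations(X, y, omega=1.2, iters=200):
    """Solve the least-squares problem min ||X b - y||^2 by applying
    SOR to the normal equations A b = c, A = X^T X, c = X^T y.
    A plain sequential sketch; the paper's generalized Gauss-Seidel
    variant groups variables into nondisjoint blocks instead."""
    A = X.T @ X
    c = X.T @ y
    b = np.zeros(A.shape[0])
    for _ in range(iters):
        for i in range(len(b)):
            # Gauss-Seidel update for coordinate i, relaxed by omega.
            r = c[i] - A[i] @ b + A[i, i] * b[i]
            b[i] = (1 - omega) * b[i] + omega * r / A[i, i]
    return b

rng = np.random.default_rng(1)
X = rng.normal(size=(50, 5))
y = X @ np.array([1., -2., 0., 3., 0.5]) + 0.01 * rng.normal(size=50)
print(sor_normal_equations(X, y))
```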
4.
Non-convex regularizers usually improve the performance of sparse estimation in practice. To substantiate this, we study the conditions for sparse estimation with sharp concave regularizers, a general family of non-convex regularizers that includes many existing ones. For global solutions of the regularized regression, our sparse-eigenvalue-based conditions are weaker than those of L1-regularization for parameter estimation and sparseness estimation. For approximate global and approximate stationary (AGAS) solutions, almost the same conditions suffice. We show that the desired AGAS solutions can be obtained by coordinate descent (CD) based methods. Finally, we present experiments on the ability of CD methods to produce AGAS solutions and on how weak the estimation conditions required by sharp concave regularizers are.
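To make the coordinate descent connection concrete, here is a sketch of CD for regression with the MCP penalty, used as a stand-in non-convex concave regularizer; whether MCP meets the paper's exact sharpness definition is an assumption, and the thresholding rule shown is the standard one for standardized columns.

```python
import numpy as np

def soft(z, t):
    return np.sign(z) * max(abs(z) - t, 0.0)

def cd_mcp(X, y, lam=0.1, gamma=3.0, iters=100):
    """Coordinate descent for regression with the MCP penalty, used
    here as one concrete non-convex (concave) regularizer; assumed,
    not taken from the paper. Assumes columns of X are standardized
    so that x_j^T x_j = n."""
    n, d = X.shape
    b = np.zeros(d)
    r = y.copy()                         # current residual y - X b
    for _ in range(iters):
        for j in range(d):
            z = X[:, j] @ r / n + b[j]   # partial-residual correlation
            if abs(z) <= gamma * lam:    # MCP thresholding rule
                new = soft(z, lam) / (1 - 1 / gamma)
            else:                        # beyond gamma*lam: no shrinkage
                new = z
            r += X[:, j] * (b[j] - new)  # update residual incrementally
            b[j] = new
    return b

rng = np.random.default_rng(2)
n, d = 200, 20
X = rng.normal(size=(n, d))
X = (X - X.mean(0)) / X.std(0)           # standardize columns
beta = np.zeros(d); beta[:3] = [2., -1.5, 1.]
y = X @ beta + 0.1 * rng.normal(size=n)
print(np.round(cd_mcp(X, y), 2))
```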
5.
Computational Statistics & Data Analysis, 1988, 6(3): 203-227
L1 and L2 cross-validation criteria are studied for a wide class of kernel estimators: estimators that have expansions as linear combinations of prescribed functions. In particular, the L1 and L2 cross-validation objective functions to be minimized are described for such estimators and specialized for estimators based on orthogonal series. An alternative L2 cross-validation criterion due to Scott and Terrell is adapted for these estimators. A new family of such estimators, “decreasing policy” estimators, is introduced. Results are given for both continuous and discrete densities, but the main emphasis is on the discrete case. In the multinomial case, new predictors are compared with some of the estimators proposed by Good, Fienberg and Holland, and Stone, and studied by Stone (1974), and consistency is discussed.
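A hedged sketch of the L2 (least-squares) cross-validation criterion in its most familiar instance, a Gaussian kernel density estimate on continuous data; the paper's wider class of expansion-based estimators and the discrete case are not covered, and the data are synthetic.

```python
import numpy as np

def gauss(u, h):
    return np.exp(-0.5 * (u / h) ** 2) / (h * np.sqrt(2 * np.pi))

def lscv(x, h):
    """Least-squares (L2) cross-validation score for a Gaussian kernel
    density estimate with bandwidth h:
        LSCV(h) = int fhat^2 - (2/n) * sum_i fhat_{-i}(x_i).
    For Gaussian kernels the integral term has the closed form
    (1/n^2) * sum_{i,j} phi_{sqrt(2)*h}(x_i - x_j)."""
    n = len(x)
    D = x[:, None] - x[None, :]
    integral = gauss(D, np.sqrt(2) * h).sum() / n**2
    K = gauss(D, h)
    np.fill_diagonal(K, 0.0)             # leave-one-out: drop self-terms
    loo = K.sum(axis=1) / (n - 1)
    return integral - 2 * loo.mean()

rng = np.random.default_rng(3)
x = rng.normal(size=300)
hs = np.linspace(0.05, 1.0, 40)
print("LSCV-selected bandwidth:", min(hs, key=lambda h: lscv(x, h)))
```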
6.
7.
Zhang T. Neural Computation, 2002, 14(12): 3013-3042
Gaussian processes have been widely applied to regression problems with good performance. However, they can be computationally expensive. In order to reduce the computational cost, there have been recent studies on using sparse approximations in Gaussian processes. In this article, we investigate properties of certain sparse regression algorithms that approximately solve a Gaussian process. We obtain approximation bounds and compare our results with related methods.
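As one concrete example of the kind of sparse approximation analyzed here, the following sketch implements the subset-of-regressors predictive mean; the choice of inducing points, the kernel, and all data are assumptions rather than the paper's prescription.

```python
import numpy as np

def rbf(A, B, ell=1.0):
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / ell**2)

def sor_gp_mean(X, y, Xstar, m=20, noise=0.1, ell=1.0):
    """Subset-of-regressors sparse approximation to GP regression:
    mean(x*) = K_*m (K_mn K_nm + noise^2 K_mm)^{-1} K_mn y,
    using the first m training points as inducing points (a
    simplifying assumption; the paper analyzes such sparse
    approximations rather than prescribing this particular one)."""
    Xm = X[:m]
    Kmn = rbf(Xm, X, ell)
    Kmm = rbf(Xm, Xm, ell)
    A = Kmn @ Kmn.T + noise**2 * Kmm
    w = np.linalg.solve(A, Kmn @ y)
    return rbf(Xstar, Xm, ell) @ w

rng = np.random.default_rng(4)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=200)
Xstar = np.linspace(-3, 3, 5)[:, None]
print(np.round(sor_gp_mean(X, y, Xstar), 2))
```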
8.
Marcelo B. A. Veras, Diego P. P. Mesquita, Cesar L. C. Mattos, João P. P. Gomes. Pattern Analysis & Applications, 2020, 23(3): 1293-1303
Incomplete data are often neglected when designing machine learning methods. A popular strategy adopted by practitioners to circumvent this consists of taking a...
9.
In this paper, we propose a novel nonparallel classifier, named the sparse nonparallel support vector machine (SNSVM), for binary classification. Unlike existing nonparallel classifiers, such as the twin support vector machines (TWSVMs), SNSVM has several advantages: it constructs two convex quadratic programming problems for both the linear and nonlinear cases, which can be solved efficiently by the successive overrelaxation technique; it does not need to compute inverse matrices before training; it has sparseness similar to that of standard SVMs; and it degenerates to the TWSVMs when the parameters are appropriately chosen. SNSVM therefore holds theoretical advantages over these classifiers. Experimental results on numerous data sets show the effectiveness of our method in both sparseness and classification accuracy, further confirming these conclusions.
10.
Tapio Pahikkala, Antti Airola, Michiel Stock, Bernard De Baets, Willem Waegeman. Machine Learning, 2013, 93(2-3): 321-356
In domains like bioinformatics, information retrieval and social network analysis, one can find learning tasks where the goal consists of inferring a ranking of objects, conditioned on a particular target object. We present a general kernel framework for learning conditional rankings from various types of relational data, where rankings can be conditioned on unseen data objects. We propose efficient algorithms for conditional ranking by optimizing squared regression and ranking loss functions. We show theoretically that learning with the ranking loss is likely to generalize better than with the regression loss. Further, we prove that symmetry or reciprocity properties of relations can be efficiently enforced in the learned models. Experiments on synthetic and real-world data illustrate that the proposed methods deliver state-of-the-art performance in terms of predictive power and computational efficiency. Moreover, we also show empirically that incorporating symmetry or reciprocity properties can improve the generalization performance.
11.
Given n training examples, the training of a least squares support vector machine (LS-SVM) or kernel ridge regression (KRR) corresponds to solving a linear system of dimension n. In cross-validating LS-SVM or KRR, the training examples are split into two distinct subsets a number of times (l), wherein a subset of m examples is used for validation and the other subset of (n-m) examples is used for training the classifier. In this case, l linear systems of dimension (n-m) need to be solved. We propose a novel method for cross-validation (CV) of LS-SVM or KRR in which, instead of solving l linear systems of dimension (n-m), we compute the inverse of an n-dimensional square matrix and solve l linear systems of dimension m, thereby reducing the complexity when l is large and/or m is small. Typical multi-fold, leave-one-out (LOO-CV), and leave-many-out cross-validations are considered. For five-fold CV used in practice with five repetitions over randomly drawn slices, the proposed algorithm is approximately four times as efficient as the naive implementation. For large data sets, we propose to evaluate the CV approximately by applying the well-known incomplete Cholesky decomposition technique; the complexity of these approximate algorithms scales linearly with the data size if the rank of the associated kernel matrix is much smaller than n. Simulations are provided to demonstrate the performance of LS-SVM and the efficiency of the proposed algorithm, with comparisons to the naive and some existing implementations of multi-fold and LOO-CV.
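A minimal sketch of the leave-one-out special case (m = 1): with A = K + λI inverted once, each LOO residual follows from a classical identity, so no retraining on (n-1) points is needed. The kernel, data, and regularization values are assumptions.

```python
import numpy as np

def rbf_kernel(X, ell=1.0):
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / ell**2)

def krr_loo_residuals(K, y, lam):
    """Leave-one-out residuals for kernel ridge regression via one
    n x n inverse, instead of n trainings on (n-1) points.
    With A = K + lam*I and alpha = A^{-1} y, the classical identity
    gives  e_i = y_i - f_{-i}(x_i) = alpha_i / (A^{-1})_{ii}.
    This is the m = 1 (LOO) special case of the paper's scheme,
    which also covers leave-m-out by solving small m x m systems."""
    Ainv = np.linalg.inv(K + lam * np.eye(len(y)))
    alpha = Ainv @ y
    return alpha / np.diag(Ainv)

rng = np.random.default_rng(5)
X = rng.uniform(-2, 2, size=(100, 1))
y = np.sin(2 * X[:, 0]) + 0.1 * rng.normal(size=100)
K = rbf_kernel(X)
for lam in [1e-3, 1e-2, 1e-1]:
    e = krr_loo_residuals(K, y, lam)
    print(f"lam={lam:g}  LOO MSE={np.mean(e**2):.4f}")
```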
12.
In this article, we propose a scalable Gaussian process (GP) regression method that combines the advantages of both global and local GP approximations through a two-layer hierarchical model using a variational inference framework. The upper layer consists of a global sparse GP to coarsely model the entire data set, whereas the lower layer comprises a mixture of sparse GP experts which exploit local information to learn a fine-grained model. A two-step variational inference algorithm is developed to learn the global GP, the GP experts and the gating network simultaneously. Stochastic optimization can be employed to allow the application of the model to large-scale problems. Experiments on a wide range of benchmark data sets demonstrate the flexibility, scalability and predictive power of the proposed method.
13.
Using the classical Parzen window (PW) estimate as the target function, the sparse kernel density estimator is constructed in a forward-constrained regression (FCR) manner. The proposed algorithm selects significant kernels one at a time, while the leave-one-out (LOO) test score is minimized subject to a simple positivity constraint in each forward stage. The model parameter estimation in each forward stage is simply the solution of a jackknife parameter estimator for a single parameter, subject to the same positivity constraint check. For each selected kernel, the associated kernel width is updated via the Gauss-Newton method with the model parameter estimate fixed. The proposed approach is simple to implement and the associated computational cost is very low. Numerical examples are employed to demonstrate the efficacy of the proposed approach.
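The sketch below is a simplified stage-wise variant of the idea: greedily add kernels that best fit the full Parzen window target under a positivity constraint. The paper instead minimizes a LOO test score, uses jackknife parameter estimates, and tunes kernel widths by Gauss-Newton, none of which is reproduced here.

```python
import numpy as np

def gauss_k(u, h):
    return np.exp(-0.5 * (u / h) ** 2) / (h * np.sqrt(2 * np.pi))

def sparse_parzen(x, h=0.3, n_kernels=8):
    """Greedy forward selection of kernel centers to approximate the
    full Parzen window (PW) density estimate: each stage adds the
    candidate kernel with the largest L2 error reduction, keeping
    only positive weights. A stage-wise sketch, not the paper's
    LOO/jackknife procedure."""
    Phi = gauss_k(x[:, None] - x[None, :], h)   # candidate kernels at data
    target = Phi.mean(axis=1)                   # full PW estimate at data
    r = target.copy()
    selected, weights = [], []
    for _ in range(n_kernels):
        num = Phi.T @ r                         # phi_j^T r for all j
        den = (Phi**2).sum(axis=0)
        gain = np.where(num > 0, num**2 / den, -np.inf)
        gain[selected] = -np.inf
        j = int(np.argmax(gain))
        if not np.isfinite(gain[j]):            # no positive candidate left
            break
        a = num[j] / den[j]                     # positive by construction
        selected.append(j); weights.append(a)
        r -= a * Phi[:, j]
    w = np.array(weights)
    return x[selected], w / w.sum()             # renormalize to a density

rng = np.random.default_rng(6)
x = np.concatenate([rng.normal(-2, 0.5, 150), rng.normal(1, 1.0, 150)])
centers, w = sparse_parzen(x)
print(np.round(centers, 2), np.round(w, 3))
```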
14.
15.
Cong Fu, Xiangmin Jiao, Tao Yang. IEEE Transactions on Parallel and Distributed Systems, 1998, 9(2): 109-125
A sparse LU factorization based on Gaussian elimination with partial pivoting (GEPP) is important to many scientific applications, but it is still an open problem to develop a high-performance GEPP code on distributed memory machines. The main difficulty is that partial pivoting operations dynamically change computation and nonzero fill-in structures during the elimination process. This paper presents an approach called S* for parallelizing this problem on distributed memory machines. The S* approach adopts static symbolic factorization to avoid run-time control overhead, incorporates 2D L/U supernode partitioning and amalgamation strategies to improve caching performance, and exploits irregular task parallelism embedded in sparse LU using asynchronous computation scheduling. The paper discusses and compares the algorithms using 1D and 2D data mapping schemes, and presents experimental studies on Cray T3D and T3E. The performance results for a set of nonsymmetric benchmark matrices are very encouraging, and S* has achieved up to 6.878 GFLOPS on 128 T3E nodes. To the best of our knowledge, this is the highest performance ever achieved for this challenging problem; the previous record was 2.583 GFLOPS on shared memory machines.
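For orientation only, here is how a sequential sparse LU with partial pivoting is invoked through SciPy's SuperLU binding; the paper's actual contribution, the distributed-memory S* approach, is not reflected in this sketch, and the test matrix is synthetic.

```python
import numpy as np
from scipy.sparse import random as sprand, eye
from scipy.sparse.linalg import splu

# Build a random sparse nonsymmetric matrix, shifted to be well
# conditioned, and factor it. SciPy's splu (SuperLU) is a sequential
# stand-in here: the paper's S* approach targets distributed-memory
# machines with static symbolic factorization and 2D L/U supernode
# partitioning, none of which this sketch reproduces.
rng = np.random.default_rng(7)
n = 500
A = (sprand(n, n, density=0.01, random_state=7) + 10 * eye(n)).tocsc()
lu = splu(A)                       # sparse LU with partial pivoting
b = rng.normal(size=n)
x = lu.solve(b)
print("residual norm:", np.linalg.norm(A @ x - b))
```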
16.
17.
Jan Fousek. The Journal of Supercomputing, 2018, 74(5): 1863-1884
Computational models of the human brain provide an important tool for studying the principles behind brain function and disease. To achieve whole-brain simulation, models are formulated at the level of neuronal populations as systems of delayed differential equations. In this paper, we show that the integration of large systems of sparsely connected neural masses is similar to well-studied sparse matrix-vector multiplication; however, due to delayed contributions, it differs in the data access pattern to the vectors. To improve data locality, we propose a combination of node reordering and tiled schedules derived from the connectivity matrix of the particular system, which allows performing multiple integration steps within a tile. We present two schedules: one with serial processing of the tiles and one allowing for parallel processing of the tiles. We evaluate the presented schedules, showing speedups of up to 2× on a single-socket CPU and 1.25× on a Xeon Phi accelerator.
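A minimal sketch of the data-access pattern in question: a CSR-style sparse matrix-vector product where each nonzero reads a delayed past state from a ring buffer. The tiled schedules and node reordering that the paper proposes are omitted; all sizes and data are made up.

```python
import numpy as np

def delayed_spmv_step(indptr, indices, weights, delays, history, t):
    """One evaluation of the coupling term for a network of neural
    masses: like CSR sparse matrix-vector multiplication, except each
    nonzero (i, j) reads x_j at a delayed time t - tau_ij from a ring
    buffer of past states. This irregular access to `history` is what
    the paper's node reordering and tiled schedules optimize."""
    n = len(indptr) - 1
    out = np.zeros(n)
    H = history.shape[0]                         # ring-buffer length
    for i in range(n):
        acc = 0.0
        for k in range(indptr[i], indptr[i + 1]):
            j, w, d = indices[k], weights[k], delays[k]
            acc += w * history[(t - d) % H, j]   # delayed contribution
        out[i] = acc
    return out

rng = np.random.default_rng(8)
n, nnz_per_row, H = 6, 3, 16
indptr = np.arange(0, n * nnz_per_row + 1, nnz_per_row)
indices = rng.integers(0, n, size=n * nnz_per_row)
weights = rng.normal(size=n * nnz_per_row)
delays = rng.integers(1, H, size=n * nnz_per_row)
history = rng.normal(size=(H, n))                # past states x_j(t - d)
print(delayed_spmv_step(indptr, indices, weights, delays, history, t=0))
```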
18.
L. Guo. International Journal of Control, 2013, 86(3): 340-348
In this paper, a modified orthogonal forward regression (OFR) least-squares algorithm is presented for system identification and modelling from noisy regressors. Under the assumption that the energy and signal-to-noise ratio (SNR) of the signals are known or can be estimated, it is shown that unbiased estimates of the error reduction ratios (ERRs) and the parameters can be obtained in each forward regression step. Examples are provided to illustrate the proposed approach.
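For context, here is a sketch of the classical (biased) OFR selection loop with error reduction ratios; the paper's unbiasing correction for noisy regressors, which uses known energy and SNR, is deliberately omitted, and the data are synthetic.

```python
import numpy as np

def ofr_err(P, y, n_terms):
    """Classical orthogonal forward regression: at each step,
    orthogonalize the remaining candidate regressors against the
    chosen ones (Gram-Schmidt) and select the one with the largest
    error reduction ratio  ERR = (w^T y)^2 / (w^T w * y^T y).
    This is the standard biased version; the paper's contribution,
    unbiased ERR and parameter estimates under noisy regressors,
    is not implemented here."""
    W = P.astype(float).copy()       # working (orthogonalized) copies
    yty = y @ y
    chosen = []
    for _ in range(n_terms):
        err = np.array([(W[:, j] @ y) ** 2 / ((W[:, j] @ W[:, j]) * yty)
                        if j not in chosen else -1.0
                        for j in range(P.shape[1])])
        j = int(np.argmax(err))
        chosen.append(j)
        w = W[:, j].copy()
        # Deflate all candidates against the newly chosen basis vector.
        proj = (W.T @ w) / (w @ w)
        W -= np.outer(w, proj)
        W[:, j] = 0.0
        print(f"selected regressor {j}, ERR = {err[j]:.3f}")
    return chosen

rng = np.random.default_rng(9)
P = rng.normal(size=(300, 10))
y = 2 * P[:, 3] - P[:, 7] + 0.1 * rng.normal(size=300)
ofr_err(P, y, n_terms=2)
```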
19.
Hyperspectral imagery, with its rich spectral information, can be applied to detect and identify objects at a distance. In this paper, we concentrate on identifying the surface materials of objects of interest within the domains of space object identification (SOI) and geological survey. One approach is unmixing analysis, which identifies the components (called endmembers) in each pixel and estimates their corresponding fractional abundances, from which the spatial distributions of substances can be obtained. To solve this problem, we present a semi-supervised approach that assumes the measured spectrum is a linear combination of a number of pure spectral signatures from a spectral library, with the fractional abundances as their weights. The abundances are thus sparse, and we propose a sparse regression model to realize the sparse unmixing analysis. We apply the random projection technique to accelerate the sparse unmixing process and use split Bregman iteration to optimize the objective function. Our algorithm is tested and compared with other classic algorithms on simulated hyperspectral images and a real-world image.
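A hedged sketch of the sparse-regression view of unmixing: ISTA is used as a simple stand-in for the paper's split Bregman solver, and a Gaussian random projection compresses the rows before solving. The library, pixel, and parameter values are all synthetic assumptions.

```python
import numpy as np

def ista_unmix(A, y, lam=0.01, iters=500):
    """Sparse unmixing of one pixel spectrum y against a spectral
    library A by solving min_x 0.5*||A x - y||^2 + lam*||x||_1 with
    ISTA, a simple stand-in for the paper's split Bregman solver;
    nonnegativity of abundances is enforced by clipping."""
    L = np.linalg.norm(A, 2) ** 2              # Lipschitz constant of grad
    x = np.zeros(A.shape[1])
    for _ in range(iters):
        g = A.T @ (A @ x - y)
        x = np.maximum(x - g / L - lam / L, 0.0)   # prox step with x >= 0
    return x

rng = np.random.default_rng(10)
bands, lib = 100, 40
A = np.abs(rng.normal(size=(bands, lib)))      # synthetic spectral library
x_true = np.zeros(lib); x_true[[3, 17]] = [0.6, 0.4]
y = A @ x_true + 0.01 * rng.normal(size=bands)

# Random projection: sketch both A and y to fewer rows to speed up.
k = 40
R = rng.normal(size=(k, bands)) / np.sqrt(k)
x_hat = ista_unmix(R @ A, R @ y, lam=0.05)
print("recovered support:", np.nonzero(x_hat > 0.05)[0])
```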
20.
Fitting logistic regression models is challenging when their parameters are restricted. In this article, we first develop a quadratic lower-bound (QLB) algorithm for optimization with box or linear inequality constraints and derive the fastest QLB algorithm, corresponding to the smallest global majorization matrix. The proposed QLB algorithm is particularly suited to problems to which EM-type algorithms are not applicable (e.g., logistic, multinomial logistic, and Cox's proportional hazards models), while it retains the same EM ascent property and thus assures monotonic convergence. Secondly, we generalize the QLB algorithm to penalized problems in which the penalty functions may not be totally differentiable. The proposed method thus provides an alternative algorithm for estimation in lasso logistic regression, where the convergence of the existing lasso algorithm is not generally ensured. Finally, by relaxing the ascent requirement, convergence speed can be further accelerated. We introduce a pseudo-Newton method that retains the simplicity of the QLB algorithm and the fast convergence of the Newton method. Theoretical justification and numerical examples show that the pseudo-Newton method is up to 71 (in terms of CPU time) or 107 (in terms of number of iterations) times faster than the fastest QLB algorithm, thus making bootstrap variance estimation feasible. Simulations and comparisons are performed, and three real examples (Down syndrome data, kyphosis data, and colon microarray data) are analyzed to illustrate the proposed methods.
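A minimal sketch of the QLB/MM idea in the unconstrained case, using Böhning's global majorization of the logistic Hessian; the paper's box/linear constraints, penalized variants, and pseudo-Newton acceleration are not included, and the data are synthetic.

```python
import numpy as np

def qlb_logistic(X, y, iters=200):
    """Quadratic lower-bound (MM) iteration for unconstrained logistic
    regression, using Bohning's global majorization B = X^T X / 4,
    which bounds the Hessian of the negative log-likelihood:
        beta <- beta + 4 (X^T X)^{-1} X^T (y - p(beta)).
    Each step increases the log-likelihood monotonically. The paper's
    QLB algorithm additionally handles box/linear constraints and
    non-differentiable penalties, which this sketch omits."""
    beta = np.zeros(X.shape[1])
    Binv = 4 * np.linalg.inv(X.T @ X)     # inverse of majorization matrix
    for _ in range(iters):
        p = 1 / (1 + np.exp(-X @ beta))   # current fitted probabilities
        beta = beta + Binv @ (X.T @ (y - p))
    return beta

rng = np.random.default_rng(11)
n = 500
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
true = np.array([-0.5, 1.0, -2.0])
y = (rng.uniform(size=n) < 1 / (1 + np.exp(-X @ true))).astype(float)
print(np.round(qlb_logistic(X, y), 2))
```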