Sparse matrix linear models for structured high-throughput data
From MaRDI portal
Abstract: Recent technological advancements have led to the rapid generation of high-throughput biological data, which can be used to address novel scientific questions in broad areas of research. These data can be thought of as a large matrix with covariates annotating both rows and columns of this matrix. Matrix linear models provide a convenient way for modeling such data. In many situations, sparse estimation of these models is desired. We present fast, general methods for fitting sparse matrix linear models to structured high-throughput data. We induce model sparsity using an L1 penalty and consider the case when the response matrix and the covariate matrices are large. Due to data size, standard methods for estimation of these penalized regression models fail if the problem is converted to the corresponding univariate regression scenario. By leveraging matrix properties in the structure of our model, we develop several fast estimation algorithms (coordinate descent, FISTA, and ADMM) and discuss their trade-offs. We evaluate our method's performance on simulated data, E. coli chemical genetic screening data, and two Arabidopsis genetic datasets with multivariate responses. Our algorithms have been implemented in the Julia programming language and are available at https://github.com/senresearch/MatrixLMnet.jl.
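To make the estimation problem in the abstract concrete, here is a minimal sketch. The abstract gives no notation, so the following is assumed: Y is the n x m response matrix, X (n x p) holds the row covariates, Z (m x q) holds the column covariates, and B (p x q) is the coefficient matrix in Y = X B Z^T + E. The L1-penalized fit then minimizes 0.5 * ||Y - X B Z^T||_F^2 + lam * ||B||_1. Vectorizing gives vec(Y) = (Z kron X) vec(B) + vec(E), whose design matrix has n*m rows and p*q columns, which is why converting to a univariate regression becomes impractical for large data. The sketch below runs FISTA directly on the matrices and never forms the Kronecker product. It is written in plain Python for illustration only, it is not the MatrixLMnet.jl implementation, and every function and parameter name in it is hypothetical.

```python
# Illustrative sketch only (not the MatrixLMnet.jl code): FISTA for
#   minimize over B:  0.5 * ||Y - X B Z^T||_F^2 + lam * ||B||_1
# Matrices are lists of rows; all names are assumptions made for this example.

def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]


def transpose(A):
    """Transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*A)]


def soft_threshold(v, t):
    """Proximal operator of t * |v|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def fista_matrix_lasso(Y, X, Z, lam, n_iter=500):
    """Return an approximate minimizer B (p x q) of the penalized objective."""
    Xt, Zt = transpose(X), transpose(Z)
    # Step size 1/L, where L = ||X||_F^2 * ||Z||_F^2 is an upper bound on the
    # Lipschitz constant of the gradient (spectral norm <= Frobenius norm).
    L = sum(v * v for r in X for v in r) * sum(v * v for r in Z for v in r)
    p, q = len(X[0]), len(Z[0])
    B = [[0.0] * q for _ in range(p)]
    W = [row[:] for row in B]  # extrapolated (momentum) point
    t = 1.0
    for _ in range(n_iter):
        # Residual at the extrapolated point: R = Y - X W Z^T
        fitted = matmul(matmul(X, W), Zt)
        R = [[y - f for y, f in zip(yr, fr)] for yr, fr in zip(Y, fitted)]
        # Negative gradient of the smooth term: X^T R Z
        G = matmul(matmul(Xt, R), Z)
        # Gradient step followed by entrywise soft-thresholding
        B_new = [[soft_threshold(w + g / L, lam / L) for w, g in zip(wr, gr)]
                 for wr, gr in zip(W, G)]
        # Standard FISTA momentum update
        t_new = (1.0 + (1.0 + 4.0 * t * t) ** 0.5) / 2.0
        W = [[bn + ((t - 1.0) / t_new) * (bn - bo)
              for bn, bo in zip(bn_row, bo_row)]
             for bn_row, bo_row in zip(B_new, B)]
        B, t = B_new, t_new
    return B
```

As a sanity check, when X and Z are both identity matrices the exact minimizer is the entrywise soft-thresholding of Y at lam, so calling fista_matrix_lasso(Y, I, I, lam) should approach that matrix. The Frobenius-norm bound on L is a deliberately conservative choice that avoids an eigenvalue computation; a tighter constant would allow larger steps.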
Cites work
- scientific article; zbMATH DE number 3850830
- scientific article; zbMATH DE number 1750184
- scientific article; zbMATH DE number 845714
- A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems
- A differential equation for modeling Nesterov's accelerated gradient method: theory and insights
- A study of error variance estimation in Lasso regression
- Adaptive FISTA for Nonconvex Optimization
- Another look at the fast iterative shrinkage/thresholding algorithm (FISTA)
- Confidence Intervals and Hypothesis Testing for High-Dimensional Regression
- Coordinate descent algorithms for lasso penalized regression
- Distributed optimization and statistical learning via the alternating direction method of multipliers
- Functional data analysis.
- Julia: a fresh approach to numerical computing
- Learning graphical models with hubs
- Least angle regression. (With discussion)
- Regularization and Variable Selection Via the Elastic Net
Cited in
(4)