Statistical challenges of high-dimensional data
DOI10.1098/RSTA.2009.0159zbMATH Open1185.62007OpenAlexW2153491803WikidataQ42790182 ScholiaQ42790182MaRDI QIDQ3559944FDOQ3559944
Authors: Iain M. Johnstone, D. M. Titterington
Publication date: 8 May 2010
Published in: Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1098/rsta.2009.0159
Recommendations
Classification and discrimination; cluster analysis (statistical aspects) (62H30) Linear regression; mixed models (62J05)
Cites Work
- Random forests
- Simultaneous analysis of Lasso and Dantzig selector
- The Dantzig selector: statistical estimation when \(p\) is much larger than \(n\). (With discussions and rejoinder).
- Covariance regularization by thresholding
- Higher criticism thresholding: Optimal feature selection when useful features are rare and weak
- Impossibility of successful classification when useful features are rare and weak
- On Consistency and Sparsity for Principal Components Analysis in High Dimensions
- Finite sample approximation results for principal component analysis: A matrix perturbation approach
- Decoding by Linear Programming
- Fisher lecture: Dimension reduction in regression
- Operator norm consistent estimation of large-dimensional sparse covariance matrices
- For most large underdetermined systems of linear equations the minimal 𝓁1‐norm solution is also the sparsest solution
- Variational Bayesian learning of directed graphical models with hidden variables
- Laplacian Eigenmaps for Dimensionality Reduction and Data Representation
- Multivariate analysis and Jacobi ensembles: largest eigenvalue, Tracy-Widom limits and rates of convergence
- Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data
- Sequence alignment in bioinformatics
- Properties of Diagnostic Data Distributions
- Random matrix theory: A program of the statistics and applied mathematical sciences institute (SAMSI)
- Bayesian methods for neural networks and related models
- A report on the future of statistics (with comments and rejoinder)
Cited In (41)
- Recovery of partly sparse and dense signals
- Data science, big data and statistics
- Cohesion and Repulsion in Bayesian Distance Clustering
- High-Dimensional Data Classification
- Statistical challenges with high dimensionality: feature selection in knowledge discovery
- Power-expected-posterior priors for variable selection in Gaussian linear models
- Variable screening for high dimensional time series
- Bayesian growth curve model useful for high-dimensional longitudinal data
- A new test for part of high dimensional regression coefficients
- Optimally Weighted PCA for High-Dimensional Heteroscedastic Data
- High dimensional extension of the growth curve model and its application in genetics
- Statistical plasmode simulations-potentials, challenges and recommendations
- Detection of weak signals in high-dimensional complex-valued data
- Title not available (Why is that?)
- Robust PCA for high‐dimensional data based on characteristic transformation
- A guided random walk through some high dimensional problems
- Decomposition feature selection with applications in detecting correlated biomarkers of bipolar disorders
- Natural coordinate descent algorithm for \(\ell_1\)-penalised regression in generalised linear models
- On the sphericity test with large-dimensional observations
- Using scientifically and statistically sufficient statistics in comparing image segmentations
- Title not available (Why is that?)
- Asymptotic performance of PCA for high-dimensional heteroscedastic data
- Maximum interpoint distance of high-dimensional random vectors
- Fast stepwise regression based on multidimensional indexes
- A comparative study on high-dimensional bayesian regression with binary predictors
- Guest editor's introduction to the special issue on ``Modern dimension reduction methods for big data problems in ecology
- Some challenges for statistics
- Statistical analysis of very high-dimensional data sets of hierarchically structured binary variables with missing data: An application to marine corps readiness evaluations
- Network-based sparse Bayesian classification
- Expectation propagation in linear regression models with spike-and-slab priors
- The curse of dimensionality -- a challenge for mathematical statistics
- A new approach for the computation of halfspace depth in high dimensions
- Qualitative assumptions and regularization in high-dimensional statistics. Abstracts from the workshop held November 5--11, 2006.
- Sparse learning of the disease severity score for high-dimensional data
- A novel wavelength interval selection based on split regularized regression for spectroscopic data
- The singular values and vectors of low rank perturbations of large rectangular random matrices
- Point process convergence for symmetric functions of high-dimensional random vectors
- Point process convergence for the off-diagonal entries of sample covariance matrices
- High-Dimension, Low–Sample Size Perspectives in Constrained Statistical Inference
- Future of statistics
- Impacts of high dimensionality in finite samples
This page was built for publication: Statistical challenges of high-dimensional data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3559944)