Data science, big data and statistics
DOI10.1007/S11749-019-00651-9zbMATH Open1428.62021OpenAlexW2935794055MaRDI QIDQ2273155FDOQ2273155
Authors: Pedro Galeano, Daniel Peña
Publication date: 18 September 2019
Published in: Test (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s11749-019-00651-9
Recommendations
- Data science. Theory, analysis and applications
- Data science. Foundations, statistics and machine learning
- Data science: theory and applications
- Statistical science in the world of big data
- Statistical methods and computing for big data
- Data science and machine learning. Mathematical and statistical methods
- scientific article; zbMATH DE number 1665312
- Big data analytics
- Statistical learning and data science
machine learningtime seriesnetwork analysisstatistical learningmultivariate datasparse model selection
Time series, auto-correlation, regression, etc. in statistics (GARCH) (62M10) Statistical aspects of big data and data science (62R07) Learning and adaptive systems in artificial intelligence (68T05) Foundations and philosophical topics in statistics (62A01) Estimation in multivariate analysis (62H12)
Cites Work
- Modelling the role of variables in model-based cluster analysis
- Outlier Detection and False Discovery Rates for Whole-Genome DNA Matching
- Forecasting multiple time series with one-sided dynamic principal components
- Image compression by sparse PCA coding in curvelet domain
- Nonlinear Time Series Analysis
- Computer age statistical inference. Algorithms, evidence, and data science
- Data learning from big data
- Journeys in big data statistics
- Handbook of big data analytics
- Handbook of big data
- Signal Detection in Underwater Sound Using Wavelets
- An introduction to envelopes. Dimension reduction for efficient estimation in multivariate statistics
- The most-cited statistical papers
- Nearest‐neighbors medians clustering
- A framework for feature selection in clustering
- Fast Algorithms for Large-Scale Generalized Distance Weighted Discrimination
- Regularized estimation in sparse high-dimensional time series models
- A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis
- Finding Groups in Data
- Robust statistics. Theory and methods (with R)
- Estimating the dimension of a model
- Asymptotic normality and optimalities in estimation of large Gaussian graphical models
- Gaussian graphical model estimation with false discovery rate control
- Functional data analysis.
- Penalized model-based clustering with application to variable selection
- Extended Bayesian information criteria for model selection with large model spaces
- Title not available (Why is that?)
- Variable Selection for Model-Based High-Dimensional Clustering and Its Application to Microarray Data
- A partial overview of the theory of statistics with functional data
- Title not available (Why is that?)
- Forecasting Using Principal Components From a Large Number of Predictors
- Title not available (Why is that?)
- Title not available (Why is that?)
- Statistics for high-dimensional data. Methods, theory and applications.
- Determining the Number of Factors in Approximate Factor Models
- The Generalized Dynamic Factor Model
- Variable Selection for Model-Based Clustering
- A new look at the statistical model identification
- A survey of cross-validation procedures for model selection
- Support-vector networks
- High-dimensional graphs and variable selection with the Lasso
- Finite mixture and Markov switching models.
- Nonparametric Estimation from Incomplete Observations
- Near-Optimal Signal Recovery From Random Projections: Universal Encoding Strategies?
- Title not available (Why is that?)
- Model-Based Gaussian and Non-Gaussian Clustering
- Title not available (Why is that?)
- Time-series clustering
- Bayesian compressed regression
- Robust Estimation of a Location Parameter
- Nearest neighbor pattern classification
- Ridge Regression: Biased Estimation for Nonorthogonal Problems
- Statistical analysis of network data. Methods and models
- Trimmed \(k\)-means: An attempt to robustify quantizers
- Statistical modeling: The two cultures. (With comments and a rejoinder).
- Sparse inverse covariance estimation with the graphical lasso
- Model-based clustering of high-dimensional data: a review
- Network vector autoregression
- Data science vs. statistics: two cultures?
- Model selection via multifold cross validation
- Regularized estimation of large covariance matrices
- Direct estimation of differential networks
- Finding an unknown number of multivariate outliers
- Gaussian Process Regression Analysis for Functional Data
- Robust principal component analysis?
- Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation
- Identifying a Simplifying Structure in Time Series
- Title not available (Why is that?)
- Title not available (Why is that?)
- Cluster Identification Using Projections
- Introduction to Functional Data Analysis
- Title not available (Why is that?)
- On the Concept of Depth for Functional Data
- Testing differential networks with applications to the detection of gene-gene interactions
- Linear Model Selection by Cross-Validation
- A Bayesian approach to some outlier problems
- Title not available (Why is that?)
- Title not available (Why is that?)
- The Predictive Sample Reuse Method with Applications
- Statistics for big data: a perspective
- A constrained \(\ell _{1}\) minimization approach to sparse precision matrix estimation
- Title not available (Why is that?)
- Panning for Gold: ‘Model-X’ Knockoffs for High Dimensional Controlled Variable Selection
- Fast unfolding of communities in large networks
- Stable signal recovery from incomplete and inaccurate measurements
- Gene hunting with hidden Markov model knockoffs
- Controlling the false discovery rate via knockoffs
- Sparse principal component analysis via regularized low rank matrix approximation
- Adaptive thresholding for sparse covariance matrix estimation
- Estimation with quadratic loss.
- Discovering the False Discovery Rate
- For most large underdetermined systems of linear equations the minimal 𝓁1‐norm solution is also the sparsest solution
- Inference in an Authorship Problem
- Outlier Detection in Multivariate Time Series by Projection Pursuit
- Compressed sensing
- Optimal rates of convergence for sparse covariance matrix estimation
- 10.1162/15324430260185646
- The Grand Tour: A Tool for Viewing Multidimensional Data
- Title not available (Why is that?)
- Geometric Representation of High Dimension, Low Sample Size Data
- Local linear quantile estimation for nonstationary time series
- On the optimality of the simple Bayesian classifier under zero-one loss
- Selection of Variables for Cluster Analysis and Classification Rules
- Statistical challenges of high-dimensional data
- Clustering time series by linear dependency
- Approximation of conditional densities by smooth mixtures of regressions
- Multifold Predictive Validation in ARMAX Time Series Models
- Forecasting with nonstationary dynamic factor models
- Robust distances for outlier-free goodness-of-fit testing
- Network science. With Márton Pósfai
- Monge-Kantorovich depth, quantiles, ranks and signs
- A course in time series analysis. Lectures of the ECAS '97, Madrid, Spain, September 15--19, 1997
- Heterogeneous connection effects
- Local Harmonic Estimation in Musical Sound Signals
Cited In (27)
- Digital archives as big data
- Dynamic data science and official statistics
- Divide and recombine (D{\&}R) data science projects for deep analysis of big data and high computational complexity
- Data science vs. statistics: two cultures?
- Principles of experimental design for big data analysis
- Founding of the Big Data Statistics Branch
- Foreword. Applications of data sciences
- A practical guide to big data
- Big data and biostatistics: the death of the asymptotic Valhalla
- Big questions, informative data, excellent science
- Data learning from big data
- Big data: some statistical issues
- Journeys in big data statistics
- Statistics in the big data era: failures of the machine
- The future of statistics and data science
- Statistical methods and computing for big data
- Big data: a geometric explanation of a seemingly counterintuitive strategy
- Leveraging big data for official statistics: some recent developments
- A letter from the guest editors
- Data science and productivity analytics
- Big data: the next challenge for statistics
- The scientific problems and methodology of big data
- Using Ramsey theory to measure unavoidable spurious correlations in big data
- A taxonomy of big data for optimal predictive machine learning and data mining
- Recent developments in complex and spatially correlated functional data
- Local influence diagnostics with forward search in regression analysis
- Challenges in data science: a complex systems perspective
Uses Software
This page was built for publication: Data science, big data and statistics
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2273155)