B-scaling: a novel nonparametric data fusion method
From MaRDI portal
Publication:2170382
DOI: 10.1214/21-AOAS1537 · zbMATH Open: 1498.62011 · arXiv: 2109.09940 · OpenAlex: W3201530611 · MaRDI QID: Q2170382 · FDO: Q2170382
Authors: Yanyan Li
Publication date: 5 September 2022
Published in: The Annals of Applied Statistics
Abstract: Often, for the same scientific question, several techniques or experiments measure the same numerical quantity. Historically, various methods have been developed to exploit the information within each type of data independently. However, statistical data fusion methods that can effectively integrate multi-source data under a unified framework are lacking. In this paper, we propose a novel data fusion method, called B-scaling, for integrating multi-source data. Consider measurements that are generated from different sources but measure the same latent variable through linear or nonlinear mappings. We seek a representation of the latent variable, named the B-mean, which captures the common information contained in the measurements while taking into account the nonlinear mappings between them and the latent variable. We also establish the asymptotic property of the B-mean and apply the proposed method to integrate multiple histone modifications and DNA methylation levels to characterize the epigenomic landscape. Both numerical and empirical studies show that B-scaling is a powerful data fusion method with broad applications.
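Illustration of the setting: the data model described in the abstract (several sources measuring one latent variable through linear or nonlinear mappings) can be made concrete with a small simulation. The sketch below is not the B-scaling method or the B-mean; it only generates toy data and applies a naive rank-averaging baseline. The three mappings, the noise level, and all function names are assumptions chosen for illustration.

```python
import math
import random


def simulate_sources(n, seed=0):
    """Draw a latent variable and three noisy measurements of it.

    Each source applies a different monotone mapping (hypothetical choices).
    """
    rng = random.Random(seed)
    latent = [rng.gauss(0.0, 1.0) for _ in range(n)]
    mappings = [
        lambda z: 2.0 * z + 1.0,  # linear mapping
        lambda z: math.exp(z),    # nonlinear, monotone
        lambda z: z ** 3,         # nonlinear, monotone
    ]
    sources = [[f(z) + rng.gauss(0.0, 0.1) for z in latent] for f in mappings]
    return latent, sources


def ranks(values):
    """Return 0-based ranks of the values (ties broken by position)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order):
        result[index] = rank
    return result


def rank_average(sources):
    """Naive fusion baseline (not B-scaling): average the per-source ranks."""
    per_source = [ranks(s) for s in sources]
    n = len(sources[0])
    return [sum(r[i] for r in per_source) / len(per_source) for i in range(n)]


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman rank correlation, computed as Pearson on the ranks."""
    return pearson(ranks(x), ranks(y))


latent, sources = simulate_sources(500)
fused = rank_average(sources)
rho = spearman(latent, fused)
print("Spearman correlation between latent variable and fused score:", rho)
```

Ranks are used here because they are invariant to monotone mappings, which makes the baseline insensitive to the particular nonlinearities assumed above. It would not handle non-monotone mappings, which is one reason a dedicated method is of interest.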
Full work available at URL: https://arxiv.org/abs/2109.09940
Recommendations
- Fused Lasso approach in regression coefficients clustering -- learning parameter heterogeneity in data integration
- A graph theoretical approach to data fusion
- Statistical data fusion
- Imputation in Data Fusion of Heterogeneous Data Sets A Model-Based Numerical Experiment
- Bayesian multidimensional scaling procedure with variable selection
Computational methods for problems pertaining to statistics (62-08)
Applications of statistics to biology and medical sciences; meta analysis (62P10)
Cites Work
- Measurement Error in Nonlinear Models
- Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis
- On Directional Regression for Dimension Reduction
- Multidimensional scaling. I: Theory and method
- Linear statistical models.
- Linear operator‐based statistical analysis: A useful paradigm for big data
Cited In (1)