Detecting simultaneous variant intervals in aligned sequences
From MaRDI portal
Abstract: Given a set of aligned sequences of independent noisy observations, we are concerned with detecting intervals where the mean values of the observations change simultaneously in a subset of the sequences. The intervals of changed means are typically short relative to the length of the sequences, the subset where the change occurs, the "carriers," can be relatively small, and the sizes of the changes can vary from one sequence to another. This problem is motivated by the scientific problem of detecting inherited copy number variants in aligned DNA samples. We suggest a statistic based on the assumption that for any given interval of changed means there is a given fraction of samples that carry the change. We derive an analytic approximation for the false positive error probability of a scan, which is shown by simulations to be reasonably accurate. We show that the new method usually improves on methods that analyze a single sample at a time and on our earlier multi-sample method, which is most efficient when the carriers form a large fraction of the set of sequences. The proposed procedure is also shown to be robust with respect to the assumed fraction of carriers of the changes.
Recommendations
- Detecting transcriptomic structural variants in heterogeneous contexts via the multiple compatible arrangements problem
- scientific article; zbMATH DE number 5296600
- scientific article; zbMATH DE number 2087058
- Large scale multiple sequence alignment with simultaneous phylogeny inference
- Comparative Genomics
Cites work
- A Modified Bayes Information Criterion with Applications to the Analysis of Comparative Genomic Hybridization Data
- A new representation for a renewal-theoretic constant appearing in asymptotic approximations of large deviations
- Circular binary segmentation for the analysis of array-based DNA copy number data
- Detecting simultaneous change points in multiple sequences
- Error Distribution for Gene Expression Data
- Self-organization and the dynamical nature of ventricular fibrillation
- Tail approximations for maxima of random fields by likelihood ratio transformations
- Tail probabilities for the null distribution of scanning statistics
- The statistics of gene mapping
Cited in
(31)- Asymptotic distribution-free change-point detection based on interpoint distances for high-dimensional data
- BayesProject: fast computation of a projection direction for multivariate changepoint detection
- Sequential tests controlling generalized familywise error rates
- Graph-based change-point detection
- Multiple testing with the structure-adaptive Benjamini-Hochberg algorithm
- Sequential detection of transient signal by moving likelihood ratio statistic in an exponential family
- Change-points: from sequential detection to biology and back
- Scan statistics for detecting a local change in variance for normal data with known variance
- On extremal index of max-stable random fields
- Multi-sensor slope change detection
- Higher criticism: \(p\)-values and criticism
- An online copy number variant detection method for short sequencing reads
- Detecting simultaneous change points in multiple sequences
- Scan statistics on Poisson random fields with applications in genomics
- Change-point model on nonhomogeneous Poisson processes with application in copy number profiling by next-generation DNA sequencing
- Some clustering-based change-point detection methods applicable to high dimension, low sample size data
- A novel change-point approach for the detection of gas emission sources using remotely contained concentration data
- Collective Anomaly Detection in High-Dimensional Var Models
- Sequential model selection-based segmentation to detect DNA copy number variation
- Author's response
- scientific article; zbMATH DE number 2087058 (Why is no real title available?)
- A combined SR-CUSUM procedure for detecting common changes in panel data
- Testing randomness online
- Simultaneous discovery of rare and common segment variants
- A comparative study on sequential detection of random mean change in multivariate normal data stream
- Exact tests for offline changepoint detection in multichannel binary and count data with application to networks
- Online multivariate changepoint detection with type I error control and constant time/memory updates per series
- Discussion on “Change-Points: From Sequential Detection to Biology and Back” by David Siegmund
- Sequential multi-sensor change-point detection
- Detection of subtle variations as consensus motifs
- Optimal detection of multi-sample aligned sparse signals
This page was built for publication: Detecting simultaneous variant intervals in aligned sequences
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q641121)