Deriving chemosensitivity from cell lines: forensic bioinformatics and reproducible research in high-throughput biology
From MaRDI portal
(Redirected from Publication:965099)
Abstract: High-throughput biological assays such as microarrays let us ask very detailed questions about how diseases operate, and promise to let us personalize therapy. Data processing, however, is often not described well enough to allow for exact reproduction of the results, leading to exercises in "forensic bioinformatics" where aspects of raw data and reported results are used to infer what methods must have been employed. Unfortunately, poor documentation can shift from an inconvenience to an active danger when it obscures not just methods but errors. In this report we examine several related papers purporting to use microarray-based signatures of drug sensitivity derived from cell lines to predict patient response. Patients in clinical trials are currently being allocated to treatment arms on the basis of these results. However, we show in five case studies that the results incorporate several simple errors that may be putting patients at risk. One theme that emerges is that the most common errors are simple (e.g., row or column offsets); conversely, it is our experience that the most simple errors are common. We then discuss steps we are taking to avoid such errors in our own investigations.
Recommendations
- A Compendium to Ensure Computational Reproducibility in High-Dimensional Classification Tasks
- Reproducible Research: A Bioinformatics Case Study
- Measuring reproducibility of high-throughput experiments
- Transparency and reproducibility in data analysis: the prostate cancer prevention trial
- Cross-study validation and combined analysis of gene expression microarray data
Cites work
Cited in
(15)- Reproducible research in statistics: a review and guidelines for the Biometrical Journal
- Liquid chromatography mass spectrometry-based proteomics: biological and technological as\-pects
- Challenges and Opportunities for Statistics in the Next 25 Years
- Reproducibility of biomarker identifications from mass spectrometry proteomic data in cancer studies
- Reproducible research practices: a tool for effective and efficient leadership in collaborative statistics
- Use of pretransformation to cope with extreme values in important candidate features
- Integrating Ethics into the Guidelines for Assessment and Instruction in Statistics Education (GAISE)
- Bayesian nonparametric models for peak identification in MALDI-TOF mass spectroscopy
- Applying statistical thinking to `big data' problems
- Error statistical modeling and inference: where methodology meets ontology
- Transparency and reproducibility in data analysis: the prostate cancer prevention trial
- Book review of: C. Gandrud, Reproducible research with R and Rstudio.
- Deriving chemosensitivity from cell lines: forensic bioinformatics and reproducible research in high-throughput biology
- A practical guide to big data
- Statistical proof? The problem of irreproducibility
This page was built for publication: Deriving chemosensitivity from cell lines: forensic bioinformatics and reproducible research in high-throughput biology
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q965099)