| Literature DB >> 32664690 |
Wei Perng1,2, Stella Aslibekyan3.
Abstract
Advancements in high-throughput technologies have made it feasible to study thousands of biological pathways simultaneously for a holistic assessment of health and disease risk via 'omics platforms. A major challenge in 'omics research revolves around the reproducibility of findings-a feat that hinges upon balancing false-positive associations with generalizability. Given the foundational role of reproducibility in scientific inference, replication and validation of 'omics findings are cornerstones of this effort. In this narrative review, we define key terms relevant to replication and validation, present issues surrounding each concept with historical and contemporary examples from genomics (the most well-established and upstream 'omics), discuss special issues and unique considerations for replication and validation in metabolomics (an emerging field and most downstream 'omics for which best practices remain yet to be established), and make suggestions for future research leveraging multiple 'omics datasets.Entities:
Keywords: genomics; integrative ‘omics; metabolomics; replication; validation; ‘omics
Year: 2020 PMID: 32664690 PMCID: PMC7408356 DOI: 10.3390/metabo10070286
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Best practices for replication and validation of ‘omics findings.
|
|
| ● A priori calculations of the sample size required to detect a realistic effect. |
| ● Use of publicly available ‘omics datasets and/or pursuit of collaborations with other cohorts/consortia to maximize statistical power. |
| ● Stringent and appropriate corrections for multiple testing. |
| ● Transparent reporting of all relevant methods (from the laboratory work to bioinformatics pipelines, to data cleaning, to formal data analysis), features, and results (including those that failed to establish reproducibility). |
|
|
| ● Harmonizing data across the discovery and validation stages to reduce the likelihood of non-reproducible findings due to systemic differences. |
| ● Inclusion of diverse datasets in validation efforts and the use of appropriate statistical methods to account for the resulting heterogeneity. |
| ● Judicious incorporation of functional annotations and effect sizes (in addition to statistical significance) when selecting features for validation and interpreting findings. |
| ● Distinguishing between reproducibility, functional relevance, and predictive validity, and using the appropriate metrics for each. |
|
|
| ● Original (discovery) and confirmatory (replication) populations should be similar in terms of sex, age, and race/ethnic distributions. |
| ● Use of identical laboratory procedures, data processing pipelines, and analytical approaches. |