| Literature DB >> 25071867 |
Sarah A Pendergrass1,2, Marquitta J White3,4,5, Nuri Kodaman3,4,5, Timothy H Ciesielski3,4, Rafal S Sobota3,4,5, Minjun Huang3, Jacquelaine Bartlett4, Jing Li3, Qinxin Pan3, Jiang Gui4,6, Scott B Selleck2, Christopher I Amos4,6, Marylyn D Ritchie1,2, Jason H Moore3,4,6, Scott M Williams3,4.
Abstract
In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true associations. Unfortunately this standard validation approach cannot completely eliminate false positive conclusions, and it can also mask many true associations that might otherwise advance our understanding of pathology. These issues beg the question: How can we increase the amount of knowledge gained from high throughput genetic data? To address this challenge, we present an approach that complements standard statistical validation methods by drawing attention to both potential false negative and false positive conclusions, as well as providing broad information for directing future research. The Diverse Convergent Evidence approach (DiCE) we propose integrates information from multiple sources (omics, informatics, and laboratory experiments) to estimate the strength of the available corroborating evidence supporting a given association. This process is designed to yield an evidence metric that has utility when etiologic heterogeneity, variable risk factor frequencies, and a variety of observational data imperfections might lead to false conclusions. We provide proof of principle examples in which DiCE identified strong evidence for associations that have established biological importance, when standard validation methods alone did not provide support. If used as an adjunct to standard validation methods this approach can leverage multiple distinct data types to improve genetic risk factor discovery/validation, promote effective science communication, and guide future research directions.Entities:
Keywords: Complex disease; False negatives; False positives; GWAS; Heterogeneity; Omics; Replication; Type 1 error; Type 2 error; Validation
Year: 2014 PMID: 25071867 PMCID: PMC4112852 DOI: 10.1186/1756-0381-7-10
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Figure 1Scoring system concept for prioritizing research findings in complex disease research. Heat Map image adapted from [47]. Manhattan Plot image adapted from [48]. PubMed image adapted from the PubMed database website (http://www.ncbi.nlm.nih.gov/pubmed[16]) after typing in “ppar gamma” (as seen on June 9, 2014). Pathway/network image adapted from [49]. Microscopy images adapted from [50]. Mouse images adapted from [51]. Underlying images adapted from [47-51] were published under the creative commons attribution license which allows for re-use without permission (http://www.plosone.org/static/licensehttp://creativecommons.org/licenses/by/3.0/http://creativecommons.org/licenses/by/3.0/legalcode).
Diverse Convergent Evidence (DiCE) point system concept
| Single Significant Finding | Evidence from PubMed, KEGG, GEO, GO or etc. linking the factor to the pathophysiology | Evidence from animal or cell/molecular models demonstrating a role of the factor in the pathophysiology |
| Yes (1 point) | Yes (3 points) | Yes (3 points) |
| No (0 points) | No (0 points) | No (0 points) |
| | | |
| Standard Statistical Validation (3 points) or | | |
| Alternative Statistical Validation (2 points) or | | |
| No Statistical Validation (0 points) |
Figure 2Hypothetical DiCE scoring system implementation. Underlying Manhattan Plot image adapted from [48]. The underlying image adapted from [48] was published under the creative commons attribution license which allows for re-use without permission (http://www.plosone.org/static/licensehttp://creativecommons.org/licenses/by/3.0/http://creativecommons.org/licenses/by/3.0/legalcode).
Results of implementing the DiCE evidence scoring system in four contexts
| | Single finding | Validation | | | |
| Hemoglobin S and malaria resistance (a positive control that would not be detected with traditional methods) | 1 | 2 | 3 | 3 | 9 |
| 1 | 2 | 3 | 3 | 9 | |
| 1 | 0 | 0 | 0 | 1 | |
| 0 | 0 | 3 | 3 | 6 | |
*a total score of 6–10 is considered strong evidence.