| Literature DB >> 26448358 |
Frederick E Dewey1, Megan E Grove1, James R Priest2, Daryl Waggott3, Prag Batra3, Clint L Miller4, Matthew Wheeler1, Amin Zia5, Cuiping Pan5, Konrad J Karzcewski6, Christina Miyake7, Michelle Whirl-Carrillo8, Teri E Klein8, Somalee Datta9, Russ B Altman8, Michael Snyder5, Thomas Quertermous4, Euan A Ashley10.
Abstract
High throughput sequencing has facilitated a precipitous drop in the cost of genomic sequencing, prompting predictions of a revolution in medicine via genetic personalization of diagnostic and therapeutic strategies. There are significant barriers to realizing this goal that are related to the difficult task of interpreting personal genetic variation. A comprehensive, widely accessible application for interpretation of whole genome sequence data is needed. Here, we present a series of methods for identification of genetic variants and genotypes with clinical associations, phasing genetic data and using Mendelian inheritance for quality control, and providing predictive genetic information about risk for rare disease phenotypes and response to pharmacological therapy in single individuals and father-mother-child trios. We demonstrate application of these methods for disease and drug response prognostication in whole genome sequence data from twelve unrelated adults, and for disease gene discovery in one father-mother-child trio with apparently simplex congenital ventricular arrhythmia. In doing so we identify clinically actionable inherited disease risk and drug response genotypes in pre-symptomatic individuals. We also nominate a new candidate gene in congenital arrhythmia, ATP2B4, and provide experimental evidence of a regulatory role for variants discovered using this framework.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26448358 PMCID: PMC4598191 DOI: 10.1371/journal.pgen.1005496
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Summary of STMP tiers 1–3 in whole genome sequence data from twelve unrelated adults recruited from primary care clinics.
| Variant classification | Definition | Median (range)–all variants | Median (range)–filtered by cohort allele frequency | % private | Variant alleles present in all participants, n | Variant alleles present in all East Asians, n |
|---|---|---|---|---|---|---|
| Genetic variants with previously reported disease associations | ||||||
| Tier 1 | Loss of function | 8.5 (6–14) | 3.5 (2–9) | 15 (0–36) | 0 | 0 |
| Tier 2 | Rare non-LOF variants | 147 (124–164) | 14 (5–76) | 3 (2–36) | 1 | 8 |
| Tier 3 | Common missense and nonframeshift indel variants | 148.5 (133–154) | 29 (26–68) | 6 (4–22) | 2 | 13 |
| Rare | — | 62,453 (59,813–66,207) | ||||
| Tier 1 | Loss of function | 13 (11–19) | 6 (3–13) | 23 (7–47) | 2 | 2 |
| Tier 2 | Missense/nonframeshift+ conserved | 53.5 (48–59) | 50.5 (42–55) | 66 (54–73) | 1 | 1 |
| Tier 3 | Missense/nonframeshift+ predicted pathogenic | 5 (4–8) | 5 (4–8) | 100 (60 - 100) | 0 | 0 |
*As defined by allele frequency < 0.01.
Abbreviations: LOF, loss of function.