| Literature DB >> 26221186 |
David R Crosslin1, Peggy D Robertson2, David S Carrell3, Adam S Gordon2, David S Hanna4, Amber Burt5, Stephanie M Fullerton6, Aaron Scrol3, James Ralston4, Kathleen Leppig4, Andrea Hartzler4, Eric Baldwin4, Mariza de Andrade7, Iftikhar J Kullo8, Gerard Tromp9, Kimberly F Doheny10, Marylyn D Ritchie11, Paul K Crane12, Deborah A Nickerson2, Eric B Larson3, Gail P Jarvik1.
Abstract
BACKGROUND: In an effort to return actionable results from variant data to electronic health records (EHRs), participants in the Electronic Medical Records and Genomics (eMERGE) Network are being sequenced with the targeted Pharmacogenomics Research Network sequence platform (PGRNseq). This cost-effective, highly-scalable, and highly-accurate platform was created to explore rare variation in 84 key pharmacogenetic genes with strong drug phenotype associations.Entities:
Year: 2015 PMID: 26221186 PMCID: PMC4517371 DOI: 10.1186/s13073-015-0181-z
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Summary statistics of demographic data for the eMERGE participants with PGRNSeq data by self-reported sex and race
| Female | Male | Combined | |
|---|---|---|---|
| ( | ( | ( | |
| Self-Reported Race | |||
| American Indian orAlaska Native | 2 | 1 | 1 |
| Asian | 9 | 5 | 8 |
| Black or African American | 5 | 3 | 5 |
| Native Hawaiian or otherPacific Islander | 0 | 0 | 0 |
| Unknown | 1 | 5 | 2 |
| White | 83 | 86 | 84 |
| Ethnicity | |||
| No | 95 | 94 | 95 |
| Yes | 4 | 2 | 3 |
| Unknown | 0 | 5 | 2 |
| Hx | 0 | 1 | 0 |
| Hx of Hypertension | 79 | 80 | 79 |
| Hx of Arrhythmia | 48 | 53 | 50 |
| Hx of AF | 7 | 14 | 10 |
| Hx of CHF | 1 | 3 | 2 |
| Median LDL | 9 | 6 | 8 |
| Median Triglyceride | 6 | 11 | 8 |
| Median CK | 1 | 7 | 3 |
| Median TSH | 6 | 4 | 5 |
| Median INR >1.5 | 7 | 11 | 9 |
| Total PGx Variants(total variants) | 75.8±15.9 | 74.3±12.8 | 75.2±14.7 |
| Coding PGx Variants(coding variants) | 11.6±4.2 | 11.4±3.5 | 11.5±3.9 |
| Group Health PGx Variants (gh variants) | 40.0±9.6 | 38.5±8.7 | 39.4±9.3 |
The continuous variables total variants, coding variants, and gh variants are presented in terms of mean and ± standard deviation
aHx = History found in the EHR
Fig. 1PGx Variants by Participants (27 Genes). Distributions of total variants, high-moderate impact coding variants, and total variants for the following genes: 1. CACNA1S and RYR1 (malignant hyperthermia); 2. SCN5A, KCNH2, and RYR2 (arrhythmia); and 3. LDLR (high cholesterol)
Fig. 2Total Variants vs. Coding Variants. Total number of variants versus the number of high/moderate impact coding variants by participant. The 27 genes are listed in Additional file 4: Table S2, and the impact assignment is according to SnpEff annotation tool
Summary statistics of variant-level data for the eMERGE participants by effect prediction, whether coding with high/moderate impact and lower impact
| Impact | Other | High/Moderate | Combined |
|---|---|---|---|
| ( | ( | ( | |
| Variant Function | |||
| 3-prime-UTR | 26 | 0 | 18 |
| 5-prime-UTR | 6 | 0 | 4 |
| intron | 30 | 0 | 21 |
| intron-near-splice | 2 | 0 | 1 |
| missense | 0 | 95 | 29 |
| missense-near-splice | 0 | 2 | 1 |
| non-coding-exon | 0 | 0 | 0 |
| splice-acceptor | 0 | 1 | 0 |
| splice-donor | 0 | 0 | 0 |
| stop-gained | 0 | 2 | 1 |
| synonymous | 36 | 0 | 25 |
| synonymous-near-splice | 0 | 0 | 0 |
| Prior rsID | 67 | 71 | 68 |
| Prior Clinical Association | 6 | 18 | 9 |
| OMIM | 0 | 1 | 1 |
| ClinVar | 1 | 3 | 2 |
| HGMD | 2 | 30 | 10 |
| Gene List | |||
|
| 9 | 8 | 9 |
|
| 5 | 6 | 5 |
|
| 0 | 1 | 1 |
|
| 4 | 9 | 6 |
|
| 2 | 3 | 3 |
|
| 4 | 4 | 4 |
|
| 11 | 10 | 11 |
|
| 4 | 2 | 3 |
|
| 0 | 0 | 0 |
|
| 4 | 5 | 4 |
|
| 5 | 1 | 4 |
|
| 7 | 4 | 6 |
|
| 7 | 3 | 6 |
|
| 1 | 2 | 1 |
|
| 11 | 19 | 14 |
|
| 12 | 8 | 11 |
|
| 8 | 8 | 8 |
|
| 2 | 5 | 3 |
|
| 3 | 2 | 3 |
|
| 2 | 1 | 2 |
| Average Depth | 295/401/473 | 347/436/499 | 308/413/480 |
| GERP | −3.44/−0.47/1.61 | 2.01/3.90/4.82 | −2.31/0.49/3.50 |
| CADD | 2.12/5.37/8.66 | 10.34/14.36/18.59 | 3.10/7.28/11.70 |
Effect prediction details can be found on the SnpEff web site (Web Resources). In general, single nucleotide variants, structural variation, and copy number variation annotated as having high or moderate impact modify the coding and/or splice regions of a given gene. For the continuous variables average depth, GERP, and CADD, the three numbers represent quartiles of the distributions (25,50, and 75)