| Literature DB >> 27418169 |
Slavé Petrovski1,2, David B Goldstein3.
Abstract
An important application of modern genomics is diagnosing genetic disorders. We use the largest publicly available exome sequence database to show that this key clinical service can currently be performed much more effectively in individuals of European genetic ancestry.Entities:
Keywords: Clinical diagnostics; Disease-associated genes; Genetic ancestry; Genetic variation; Geographic ancestry; Healthcare inequality; Next generation sequencing; Precision medicine; Rare variants; Sequence interpretation
Mesh:
Year: 2016 PMID: 27418169 PMCID: PMC4944427 DOI: 10.1186/s13059-016-1016-y
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1a Percentage representation of the 5965 IGM reference cohort across six geographic ancestry groupings. b A semi-transparent overlaid histogram representing the tally of candidate variants between IGM’s 5094 European (Eu) individuals (blue) and the collection of non-European individuals (red) (Mann–Whitney U test p < 1 × 10−320). The non-European distribution reflects individuals with a: Latino ethnicity (La), East Asian (EaAs), South Asian (SoAs), primarily African (Af), and unassigned (Un) ancestry. Estimates indicate the mean number of singleton non-synonymous variants among OMIM disease-associated genes. Singleton variants are identified based on a reference cohort of 5965 IGM sequenced samples. c Percentage representation of the combined 66,217 IGM and ExAC reference cohorts across six geographic ancestry / ethnic groupings. d Similar to b but singleton variants were identified based on the absence among the combined IGM and ExAC reference cohorts accumulating to 66,217 samples
Group summaries for the number of singleton non-synonymous candidate variants in OMIM disease-associated genes among IGM’s 5965 samples
| Geographic ancestry / ethnic group | Number of individuals | Number of singletons using internal reference cohort (n = 5,965) | Number of singletons using internal and ExAC reference cohorts (n = 66,217) | ||||
|---|---|---|---|---|---|---|---|
| Median | Mean | SD | Median | Mean | SD | ||
| European | 5,094 | 15 | 15.2 | 5.0 | 6 | 6.6 | 3.0 |
| African (African American) | 505 | 29 | 29.7 | 8.3 | 9 | 9.9 | 4.8 |
| Latino ethnicity | 93 | 28 | 28.3 | 6.0 | 10 | 10.9 | 5.5 |
| East Asian | 61 | 51 | 50.2 | 9.1 | 12 | 12.3 | 4.2 |
| South Asian | 38 | 49.5 | 50.5 | 7.4 | 12 | 12.7 | 3.8 |
| Unassigned | 174 | 24 | 22.7 | 11.9 | 8 | 8.3 | 5.8 |
Data reflect using only the internal reference cohort and then subsequently supplementing the IGM internal reference cohort with variant information from the ExAC reference cohort of 60,252 controls of convenience
SD standard deviation