| Literature DB >> 24627758 |
Gustavo Glusman1, Mike Cariaso2, Rafael Jimenez3, Daniel Swan4, Bastian Greshake5, Jong Bhak6, Darren W Logan7, Manuel Corpas8.
Abstract
Direct-to-consumer (DTC) genetic testing is a recent commercial endeavor that allows the general public to access personal genomic data. The growing availability of personal genomic data has in turn stimulated the development of non-commercial tools for DTC data analysis. Despite this new wealth of public resources, no systematic research has been carried out to assess these tools for interpretation of DTC data. Here, we provide an initial analysis benchmark in the context of a whole family, using single nucleotide polymorphism (SNP) data. Five blood-related DTC SNP chip data tests were analyzed in conjunction with one whole exome sequence. We report findings related to genomic similarity between individuals, genetic risks and an overall assessment of data quality; thus providing an evaluation of the current potential of public domain analysis tools for personal genomics. We envisage that as the use of personal genome tests spreads to the general population, publicly available tools will have a more prominent role in the interpretation of genomic data in the context of health risks and ancestry.Entities:
Year: 2012 PMID: 24627758 PMCID: PMC3941016 DOI: 10.12688/f1000research.1-3.v1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Family tree analyzed using DTC genotyping services.
Squares and circles denote male and female respectively. Filled shapes represent those for which genome data is available. ‘Son’, the individual whose exome was sequenced, is denoted with a red diamond. Other family members include Father, Mother, Daughter and Aunt, who is Mother’s sister.
Figure 2. Admixture analysis of individuals from Southern Europe from the Eurogenes Genetic Ancestry Project.
Mother (ES7) is denoted by a red arrow and Father (ES8) by a blue arrow. Mother and Father are the only family individuals included here as they have the most divergent genotypes within the family.
Summary of reported no-call rates for all 23andMe chips included in this study.
| Family member | No call rate | Chip version |
|---|---|---|
| Mother | 0.24% | v3 |
| Father | 0.21% | v3 |
| Daughter | 0.19% | v3 |
| Aunt | 0.17% | v3 |
| Son | 0.16% | v2 |
Figure 3. A distribution of all the different occurring genotypes as a percentage of the total for all individuals is shown.
For the purposes of unbiased comparison, only autosome data is included. I and D indicate insertion and deletion, respectively. Son’s percentages (v2) show slight differences to all other v3 individuals whose genotype proportions are more similar.
Number of heterozygous sites mistakenly reported as homozygous (based on the undercall rate, in autosomes).
| Member | Undercall | Heterozygous to
|
|---|---|---|
| Son | 0.25% | 661 |
| Daughter | 0.53% | 2007 |
| Mother | 0.60% | 2269 |
| Father | 0.50% | 2010 |
| Aunt | 0.51% | 1905 |
Mendelian Inheritance errors as estimated by direct parent/offspring relations.
| Relation | MIEs |
|---|---|
| Son/Father: | 36 |
| Son/Mother: | 24 |
| Daughter/Father: | 108 |
| Daughter/Mother: | 129 |
Incompatible sites between remaining relationships, identified as pseudo-MIEs.
| Relation | Pseudo-MIEs |
|---|---|
| Son/Daughter [siblings]: | 7,777 |
| Mother/Aunt [siblings]: | 15,401 |
| Son/Aunt: | 17,390 |
| Daughter/Aunt: | 30,215 |
| Father/Mother [unrelated]: | 53,937 |
| Father/Aunt [unrelated]: | 54,522 |
Example of hemizygous site reported as homozygous.
| Status | Father | Mother | Daughter |
|---|---|---|---|
| Reported | GT | GG | TT |
| Actual | GT | G- | T- |
Similarity comparison of all-against-all family genotypes plus a non-European (non-CEU) male of Indian ethnic background.
Matches denote identical genotypes for the same SNP (e.g. AA/AA); half-match, only one of the alleles is identical (e.g. AT/AA) and conflict means both alleles are different (e.g. CG/AT).
| Mother | matches:
| 930342
| 100.0%
| ||||||||||
| identity:
| 930342
| 100.0%
| |||||||||||
| total: | 930342 | 100.0% | |||||||||||
| Father | matches:
| 537331
| 57.8%
| 930342
| 100.0%
| ||||||||
| identity:
| 704910
| 75.8%
| 930342
| 100.0%
| |||||||||
| total: | 930342 | 100.0% | 930342 | 100.0% | |||||||||
| Daughter | matches:
| 650399
| 69.9%
| 653926
| 70.3%
| 930342
| 100.0%
| ||||||
| identity:
| 788566
| 84.8%
| 790668
| 85.0%
| 930342
| 100.0%
| |||||||
| total: | 930342 | 100.0% | 930342 | 100.0% | 930342 | 100.0% | |||||||
| Aunt | matches:
| 661316
| 71.1%
| 537723
| 57.8%
| 586549
| 63.0%
| 930342
| 100.0%
| ||||
| identity:
| 786455
| 84.5%
| 705254.5
| 75.8%
| 742016
| 79.7%
| 930342
| 100.0%
| |||||
| total: | 930342 | 100.0% | 930342 | 100.0% | 930342 | 100.0% | 930342 | 100.0% | |||||
| Son | matches:
| 357419
| 67.7%
| 358003
| 67.8%
| 386014
| 73.1%
| 321410
| 60.9%
| 556694
| 100.0%
| ||
| identity:
| 441760
| 83.7%
| 442175
| 83.8%
| 452361
| 85.7%
| 415285
| 78.7%
| 556694
| 100.0%
| |||
| total: | 527908 | 100.0% | 527908 | 100.0% | 527908 | 100.0% | 527908 | 100.0% | 556694 | 100.0% | |||
| Non-CEU | matches:
| 530234
| 57.0%
| 528933
| 56.9%
| 528747
| 56.8%
| 530171
| 57.0%
| 283484
| 53.6%
| 934670
| 100.0%
|
| identity:
| 700638
| 75.3%
| 699753
| 75.2%
| 700164
| 75.3%
| 700881
| 75.4%
| 388030
| 73.4%
| 934670
| 100.0%
| |
| total: | 930147 | 100.0% | 930147 | 100.0% | 930147 | 100.0% | 930147 | 100.0% | 528766 | 100.0% | 934670 | 100.0% | |
| Mother | Father | Daughter | Aunt | Son | Non-CEU | ||||||||
Figure 5. A graphical representation of the SNPs from chromosome 1 for Mother com pared with herself, Father, Son, Sister, Aunt and non-CEU.
Each pixel represents a SNP. Light blue represents match, dark blue half-match and red conflict. SNPs in Son that are not present in the genotypes of the other individuals are represented in grey.
Figure 4. Inheritance State Consistency Analysis (ISCA) plot for the Father-Mother-Son-Daughter quartet, depicting for each autosome the number of informative SNPs supporting each of the four possible inheritance states: "identical" ("id", red), "haploidentical maternal" ("hm", green), "haploidentical paternal" ("hp", yellow) and "nonidentical" ("ni", blue).
SNPs consistent with two inheritance states contribute 0.5 weight to each. SNP counts are binned in non-overlapping 1 Mb windows; within each window, the four inheritance states are sorted by decreasing level of support. Regions without support typically overlap centromeric repeats and heterochromatic regions. Pie chart inset: fraction of the genome observed in each inheritance state.
A comparison of all SNPedia annotations with Magnitude >= 3 for all family members.
Traits have been classified according to the general condition they relate. Red boxes are indicative of a particular phenotype being predicted in the individual. Descriptions for every matched phenotype, extracted directly from SNPedia, are shown in the right column.
| Condition | Mother | Father | Daughter | Aunt | Son | Phenotype |
|---|---|---|---|---|---|---|
| Baldness | 7x risk of baldness according to an article in Nature. That site may require paid access; the abstract is accessible. | |||||
| 2x increased risk of baldness 2x increased risk of baldness | ||||||
| Diabetes | Increased risk for type-2 diabetes | |||||
| 1.3x increased risk for type-2 diabetes | ||||||
| Cardiovascular/Thrombosis | 1.7x increased risk for heart disease. People with this genotype and a long history of high blood sugar are at 7x risk of CAD | |||||
| 1.5x increased risk for CAD; 1.5x higher risk for coronary artery disease | ||||||
| 7.3x increased risk of hypertension | ||||||
| Watch out for high fat in diet | ||||||
| 2.6 times higher odds of developing early stent thrombosis | ||||||
| Cancer | Increased risk of various types of cancer. This variant increases risk of numerous types of cancer in many studies. It is in a microRNA | |||||
| 2–3x higher prostate cancer risk if routinely exposed to the pesticide fonofos | ||||||
| Metabolism | You have 2 variations in MTHFR which influence homocystine levels. People with gs193 are more strongly affected. | |||||
| Impaired NSAID drug metabolism, which is a risk factor for gastrointestinal bleeding when taking any of these medications: aceclofenac, celecoxib, diclofenac, ibuprofen, indomethazine, lornoxicam, meloxicam, naproxen, piroxicam, tenoxicam and valdecoxib. You have one of these *CYP2C8*3 (rs11572080 and rs10509681) *CYP2C9*2 (rs1799853) *CYP2C9*3 (rs1057910) | ||||||
| CYP2C19 Intermediate Metabolizer. Your body breaks down some medicines at a slightly slower than normal rate (which is represented by gs150). Individuals with gs152 genotypes have even slower metabolism. *anti-epileptics (such as diazepam, phenytoin, and phenobarbitone) *anti-depressants (such as amitriptyline and clomipramine) *anti-platelet drug clopidogrel (Plavix) *anti-ulcer proton pump inhibitors like omeprazole (trade names Losec and Prilosec), esomeprazole (trade name Nexium), and lansoprazole (Prevacid) *hormones (estrogen, progesterone). | ||||||
| Higher odds of alcoholic liver disease, increased liver fat alcohol seems to be 3x more damaging to your liver than typical. Higher risk for developing fatty liver, fibrosis, and fibrosis progression, with a per allele odds ratio of 2.55, 3.13 and 2.64, respectively. news | ||||||
| Warfarin Metabolism | Approximately 30% of people are intermediate metabolizers of the popular anticoagulant Warfarin and would probably need a decreased dosage. This due to rs1799853 or rs1057910 respectively leading to the CYP2C9*2 or CYP2C9*3 alleles. For prodrugs that require activation by CYP2C9, an alternative treatment or increased dose should be considered. See also gs126 | |||||
| Probably impaired Warfarin metabolism. | ||||||
| Approximately 7–10% of people are poor metabolizers of the popular anticoagulant Warfarin and would probably need a decreased dosage. This due to mutations in rs1799853 or rs1057910 causing an inactive CYP2C9 gene. You are at increased risk of drug-induced side effects due to diminished drug elimination. Prodrugs dependent on CYP2C9 metabolism may fail to generate the active form of the drug. | ||||||
| Miscellaneous | Substantially increased odds of developing V617F-positive MPN. | |||||
| You are heterozygous at all 3 of the SNPs which are known to influence the ability to taste bitterness. This means you are better than average at detecting bitter tastes while young, but that this ability will decrease to less than average during adulthood. As a child you will probably hate brussel sprouts, and by early adulthood will discover that olives and brussel sprouts now taste good. A 2010 study shows the change bitter sensitivity which occurs over the lifespan (from bitter sensitive to less so) is more common in people with this genoset. Children with this genotype could perceive a bitter taste at lower PROP concentrations than could heterozygous adults. The threshold for adolescents was intermediate. The 3 SNPs are rs10246939, rs1726866, rs713598 in the gene TAS2R38. |