| Literature DB >> 26172475 |
Jun Ding1, Carlo Sidore2, Thomas J Butler1, Mary Kate Wing3, Yong Qian1, Osorio Meirelles1, Fabio Busonero4, Lam C Tsoi3, Andrea Maschio4, Andrea Angius5, Hyun Min Kang3, Ramaiah Nagaraja1, Francesco Cucca6, Gonçalo R Abecasis3, David Schlessinger1.
Abstract
DNA sequencing identifies common and rare genetic variants for association studies, but studies typically focus on variants in nuclear DNA and ignore the mitochondrial genome. In fact, analyzing variants in mitochondrial DNA (mtDNA) sequences presents special problems, which we resolve here with a general solution for the analysis of mtDNA in next-generation sequencing studies. The new program package comprises 1) an algorithm designed to identify mtDNA variants (i.e., homoplasmies and heteroplasmies), incorporating sequencing error rates at each base in a likelihood calculation and allowing allele fractions at a variant site to differ across individuals; and 2) an estimation of mtDNA copy number in a cell directly from whole-genome sequencing data. We also apply the methods to DNA sequence from lymphocytes of ~2,000 SardiNIA Project participants. As expected, mothers and offspring share all homoplasmies but a lesser proportion of heteroplasmies. Both homoplasmies and heteroplasmies show 5-fold higher transition/transversion ratios than variants in nuclear DNA. Also, heteroplasmy increases with age, though on average only ~1 heteroplasmy reaches the 4% level between ages 20 and 90. In addition, we find that mtDNA copy number averages ~110 copies/lymphocyte and is ~54% heritable, implying substantial genetic regulation of the level of mtDNA. Copy numbers also decrease modestly but significantly with age, and females on average have significantly more copies than males. The mtDNA copy numbers are significantly associated with waist circumference (p-value = 0.0031) and waist-hip ratio (p-value = 2.4×10-5), but not with body mass index, indicating an association with central fat distribution. To our knowledge, this is the largest population analysis to date of mtDNA dynamics, revealing the age-imposed increase in heteroplasmy, the relatively high heritability of copy number, and the association of copy number with metabolic traits.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26172475 PMCID: PMC4501845 DOI: 10.1371/journal.pgen.1005306
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Analysis pipeline to identify mtDNA variants includes aligning sequence reads to the whole genome reference (including mtDNA rCRS reference); extracting mtDNA reads; combining mapped and unmapped reads to do a second alignment to the shifted rCRS reference; applying the mtDNA variant caller separately to the reads mapped to the two linear mtDNA reference genomes; and combining the two sets of called mtDNA variants.
Fig 2Sharing of mtDNA variants among 2,077 SardiNIA sequencing project participants.
Fig 3Two classes of base changes for homoplasmies and heteroplasmies.
Fig 4Distribution of homoplasmies and heteroplasmies in four functional categories.
Fig 5The effect of age on the number of heteroplasmies in the unrelated SardiNIA sequencing project participants.
The number of heteroplasmies increases with age with different (colored) minor allele fraction (MAF) thresholds. Each line plots the expected number of heteroplasmies based on the Poisson loglinear model against age at an MAF threshold; while the points represent the observed mean number of heteroplasmies in each age group (<40, 40–50, 50–60, 60–70, 70–80, >80).
The sharing of mtDNA variants in parent-child trios.
| Comparing child with parents | Comparing mother with father | |||||
|---|---|---|---|---|---|---|
| # families | # variants in Child | # shared with mother | # shared with father | # variants in mother | # shared with father | |
| Homoplasmies | 333 | 7273 | 7238 (99.5%) | 2940 (40.4%) | 7266 | 2937 (40.4%) |
| Heteroplasmies | 333 | 207 | 66 (31.9%) | 1 (0.4%) | 206 | 0 (0%) |
The effect of mtDNA copy number on a set of anthropometric, frailty, and lipid traits.
| Traits | Testing the effect of copy number | ||
|---|---|---|---|
| Effect | t value | p-value | |
| Height | 0.0025 | 0.44 | 0.66 |
| Weight | -0.0059 | -0.55 | 0.58 |
| BMI | -0.0032 | -0.80 | 0.42 |
| Waist | -0.0296 | -2.96 | 0.0031 |
| Waist-hip ratio | -0.00028 | -4.24 | 0.000024 |
| Walking speed | -0.00022 | -1.00 | 0.32 |
| Grip strength | 0.0130 | 1.60 | 0.11 |
| HDL-cholesterol | -0.0099 | -0.75 | 0.46 |
| LDL-cholesterol | -0.0082 | -0.25 | 0.80 |
*Effect is the slope of the linear regression model, which corresponds to the expected mean change of the trait per one unit increase of the mtDNA copy number. The regression models also include age, sex, and mean nuclear DNA sequence coverage as covariates.