| Literature DB >> 18545700 |
Erhan Bilal1, Raul Rabadan, Gabriela Alexe, Noriyuki Fuku, Hitomi Ueno, Yutaka Nishigaki, Yasunori Fujita, Masafumi Ito, Yasumichi Arai, Nobuyoshi Hirose, Andrei Ruckenstein, Gyan Bhanot, Masashi Tanaka.
Abstract
We report results from the analysis of complete mitochondrial DNA (mtDNA) sequences from 112 Japanese semi-supercentenarians (aged above 105 years) combined with previously published data from 96 patients in each of three non-disease phenotypes: centenarians (99-105 years of age), healthy non-obese males, obese young males and four disease phenotypes, diabetics with and without angiopathy, and Alzheimer's and Parkinson's disease patients. We analyze the correlation between mitochondrial polymorphisms and the longevity phenotype using two different methods. We first use an exhaustive algorithm to identify all maximal patterns of polymorphisms shared by at least five individuals and define a significance score for enrichment of the patterns in each phenotype relative to healthy normals. Our study confirms the correlations observed in a previous study showing enrichment of a hierarchy of haplogroups in the D clade for longevity. For the extreme longevity phenotype we see a single statistically significant signal: a progressive enrichment of certain "beneficial" patterns in centenarians and semi-supercentenarians in the D4a haplogroup. We then use Principal Component Spectral Analysis of the SNP-SNP Covariance Matrix to compare the measured eigenvalues to a Null distribution of eigenvalues on Gaussian datasets to determine whether the correlations in the data (due to longevity) arises from some property of the mutations themselves or whether they are due to population structure. The conclusion is that the correlations are entirely due to population structure (phylogenetic tree). We find no signal for a functional mtDNA SNP correlated with longevity. The fact that the correlations are from the population structure suggests that hitch-hiking on autosomal events is a possible explanation for the observed correlations.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18545700 PMCID: PMC2408726 DOI: 10.1371/journal.pone.0002421
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Age distribution across the samples (JD, ND, HN, KA, PD, TC/GC, SSC).
Figure 2The significance scores for the 1386 maximal patterns of mutations in the data.
The red region represents 124 patterns with q-value<0.05. The pattern enrichment score used here is LP.
Figure 3Significance score L at q-value<0.05 for the enrichment of SNPs in the three data sets – 96 centenarians, 96+112 centenarians and semi supercentenarians, 112 semi-supercentenarians compared to healthy normals.
3206T, 14979C and 8473C are markers for D4a. SNP 10410C is enriched 80% in D4a vs its wild-type form 10410T.
Figure 4Proportion of beneficial patterns in haplogroups = 100× (Number of positive patterns found in haplogroup)/(Total number of patterns found in haplogroup).
In this analysis, we used the 124 positive patterns from Figure 2 with q-value<0.05.
Figure 5Heat map of the agreement matrix for SNPs across significant patterns.
Note the presence of a group of 6 SNPs (upper left hand red box) strongly associated with longevity. These are: 3206T, 10410C, 8473C, 14979C, 152C and 16129A. The ones in bold are the same as in Figure 3 and are markers for D4a. The locations 152 and 16129 are known mutational hotspots in the hypervariable regions HVS-II and HVS-I respectively and hence their correlation with longevity cannot be considered significant.
List of most frequent mtSNPs from enhanced patterns (see Figure 5) in semi-supercentenarians vs. healthy normals.
| SNP | Haplogrooup specificity | rCRS | Mutant | Gene | Sense |
|
|
|
|
|
| |
| 489C | M Clade, J | T | C | HVS-III | |
| 709A | B5, G, M10, B4c, H13a | G | A | 12S rRNA | |
| 961C | None | T | C | 12S rRNA | |
| 3010A | D4 and many other groups | G | A | 16S rRNA | |
|
|
|
|
|
| |
| 4883T | D | C | T | ND2 | Syn |
| 5178A | D | C | A | ND2 | Nsyn |
| 8414T | D4 | C | T | ATP8 | Nsyn |
|
|
|
|
|
|
|
| 8701G | M Clade | A | G | ATP6 | Nsyn |
| 9540C | M Clade | T | C | CO3 | Syn |
| 10398G | None | A | G | ND3 | Nsyn |
| 10400T | M Clade | C | T | ND3 | Syn |
|
|
|
|
|
| |
| 10873C | M Clade | T | C | ND4 | Syn |
| 12705T | None | C | T | ND5 | Syn |
| 14668T | D4 | C | T | ND6 | Syn |
| 14783C | M Clade | T | C | Cytb | Syn |
|
|
|
|
|
|
|
| 15043A | M Clade | G | A | Cytb | Syn |
| 15301A | M Clade | G | A | Cytb | Syn |
| 16093C | None | T | C | HVS-I | |
|
|
|
|
|
| |
| 16223T | None | C | T | HVS-I | |
| 16319A | None | G | A | HVS-I | |
| 16362C | None | T | C | HVS-I | |
| 16519C | None | T | C | HVS-I |
Column 2 shows the haplogroup specificity for the SNP. The next two columns show the nucleotide in rCRS and the mutation. The last two columns are for the corresponding gene and the ‘sense’ of the mutation – synonymous or nonsynonymous. SNPs shown in bold blue are the six SNPs in the top left corner of Figure 5, the ones most correlated with patterns enriched in the longevity group. Note that these are either non-specific or markers for haplogroup D4a. The locations 152 and 16129 are known mutational hotspots in the hypervariable regions HVS-II and HVS-I respectively and hence their correlation with longevity cannot be considered significant.
The top enhanced SNPs in the D clade in the Japanese samples in Hapmap [5].
| #SNP rs# | SNP position | Gene(s) | Role | aa change | genotypes | p-value |
| rs12960296 | chr18:49408898 | GG[68.75%] GT[18.75%] TT[12.5%] | 1.11E-05 | |||
| rs17439885 | chr18:49417508 | AA[68.75%] AG[18.75%] GG[12.5%] | 1.11E-05 | |||
| rs5766424 | chr22:43823070 | AA[6.25%] AG[81.25%] GG[12.5%] | 1.57E-05 | |||
| rs3802082 | chr7:17143421 | AHR | Intron | - | AA[62.5%] AT[31.25%] TT[6.25%] | 1.63E-05 |
| rs13381188 | chr18:49397848 | CC[66.66%] CT[20.00%] TT[13.33%] | 2.99E-05 | |||
| rs4799386 | chr18:31341066 | C18orf37 | Promoter | - | AA[56.25%] AC[43.75%] CC[0.0%] | 3.63E-05 |
| rs7791070 | chr7:17174267 | CC[6.25%] CT[37.5%] TT[56.25%] | 4.08E-05 | |||
| rs12232754 | chr18:31309143 | C18orf37 | Intron | - | CC[56.25%] CT[43.75%] TT[0.0%] | 4.74E-05 |
|
|
|
|
|
|
|
|
| rs10269143 | chr7:17172223 | AA[12.5%] AT[31.25%] TT[56.25%] | 5.02E-05 | |||
| rs10274243 | chr7:17176936 | AA[12.5%] AG[31.25%] GG[56.25%] | 5.02E-05 | |||
| rs7780687 | chr7:17178958 | AA[56.25%] AG[31.25%] GG[12.5%] | 5.02E-05 | |||
| rs17137566 | chr7:17133761 | AHR | Intron | - | CC[6.25%] CT[37.5%] TT[56.25%] | 5.33E-05 |
| rs3910440 | chr12:29568570 | ARG99 | Intron | - | AA[18.75%] AG[6.25%] GG[75.0%] | 5.79E-05 |
| rs299454 | chr12:29579764 | ARG99 | Intron | - | CC[18.75%] CT[6.25%] TT[75.0%] | 5.79E-05 |
| rs299467 | chr12:29592557 | ARG99 | Intron | - | AA[75.0%] AG[6.25%] GG[18.75%] | 5.79E-05 |
| rs299470 | chr12:29593452 | ARG99 | Intron | - | AA[75.0%] AG[6.25%] GG[18.75%] | 5.79E-05 |
| rs299479 | chr12:29600357 | ARG99 | Intron | - | CC[75.0%] CT[6.25%] TT[18.75%] | 5.79E-05 |
| rs148898 | chr12:29606383 | ARG99 | Intron | - | CC[75.0%] CG[6.25%] GG[18.75%] | 5.79E-05 |
| rs299487 | chr12:29608184 | ARG99 | Intron | - | CC[75.0%] CT[6.25%] TT[18.75%] | 5.79E-05 |
| rs10169728 | chr2:65917282 | FLJ16124 | Intron | - | AA[75.0%] AC[6.25%] CC[18.75%] | 5.79E-05 |
| rs733312 | chr12:41827766 | AA[81.25%] AG[18.75%] GG[0.0%] | 5.81E-05 | |||
| rs8096007 | chr18:2199867 | AA[75.0%] AG[18.75%] GG[6.25%] | 8.55E-05 | |||
| rs11188799 | chr10:98250196 | TLL2 | Intron | - | CC[12.5%] CT[18.75%] TT[68.75%] | 9.16E-05 |
| rs9528775 | chr13:63976625 | CC[31.25%] CT[12.5%] TT[56.25%] | 9.89E-05 | |||
| rs7996275 | chr13:64055128 | CC[31.25%] CT[12.5%] TT[56.25%] | 9.89E-05 |
We compare the D clade samples with the rest of the Japanese samples. Enrichment of the SNP was determined using a chi-squared test at 0.0001 significance level. The choice of mtSNPs in Hapmap was too sparse to resolve the D samples into subhaplogroups. We also cannot correlate SNP 14979C with autosomal SNPs because no D4a defining markers were available in the HapMap dataset.