| Literature DB >> 29661983 |
Adam T McLain1, Christopher Faulk2.
Abstract
Gene promoters are evolutionarily conserved across holozoans and enriched in CpG sites, the target for DNA methylation. As animals age, the epigenetic pattern of DNA methylation degrades, with highly methylated CpG sites gradually becoming demethylated while CpG islands increase in methylation. Across vertebrates, aging is a trait that varies among species. We used this variation to determine whether promoter CpG density correlates with species' maximum lifespan. Human promoter sequences were used to identify conserved regions in 131 mammals and a subset of 28 primate genomes. We identified approximately 1000 gene promoters (5% of the total), that significantly correlated CpG density with lifespan. The correlations were performed via the phylogenetic least squares method to account for trait similarity by common descent using phylogenetic branch lengths. Gene set enrichment analysis revealed no significantly enriched pathways or processes, consistent with the hypothesis that aging is not under positive selection. However, within both mammals and primates, 95% of the promoters showed a positive correlation between increasing CpG density and species lifespan, and two thirds were shared between the primate subset and mammalian datasets. Thus, these genes may require greater buffering capacity against age-related dysregulation of DNA methylation in longer-lived species.Entities:
Keywords: CpG density; CpG site; DNA methylation; evolution; lifespan; varmints
Mesh:
Year: 2018 PMID: 29661983 PMCID: PMC5940106 DOI: 10.18632/aging.101413
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Summary list of taxon orders examined in this study. Order names are given with number of promoters matched to human promoters at various percent identity cutoffs. Table S2 contains a full list of taxa with the number of identified promoters in each species.
| Order | No. Species | Promoters Identified with >=70% match | Promoters Identified with >=90% match | Promoters Identified with >=95% match | Promoters Identified with >=99% match |
| Afrosoricida | 2 | 1882 | 170 | 22 | 0 |
| Artiodactyla | 23 | 50580 | 4941 | 618 | 22 |
| Carnivora | 12 | 35548 | 4669 | 826 | 161 |
| Chiroptera | 14 | 27485 | 2346 | 276 | 12 |
| Cingulata | 1 | 1730 | 162 | 18 | 0 |
| Dasyuromorphia | 1 | 51 | 7 | 2 | 0 |
| Dermoptera | 1 | 4159 | 410 | 47 | 1 |
| Didelphimorphia | 1 | 63 | 11 | 2 | 0 |
| Diprotodontia | 1 | 35 | 6 | 1 | 0 |
| Eulypotyphla | 3 | 2377 | 283 | 35 | 3 |
| Hyracoidea | 1 | 1306 | 101 | 14 | 1 |
| Lagomorpha | 1 | 1641 | 163 | 24 | 1 |
| Macroscelidea | 1 | 810 | 89 | 15 | 1 |
| Monotremata | 1 | 19 | 3 | 1 | 1 |
| Perissodactyla | 4 | 13807 | 1349 | 180 | 5 |
| Pholidota | 2 | 13826 | 279 | 36 | 2 |
| Pilosa | 1 | 2195 | 193 | 19 | 0 |
| Primates | 28 | 445897 | 346165 | 172222 | 47784 |
| Proboscidea | 1 | 2067 | 176 | 18 | 1 |
| Rodentia | 28 | 25060 | 2738 | 353 | 5 |
| Scandentia | 2 | 3121 | 362 | 41 | 2 |
| Sirenia | 1 | 2574 | 203 | 23 | 1 |
| Tubulidentata | 1 | 1811 | 170 | 20 | 1 |
Figure 1Promoter matches by species and GC content. (A) Total number of promoters identified by BLAST search of 28 primate genomes. (B) The associated GC content (%) of those promoter regions. (C) Total number of promoters identified from 131 primate genomes. (D) The associated GC content (%) of those promoter regions.
Figure 2Length of promoters. The average length of BLAST detected promoters from the group of 28 primates and the set of 103 mammals excluding primates shows shorter length sequences in the mammalian group (p<0.0001).
Figure 3Example correlations of top hit and random promoter. Shown are the scatterplots of the log(max lifespan) vs. CpG density values for the most significantly correlated promoters from each of the mammalian and primate datasets as compared to a random non-correlated gene.
Figure 4Visualization of the data presented in . The number of promoters correlated with lifespan in the entire mammalian dataset, and in the primate dataset only.
Number of genes positively (q-value >0.05) and negatively (q-value <0.05) correlated with lifespan in both the entire mammalian dataset (131 species), and the primate subset (28 species) only.
| Promoters positively correlated with lifespan (q-value>0.05) | 1020 | 930 |
| Promoters negatively correlated with lifespan (q-value<0.05) | 59 | 57 |
| Total number of promoters with a significant q-value (>0.05) | 1079 | 987 |
Figure 5The number of negatively and positively correlated lifespan-related genes in the whole mammalian dataset compared to those specific to primates. Negatively correlated genes and positively correlated genes.