| Literature DB >> 24727287 |
Richard Jovelin1, Jennifer S Comstock2, Asher D Cutter3, Patrick C Phillips4.
Abstract
The discovery that genetic pathways can be manipulated to extend lifespan has revolutionized our understanding of aging, yet their function within natural populations remains poorly characterized. In particular, evolutionary theories of aging predict tradeoffs in resource investment toward somatic maintenance vs. reproductive output that should impose strong natural selection on genetic components that influence this balance. To explore such selective pressure at the molecular level, we examine population genetic variation in the insulin-like signaling pathway of the nematode Caenorhabditis remanei. We document a recent global selective sweep on the phosphoinositide-3-kinase pathway regulator, age-1, the first life-extension gene to have been identified. In particular, we find that age-1 has 5-20 times less genetic variation than any other insulin-like signaling pathway components and that evolutionary signatures of selection center on the age-1 locus within its genomic environment. These results demonstrate that critical components of aging-related pathways can be subject to shifting patterns of strong selection, as predicted by theory. This highly polymorphic outcrossing species offers high-resolution, population-level analyses of molecular variation as a complement to functional genetic studies within the self-reproducing C. elegans model system.Entities:
Keywords: Caenorhabditis; aging; insulin pathway; molecular evolution; selective sweep
Mesh:
Substances:
Year: 2014 PMID: 24727287 PMCID: PMC4065255 DOI: 10.1534/g3.114.010629
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Core components of the insulin-like signaling pathway in C. elegans. Genes examined in this study are indicated by a bold outline (clear C. remanei homologs of akt-2 and daf-18 could not be identified).
Figure 2(A) Conserved synteny between C. elegans (top) and C. remanei (bottom) in the genomic region surrounding age-1. The structure is shown for each locus with boxes representing exons and is color-coded to show orthologous relationships. Arrows indicate orientation. (B) Genomic organization of age-1 and its immediate neighbors. Each gene was resequenced in three populations of C. remanei to investigate patterns of diversity and selection in this region. (C) Neighbor-joining networks showing the relationships among C. remanei strains, color-coded according to their population of origin. A small number of strains with short internal branches are omitted for easier representation. Reticulation indicates potential recombination among strains.
Pattern of nucleotide polymorphism in the coding sequence of the insulin-signaling genes in the Ohio population of C. remanei
| Locus | Chr | N | n | % Seq | NS | P | A | S | π | πa | πs |
|---|---|---|---|---|---|---|---|---|---|---|---|
| III | 11 | 22 | 35 | 1941 | 60 | 9 | 52 | 10.32 | 1.87 | 38.82 | |
| X | 9 | 18 | 92 | 2749 | 40 | 13 | 27 | 4.25 | 1.62 | 13.37 | |
| I | 15 | 30 | 84 | 1306 | 96 | 12 | 83 | 18.41 | 2.23 | 72.49 | |
| II | 24 | 48 | 100 | 3564 | 34 | 9 | 25 | 0.92 | 0.44 | 2.57 | |
| X | 11 | 22 | 95 | 1791 | 76 | 6 | 70 | 14.81 | 1.76 | 59.34 | |
| V | 14 | 28 | 79 | 1456 | 42 | 2 | 40 | 11.08 | 0.88 | 46.93 | |
| X | 12 | 24 | 88 | 1220 | 18 | 1 | 17 | 5.49 | 0.09 | 25.18 | |
| II | 13 | 26 | 93 | 1469 | 20 | 2 | 18 | 4.94 | 0.57 | 18.96 | |
| IV | 14 | 14 | 94 | 1715 | 78 | 21 | 60 | 14.03 | 3.73 | 49.81 |
π values are ×103. Chr, chromosome in C. elegans; N, number of strains, n, number of chromosomes, twice the number of the strains when heterozygote sites were present; % Seq, percent of the C. remanei coding sequence analyzed; NS, number of sites analyzed (excluding gap positions); P, number of polymorphic sites; A, number of amino acid replacement changes; S, number of synonymous changes; π, total nucleotide diversity; πa, nucleotide diversity at nonsynonymous sites; πs, nucleotide diversity at synonymous sites.
Pattern of nucleotide variation at age-1 and at its immediate upstream and downstream neighbors in three populations of C. remanei
| Sample | Locus | N | n | % Seq | NS | P | A | S | π | πa | πs | πsi | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ohio | CRE02131 | 22 | 44 | 100 | 1860 | 66 | 3 | 23 | 10.66 | 0.76 | 29.24 | 19.97 | 1.069 | −1.208 |
| 25 | 50 | 100 | 1905 | 33 | 2 | 8 | 2.46 | 0.20 | 5.83 | 4.03 | −1.150 | −3.924 | ||
| CRE02129 | 24 | 24 | 100 | 961 | 8 | 1 | 1 | 1.73 | 0.16 | 1.40 | 3.58 | −0.498 | −0.536 | |
| 24 | 48 | 100 | 3961 | 39 | 9 | 25 | 1.09 | 0.44 | 2.57 | 2.56 | −1.821 | −2.604 | ||
| 17 | 17 | 100 | 2448 | 57 | 3 | 7 | 6.94 | 1.44 | 8.52 | 9.51 | −0.024 | −0.041 | ||
| CRE01736 | 22 | 44 | 100 | 509 | 11 | 4 | 3 | 8.77 | 5.19 | 15.23 | 14.89 | 2.004 | −1.492 | |
| CRE01735 | 19 | 38 | 98 | 3371 | 70 | 2 | 3 | 8.05 | 2.50 | 13.80 | 8.82 | 2.236 | 0.354 | |
| Germany | CRE02131 | 22 | 44 | 100 | 1869 | 68 | 3 | 20 | 8.51 | 0.74 | 17.90 | 15.76 | 0.073 | −1.804 |
| 23 | 46 | 100 | 1916 | 54 | 1 | 13 | 5.62 | 0.30 | 9.95 | 9.28 | −0.440 | −0.882 | ||
| CRE02129 | 24 | 48 | 100 | 960 | 14 | 3 | 3 | 1.50 | 0.39 | 1.74 | 2.82 | −1.470 | −0.347 | |
| 24 | 48 | 100 | 3959 | 71 | 18 | 38 | 3.00 | 1.23 | 8.24 | 7.08 | −0.910 | −2.293 | ||
| 23 | 46 | 68 | 1682 | 55 | 9 | 12 | 5.15 | 1.77 | 9.15 | 7.43 | −1.067 | −2.244 | ||
| CRE01736 | 25 | 50 | 100 | 510 | 13 | 8 | 11 | 6.01 | 4.63 | 9.67 | 8.34 | 0.999 | 1.052 | |
| CRE01735 | 19 | 38 | 99 | 3393 | 108 | 11 | 11 | 8.72 | 5.14 | 27.29 | 9.25 | 0.681 | −0.201 | |
| Ontario | CRE02131 | 19 | 38 | 100 | 1869 | 51 | 5 | 17 | 10.16 | 1.67 | 23.71 | 18.08 | 2.167 | −0.515 |
| 17 | 34 | 100 | 1922 | 40 | 0 | 8 | 5.82 | 0 | 10.32 | 9.79 | 0.516 | −0.926 | ||
| CRE02129 | 21 | 42 | 100 | 960 | 5 | 2 | 1 | 1.54 | 0.72 | 3.08 | 2.50 | 1.233 | 0.625 | |
| 20 | 40 | 100 | 3961 | 19 | 8 | 9 | 1.12 | 0.79 | 2.21 | 1.88 | −0.376 | −0.423 | ||
| 17 | 34 | 73 | 1802 | 33 | 6 | 5 | 3.62 | 1.46 | 4.59 | 5.02 | −0.586 | −2.262 | ||
| CRE01736 | 19 | 38 | 100 | 510 | 13 | 4 | 5 | 8.57 | 6.50 | 14.66 | 12.09 | 1.039 | 1.092 | |
| CRE01735 | 17 | 34 | 98 | 3374 | 87 | 4 | 5 | 6.92 | 2.73 | 14.79 | 7.52 | 0.355 | −2.233 | |
| Pooled | CRE02131 | 63 | 126 | 100 | 1855 | 101 | 8 | 30 | 10.62 | 1.20 | 25.98 | 19.54 | 0.237 | −1.062 |
| 65 | 130 | 100 | 1901 | 83 | 2 | 17 | 4.59 | 0.19 | 9.11 | 7.65 | −1.350 | −2.287 | ||
| CRE02129 | 69 | 138 | 100 | 960 | 22 | 6 | 4 | 1.88 | 0.43 | 2.52 | 3.59 | −1.233 | −0.873 | |
| 68 | 136 | 100 | 3959 | 94 | 23 | 53 | 1.89 | 0.87 | 4.73 | 4.24 | −1.839 | −2.897 | ||
| 57 | 114 | 68 | 1682 | 76 | 13 | 15 | 6.82 | 1.95 | 9.51 | 10.35 | −0.472 | −1.582 | ||
| CRE01736 | 66 | 132 | 100 | 509 | 20 | 9 | 5 | 9.08 | 6.33 | 15.29 | 13.77 | 1.367 | 0.716 | |
| CRE01735 | 55 | 110 | 97 | 3330 | 148 | 6 | 11 | 8.21 | 2.66 | 20.08 | 8.96 | −0.076 | −1.078 |
Tajima’s D was computed using silent sites diversity. π values are ×103. N, number of strains; n, number of chromosomes, twice the number of the strains when heterozygote sites were present; % Seq, percent of the C. remanei gene sequenced; NS, number of sites analyzed (excluding gap positions); P, number of polymorphic sites; A, number of amino acid replacement changes; S, number of synonymous changes; π, total nucleotide diversity; πa, nucleotide diversity at nonsynonymous sites; πs, nucleotide diversity at synonymous sites; πsi, nucleotide diversity at silent sites.
P < 0.05.
P < 0.01.
Figure 3The nucleotide diversity around age-1 is reduced in three populations of C. remanei and at a global spatial scale. The site frequency spectrum shows an excess of rare alleles (Tajima’s D) and an excess of derived high-frequency variants (Fay and Wu’s H) localizing directly at age-1 and/or on its close neighbors. Significance of the difference in silent site nucleotide diversity between age-1 and each of its neighbors was assessed using pairwise HKA tests. Significance of the Tajima’s D and Fay and Wu’s H statistics were determined by coalescent simulations. *P < 0.05, **P < 0.01, ***P < 0.001.
HKA likelihood ratio tests of selection at age-1
| Sampling scheme | Hypothesis | ||||
|---|---|---|---|---|---|
| No selection | 1 | −186.32 | |||
| Selection | 0.286 | −182.27 | 8.10 | 0.0044 | |
| No selection | 1 | −161.11 | |||
| Selection | 0.448 | −160.11 | 1.16 | 0.2815 | |
| No selection | 1 | −186.4 | |||
| Selection | 0.096 | −177.17 | 18.46 | < 0.0001 | |
| No selection | 1 | −178.76 | |||
| Selection | 0.516 | −175.98 | 5.56 | 0.0184 |
k, selection parameter, k < 1 indicates a reduction in diversity due to selection; L, log-likelihood of the hypothesis; 2ΔL, likelihood ratio statistics.
Figure 4Empirical distribution of Tajima’s D from 92 protein coding genes sequenced from the same population in Ohio. Cre-age-1 has the most negative Tajima’s D value (black bar), suggesting that the excess of rare variants at Cre-age-1 is not the result of genome-wide demographic effects. Tajima’s D was computed using synonymous site diversity.
Figure 5Composite likelihood ratio (CLR) test of selective sweep for genes in the genomic vicinity of the age-1 locus. The CLR suggests that age-1 is the direct target of a selective sweep in all three populations of C. remanei. The horizontal dashed lines indicate the 1% cut-off value as determined by simulations under the standard neutral model. The position of each gene within the 17-kb region is shown at the bottom and with gray boxes in each panel.
Figure 6(A) A sliding window of nucleotide diversity of genic and intergenic regions identifies a ~6-kb genomic region, including CRE2129 and Cre-age-1, with low polymorphism. (B) The composite likelihood ratio (CLR) test of selective sweep based on the SFS is shown in blue. The CLR is maximized within Cre-age-1. The distribution of the ω statistics detecting a selective sweep based on the pattern of linkage disequilibrium is shown in green. The ω is maximized within CRE02129. Both tests identify a narrow region as the target of a selective sweep. The horizontal dashed lines indicate the 1% cut-off value as determined by simulations under the standard neutral model. The position of each gene within the 17-kb region and scaled with the x axes of (A) and (B) is shown at the bottom and with gray boxes in each panel.
Sequence evolution and MK tests of adaptive divergence
| Locus | Da | Ds | Da/Ds | Pa | Ps | Pa/Ps | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| CRE02131 | 0.0012 | 0.0829 | 0.0145 | 1 | 16 | 0.0625 | 8 | 31 | 0.2581 | 0.2502 |
| 0.0027 | 0.0887 | 0.0304 | 2 | 17 | 0.1176 | 2 | 17 | 0.1176 | 1 | |
| CRE02129 | 0.0019 | 0.1028 | 0.0185 | 0 | 15 | 0 | 6 | 4 | 1.5 | 0.0012 |
| 0.0318 | 0.1404 | 0.2265 | 79 | 94 | 0.8404 | 24 | 53 | 0.4528 | 0.0369 | |
| 0.0118 | 0.1098 | 0.1075 | 5 | 18 | 0.2778 | 13 | 16 | 0.8125 | 0.1415 | |
| CRE01736 | 0.0514 | 0.1223 | 0.4203 | 11 | 10 | 1.1 | 9 | 5 | 1.8 | 0.7282 |
| CRE01735 | 0 | 0.0089 | 0 | 0 | 1 | 0 | 4 | 3 | 1.3333 | 1 |
MK, McDonald-Kreitman; Da, fixed amino acid replacements; Ds, fixed synonymous changes; Pa, nonsynonymous polymorphisms; Ps, synonymous polymorphisms.