| Literature DB >> 22701678 |
Jeremy Semeiks1, Nick V Grishin.
Abstract
Evolutionary theory suggests that the force of natural selection decreases with age. To explore the extent to which this prediction directly affects protein structure and function, we used multiple regression to find longevity-selected positions, defined as the columns of a sequence alignment conserved in long-lived but not short-lived mammal species. We analyzed 7,590 orthologous protein families in 33 mammalian species, accounting for body mass, phylogeny, and species-specific mutation rate. Overall, we found that the number of longevity-selected positions in the mammalian proteome is much higher than would be expected by chance. Further, these positions are enriched in domains of several proteins that interact with one another in inflammation and other aging-related processes, as well as in organismal development. We present as an example the kinase domain of anti-müllerian hormone type-2 receptor (AMHR2). AMHR2 inhibits ovarian follicle recruitment and growth, and a homology model of the kinase domain shows that its longevity-selected positions cluster near a SNP associated with delayed human menopause. Distinct from its canonical role in development, this region of AMHR2 may function to regulate the protein's activity in a lifespan-specific manner.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22701678 PMCID: PMC3372525 DOI: 10.1371/journal.pone.0038595
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Conceptual example of our multiple regression method applied to a single column (Y465 of AMHR2; full result in Table S2).
(left) Characters shown ordered by species MLS. For each non-human species, we calculate the similarity score (";BLOSUM80";) for the species’ amino acid character versus the human character (here Y); e.g., this score for Tursiops would be the similarity score for H versus Y, which is 2. (right) We then fit the MLS of all non-human species to their similarity scores; e.g., Tursiops’ contribution to this fit is the point (52, 2). Not shown are the steps to correct for mutation rate and shared phylogeny, and the simultaneous fit of body mass. For this column, the data provide relatively strong support for a nonzero slope in the fit of similarity to MLS, even given trends in mutation rate, phylogeny, and body mass, and so this position is assigned a relatively significant p-value (p<0.01).
Figure 2Phylogeny of species used in this study.
Shown next to each species’ node are its binomial, common name, and MLS.
Number of positions and alignments selected, fit, and conserved, as defined in Sections 2.1 and 2.2.
| subset | columns (×106) | alignments |
| include human | 7.01 | 12,746 |
| –selected | 3.64 | 7,708 |
| ––fit | 0.73 | 7,590 |
| ––conserved | 2.91 | 118 |
Number of alignments that include at least one position in the indicated subset.
Number of alignments that include only conserved positions.
Figure 3Density histograms of p values.
A. Real data. B. Randomized control. Each histogram shows the n=734,741 fit positions with b>0. See Section 3.2 for details.
Predicted secondary structure in all positions of the human proteome and in two subsets of longevity-selected positions.
| α-helix | β-strand | random coil | total | |
| human proteome | 2,108,348 (29.95%) | 956,422 (13.59%) | 3,974,432 (56.46%) | 7,039,202 (100%) |
| real data | 1,922 (24.95%) | 736 (9.56%) | 5,044 (65.49%) | 7,702 (100%) |
| randomized control | 160 (23.26%) | 62 (9.01%) | 466 (67.73%) | 688 (100%) |
Selected clusters, not mutually exclusive, of ontology terms enriched in top protein domains.
| functional class |
| examples |
| extracellular region | 5.1e-5 | BTD, CP, LAMA2, LAMA3, PRSS12 |
| cytokine-mediated signaling pathway | 0.033 | JAK1, IL31RA, IL6ST, LIFR, RIKP1, KIT |
| protein tyrosine kinase activity | 0.018 | JAK1, ROS1, MST1R, NIN, OBSCN, KIT, TYK2 |
| multicopper oxidase; copper ion binding | 9.4e-3 | AFP, CP, F8, HEPHL1 |
| developmental process | 0.041 | AFP, AMHR2, ALOX15, CFTR, ATR, ATM, BRCA1, RECQL4 |
| motor activity; myosin complex | 0.015 | KIF18B, KIF20B, KIF22, MYO5C, MYO7B, MYO18A |
| complement and coagulation cascades;humoral immune response | 0.061 | F8, F11, CR2, C1R, CFD, LTF, CD83 |
| serine-type endopeptidase activity | 0.063 | F11, C1R, CFD, KLK6, LTF, PRSS12 |
Figure 4OrthoMaM alignment of the C-terminal regions of AMHR2 ortholog kinase domains.
Orthologs are ordered by species MLS. The five longevity-selected positions in this region (Y465, T469, F473, E513, and H515) are highlighted in gray. ";X"; indicates regions that we masked due to excessive divergence (Section 2.1). Long regions of gaps are not necessarily real genome deletions, but are more likely to have been missed during genome assembly or annotation.
Figure 5AMHR2 kinase domain mapped onto experimental structure of BMPR2 kinase domain.
All eight longevity-selected positions found in this domain are shown as black sticks and are further described in Table S2. The five longevity-selected positions discussed in the text are labeled; they are found on the αG–αH loop (Y465, T469, and F473) and the C-terminal loop following αI (E513 and H515).