| Literature DB >> 19361342 |
László Z Garamszegi1, Natasja G de Groot, Ronald E Bontrop.
Abstract
BACKGROUND: The major histocompatibility complex (MHC) is a key model of genetic polymorphism. Selection pressure by pathogens or other microevolutionary forces may result in a high rate of non-synonymous substitutions at the codons specifying the contact residues of the antigen binding sites (ABS), and the maintenance of extreme MHC allelic variation at the population/species level. Therefore, selection forces favouring MHC variability for any reason should cause a correlated evolution between substitution rates and allelic polymorphism. To investigate this prediction, we characterised nucleotide substitution rates and allelic polymorphism (i.e. the number of alleles detected in relation to the number of animals screened) of several Mhc class II DRB lineages in 46 primate species, and tested for a correlation between them.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19361342 PMCID: PMC2674423 DOI: 10.1186/1471-2148-9-73
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Amino acid variability plot for 1174 MHC-DRB alleles from 46 primate species (all species and loci combined). The number of different amino acids found on the given position is shown. The considered contact residues of the antigen-binding sites (ABS) are drawn in black (see text for details).
Testing for consistent within-species and within-lineage variations in nucleotide substitution rates (N: non-synonymous, S: synonymous) of the MHC-DRB molecule at different regions (ABS: contact residues, non-ABS: non-contact residues)
| All lineages | Species-specific effects | Lineage-specific effects |
| dN, ABS | F42,148 = 2.455, P < 0.001 | F59,131 = 1.994, P < 0.001 |
| dS, ABS | F42,148 = 1.628, P = 0.018 | F59,131 = 1.544, P = 0.021 |
| dN, non-ABS | F42,148 = 1.450, P = 0.055 | F59,131 = 1.490, P = 0.031 |
| dS, non-ABS | F42,148 = 1.893, P = 0.003 | F59,131 = 2.568, P < 0.001 |
| dN, all sites | F42,148 = 2.227, P < 0.001 | F59,131 = 1.744, P = 0.005 |
| dS, all sites | F42,148 = 1.503, P = 0.040 | F59,131 = 1.782, P = 0.003 |
| Only HLA othologous | Species-specific effects | Lineage-specific effects |
| dN, ABS | F20,34 = 2.260, P = 0.018 | F16,38 = 0.798, P = 0.679 |
| dS, ABS | F20,34 = 1.632, P = 0.102 | F16,38 = 0.360, P = 0.984 |
| dN, non-ABS | F20,34 = 2.195, P = 0.021 | F16,38 = 1.412, P = 0.188 |
| dS, non-ABS | F20,34 = 2.187, P = 0.022 | F16,38 = 0.380, P = 0.980 |
| dN, all sites | F20,34 = 2.765, P = 0.004 | F16,38 = 0.783, P = 0.694 |
| dS, all sites | F20,34 = 2.329, P = 0.015 | F16,38 = 0.310, P = 0.993 |
Results form one-way ANOVA, when including all lineages and only human orthologues (i.e. after excluding W lineages).
Figure 2Consistent within-species (a) and within-lineage (b) variations in non-synonymous nucleotide substitution rates at the contact residues of the antigen binding sites. Columns are means, bars indicate standard errors. (a) Names are the four-letter abbreviations used for species names [27], and (b) lineage names.
Testing simultaneously for consistent within-species and within-lineage variations in nucleotide substitution rates (N: non-synonymous, S: synonymous) of the MHC-DRB molecule at different regions (ABS: contact residues, non-ABS: non-contact residues).
| All lineages | Species-specific effects | Lineage-specific effects |
| dN, ABS | F42,91 = 2.311, P < 0.001 | F59,89 = 1.875, P = 0.004 |
| dS, ABS | F42,91 = 1.809, P = 0.010 | F59,89 = 1.572, P = 0.026 |
| dN, non-ABS | F42,91 = 1.491, P = 0.058 | F59,89 = 1.490, P = 0.044 |
| dS, non-ABS | F42,91 = 1.690, P = 0.019 | F59,89 = 2.628, P < 0.001 |
| dN, all sites | F42,91 = 2.120, P = 0.002 | F59,89 = 1.694, P = 0.012 |
| dS, all sites | F42,91 = 1.908, P = 0.005 | F59,89 = 2.135, P < 0.001 |
| Only HLA orthologous | Species-specific effects | Lineage-specific effects |
| dN, ABS | F20,18 = 3.152, P = 0.009 | F16,18 = 1.155, P = 0.381 |
| dS, ABS | F20,18 = 1.632, P = 0.150 | F16,18 = 0.356, P = 0.979 |
| dN, non-ABS | F20,18 = 2.253, P = 0.044 | F16,18 = 1.104, P = 0.417 |
| dS, non-ABS | F20,18 = 3.058, P = 0.010 | F16,18 = 0.878, P = 0.601 |
| dN, all sites | F20,18 = 3.347, P = 0.006 | F16,18 = 0.986, P = 0.507 |
| dS, all sites | F20,18 = 2.842, P = 0.015 | F16,18 = 0.645, P = 0.809 |
GLM results, when including all lineages and only human orthologues (i.e. after excluding W lineages)
Correlation between different estimates of nucleotide substitution rates (N: non-synonymous, S: synonymous) of the MHC-DRB exon 2 of the molecule at different regions (ABS: contact residues, non-ABS: non-contact residues).
| All lineages, N = 191 | dS, ABS | dN, non-ABS | dS, non-ABS |
| dN, ABS | 0.630 | 0.815 | 0.619 |
| dS, ABS | 0.548 | 0.507 | |
| dN, non-ABS | 0.656 | ||
| Only HLA orthologues, N = 55 | dS, ABS | dN, non-ABS | dS, non-ABS |
| dN, ABS | 0.703 | 0.826 | 0.850 |
| dS, ABS | 0.635* | 0.641* | |
| dN, non-ABS | 0.825 | ||
Effect sizes in the form of Pearson's correlation coefficient as derived from the corresponding GLM model, which included species and lineage as random effects and was weighted by sample size (log10-transformed number of individuals). All relationships are significant at the 0.001 < P level, except *, which corresponds to P < 0.01.
Figure 3The relationship between non-synonymous (dN) and synonymous (dS) substitution rates at the a) ABS contact residues and b) ABS non-contact residues. Points are lineage-specific data when species are separated; solid lines are regression lines; dashed lines indicate y = x, which corresponds to neutrality. Regions below the neutral line denote positive selection pressure (dN > dS or dN:dS > 1).
Figure 4The relationship between non-synonymous (dN) substitution rate at the contact residues of the ABS and allelic variation of DRB lineages when a) including and b) excluding non-human orthologous lineages (designated with 'W' workshop numbers). Allelic variation was estimated as the number of identified alleles relative to the number of animals sampled. Data were derived from the appropriate GLM model that included species-specific and lineage-specific effects and also the number of animals sampled (see text for details). Back-transformed data are shown; lines are regression lines.
Correlation between different estimates of nucleotide substitution rates (N: non-synonymous, S: synonymous) of the DRB1*03 lineage when sampled across all available species (N = 18) and when only catarrhines are considered (N = 11).
| All species, N = 18 | dS, ABS | dN, non-ABS | dS, non-ABS |
| dN, ABS | 0.561* | 0.927*** | 0.771*** |
| dS, ABS | 0.480* | 0.418† | |
| dN, non-ABS | 0.749*** | ||
| Only catarrhine species, N = 11 | dS, ABS | dN, non-ABS | dS, non-ABS |
| dN, ABS | 0.673* | 0.947*** | 0.800** |
| dS, ABS | 0.689* | 0.679* | |
| dN, non-ABS | 0.778** | ||
ABS: contact residues, non-ABS: non-contact residues. Effect sizes as estimated from the PGLS model that adjusted for the phylogenetic relationships of species and used statistical weights (log10-sample size) in a combination that offered the best fitted to the data based on maximised log-likelihood. ***: P < 0.001; **: P < 0.01; *: P < 0.05; † < 0.1.
Figure 5Natural selection and correlated evolution of substitution rates and allelic variation in the . The interspecific relationship between non-synonymous (dN) and synonymous (dS) substitution rates at the a) ABS contact residues and b) ABS non-contact residues. c) The association between non-synonymous (dN) substitution rate at the ABS and allelic variation of the lineage (the number of alleles when the number of animals were held constant). Each data point represents the polymorphism of the DRB1*03 lineage in different species, and was obtained from the appropriate statistical model (see text) that considers the phylogenetic relatedness of species. Symbols represent different taxonomical groups of primates. Solid line is the regression line laid on the entire dataset; dotted line is the regression line that is obtained for New World primates only; dashed line indicates y = x. For illustration; we also included data from human (HLA-DRB1*03, HLA-DRB1*11, HLA-DRB1*12, HLA-DRB1*13 and HLA-DRB1*14 combined; data were subtracted from the IMGT/HLA database, ). We could not add human data to c), because the number of individuals screened for the 324 detected alleles is unknown and thus relative allele numbers cannot be calculated.