| Literature DB >> 21592397 |
Pauline L Kamath1, Wayne M Getz.
Abstract
BACKGROUND: Major Histocompatibility Complex (MHC) genes are central to vertebrate immune response and are believed to be under balancing selection by pathogens. This hypothesis has been supported by observations of extremely high polymorphism, elevated nonsynonymous to synonymous base pair substitution rates and trans-species polymorphisms at these loci. In equids, the organization and variability of this gene family has been described, however the full extent of diversity and selection is unknown. As selection is not expected to act uniformly on a functional gene, maximum likelihood codon-based models of selection that allow heterogeneity in selection across codon positions can be valuable for examining MHC gene evolution and the molecular basis for species adaptations.Entities:
Mesh:
Year: 2011 PMID: 21592397 PMCID: PMC3126738 DOI: 10.1186/1471-2148-11-128
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Diversity of the ELA-DRA, exon 2, by taxon
| Taxonomic group | MHC symbol | No. of nucleotide sequences | No. of protein sequences |
|---|---|---|---|
| BoLA | 1 | 1 | |
| DLA | 1 | 1 | |
| HLA | 1 | 1 | |
| NHP | 13 | 2 | |
| OLA | 3 | 3 | |
| SLA | 4 | 3 | |
| ELA | 22 | 10 |
Information extracted from the international ImMunoGeneTics (IMGT) information system® (http://www.imgt.org) and the Immuno Polymorphism Database - MHC (IPD-MHC) (http://www.ebi.ac.uk/ipd/mhc/).
*Equine data was compiled in this study.
Indices of diversity and selection at the ELA-DRA and DQA
| Length (bp) | PIP/VNP | R | K2P distance (%) | AA distance (%) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 243 | 33 | 22 | 9/15 | 4.75 | 1.3(0.4) | 1.7 (0.7) | 0.008 (0.003) | 0.025 (0.011) | 0.32 | |
| 201 | 55 | 48* | 70/96 | 1.34 | 13.7(2.2) | 21.6 (4.4) | 0.105 (0.019) | 0.106 (0.021) | 0.99 |
Length = number of base pairs (bp); N = number of alleles when considering identical alleles across taxa separately; A = number of alleles across taxa; PIP = parsimony informative positions; VNP = variable nucleotide positions; R = transition/transversion bias; K2P distance = average mean evolutionary distance determined using the Kimura 2-parameter gamma model (K2P+Γ); AA = average mean poisson-corrected amino acid distance; dS = synonymous base pair substitution rate; dN = non-synonymous base pair substitution rate; Standard errors of estimates are shown in parentheses. *Includes the allele Eqbu-DQA*21, with a stop codon, while all other estimates exclude it.
ELA-DRA and DQA diversity and selection within Equus spp
| Species | ||||||
|---|---|---|---|---|---|---|
| 1.2 (0.4) | 0.21 | 3/6 | 1.5 (0.9) | 0.72 | 0/2 | |
| 1.2 (0.4) | 0.23 | 6/11 | 16.9 (2.7) | 0.78 | 4/20 | |
| 1.4 (0.5) | 0.67 | 1/5 | 13.3 (2.2) | 1.10 | 3/21 | |
| 1.3 (0.7) | 0.00 | 1/2 | 12.1 (3.4) | 0.88 | 1/2 | |
| 1.3 (0.7) | 0.81 | 1/2 | 2.7 (1.2) | 0.53 | 0/2 | |
| 1.3 (0.7) | 0.81 | 2/2 | 8.6 (2.2) | 2.36 | 0/3 | |
| n/a | n/a | n/a | n/a | n/a | 1/1 | |
| 0.8 (0.4) | 0.40 | 4/5 | 12.4 (2.6) | 1.27 | 3/4 | |
d = Mean evolutionary K2P distance; dN/dS = nonsynonymous to synonymous mutation rate ratio; Shared/Total = number of shared out of total alleles found within species of the genus Equus.
Figure 1Predicted amino acid alignment of the ELA. Dots indicate sequence identity to first sequence in alignment, Eqas-DRA*01. E. burchelli alleles are shown in gray, with light gray highlighting alleles previously known and dark gray highlighting new alleles discovered in this study. Red stars above amino acids indicate putative antigen binding sites, based on the human HLA equivalents [52].
Figure 2Predicted amino acid alignment of the ELA. Dots indicate sequence identity to first sequence in alignment, Eqas-DQA*01. E. burchelli alleles are shown in gray, with light gray highlighting alleles previously known and dark gray highlighting new alleles discovered in this study. Red stars above amino acids indicate putative antigen binding sites, based on human HLA equivalents [52]. The asterisk (*) represents a stop codon.
Selection tests over all sites, antigen binding sites (ABS) and non-antigen binding sites (non-ABS)
| Sites | ||||
|---|---|---|---|---|
| Locus | ||||
| 81 | 20 | 61 | ||
| 0.302 | n/a* | 0.202 | ||
| 0.143 | 0.121 | 0.096 | ||
| 1 | 0.058 | 1 | ||
| 0.074 | 1 | 0.049 | ||
| 67 | 18 | 49 | ||
| 0.990 | 1.013 | 0.915 | ||
| 0.975 | 0.978 | 0.715 | ||
| 1 | 0.489 | 1 | ||
| 0.488 | 1 | 0.359 | ||
N = number of codons; dN/dS = synonymous to non-synonymous rate ratio; Z test p-values for rejecting the null hypothesis of neutrality (dN = dS) for the alternative hypotheses of non-neutrality (dN ≠ dS), positive selection (dN >dS), and purifying selection (dN
Parameter estimates, log-likelihood values and predicted sites under selection for codon evolution models
| Locus | Model code | ℓ | Parameter estimates | Sites under positive selection | 2Δℓ ( | |
|---|---|---|---|---|---|---|
| M0 (one ratio) | 1 | -481.93 | None | |||
| 14.34 ( | ||||||
| M3 (discrete) | 5 | -474.76 | Not analysed | |||
| M1a (nearly neutral) | 1 | -476.43 | Not allowed | |||
| 3.34 ( | ||||||
| M2a (positive selection) | 3 | -474.76 | 14,19, 47, 49 | |||
| M7 (beta) | 2 | -476.35 | Not allowed | |||
| 3.18 ( | ||||||
| M8 (beta and omega) | 4 | -474.76 | 14,19, 47, 49 | |||
| M0 (one ratio) | 1 | -1612.03 | None | |||
| 197.68 ( | ||||||
| M3 (discrete) | 5 | -1513.2 | Not analysed | |||
| M1a (nearly neutral) | 1 | -1545.23 | Not allowed | |||
| 66.8 ( | ||||||
| M2a (positive selection) | 3 | -1516.86 | ||||
| M7 (beta) | 2 | -1548.22 | Not allowed | |||
| 29.65 ( | ||||||
| M8 (beta and omega) | 4 | -1518.57 | ||||
P = number of free parameters in the ω distribution; ℓ = log-likelihood; Model parameter estimates include the nonsynonymous to synonymous rate ratio (ω) and proportion of sites (p) under each ω site class. Estimates for ω that are evidence for positive selection are bolded. Sites under selection were predicted using the Bayes Empirical Bayes (BEB) approach: sites inferred to be under positive selection with posterior probabilities >95% are in bold and sites with posterior probabilities of > 99% are indicated by an asterisk (*).
Figure 3Posterior means of . Posterior means of ω calculated over 11 site classes under the random-sites, codon-based model M8 (beta and omega) and Bayes empirical Bayes (BEB) approach as implemented in PAML [63]. Error bars indicate S.E. of the mean and the asterisk (*) denotes significant positive selection with a posterior probability > 95%. The dashed red line shows where ω = 1. Predicted antigen binding sites, based on HLA equivalents [52], are notated by the red triangles.
Figure 4Bayesian reconstruction of unique . Sequence data (243 bp) was partitioned by codon position and a GTR nucleotide substitution model was used, with equal rates across sites. Analyses were run with 6 chains for 7,000,000 generations, burnin = 17,500 trees. Posterior probabilities > 50% are reported at the nodes. Identical alleles across multiple species are indicated by the appropriate colored bars (see legend) and names were omitted from the tree: . Sequences from Bos taurus (DQ821713), Ovis aries (Z11600) and Sus scrofa (AY754888) were used as outgroups.
Figure 5Bayesian reconstruction of unique . Sequence data (205 bp) was partitioned by codon position and a GTR nucleotide substitution model was used, with gamma-distributed rates across sites. Analyses were run with 6 chains for 16,000,000 generations, burnin = 40,000 trees. Posterior probabilities > 50% are reported at the nodes. Identical alleles across multiple species are indicated by the appropriate colored bars (see legend) and names were omitted from the tree: has a stop codon, but was included in this analysis. Sequences from B. taurus (AB548942), O. aries (M33304) and S. scrofa (EU195146) were used as outgroups.