| Literature DB >> 20459756 |
Kelsey J Metzger1, Michael A Thomas.
Abstract
BACKGROUND: CC chemokine receptor proteins (CCR1 through CCR10) are seven-transmembrane G-protein coupled receptors whose signaling pathways are known for their important roles coordinating immune system responses through targeted trafficking of white blood cells. In addition, some of these receptors have been identified as fusion proteins for viral pathogens: for example, HIV-1 strains utilize CCR5, CCR2 and CCR3 proteins to obtain cellular entry in humans. The extracellular domains of these receptor proteins are involved in ligand-binding specificity as well as pathogen recognition interactions.In mammals, the majority of chemokine receptor genes are clustered together; in humans, seven of the ten genes are clustered in the 3p21-24 chromosome region. Gene conversion events, or exchange of DNA sequence between genes, have been reported in chemokine receptor paralogs in various mammalian lineages, especially between the cytogenetically closely located pairs CCR2/5 and CCR1/3. Datasets of mammalian orthologs for each gene were analyzed separately to minimize the potential confounding impact of analyzing highly similar sequences resulting from gene conversion events.Molecular evolution approaches and the software package Phylogenetic Analyses by Maximum Likelihood (PAML) were utilized to investigate the signature of selection that has acted on the mammalian CC chemokine receptor (CCR) gene family. The results of neutral vs. adaptive evolution (positive selection) hypothesis testing using Site Models are reported. In general, positive selection is defined by a ratio of nonsynonymous/synonymous nucleotide changes (dN/dS, or omega) >1.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20459756 PMCID: PMC2880985 DOI: 10.1186/1471-2148-10-139
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Model Parameter Estimates, dN/dS Ratios, Log Likelihood Values and Test Statistics for PAML Site Models.
| Gene | Model | Parameters | ||||
|---|---|---|---|---|---|---|
| M7: Neutral, beta | p = 0.33949, q = 1.05276 | 0.2417 | 2 | -4931.71 | ||
| M8: Selection, beta + ω | p0 = 0.97570, p = 0.45144 | 0.2728 | 4 | -4924.10 | M7 vs. M8: | |
| q = 1.67059, (p1 = 0.02430) | ||||||
| ω = 2.83526 | ||||||
| M1a: Nearly Neutral | ω0 = 0.07487, ω1 = 1.00 | 0.4314 | 2 | -5536.21 | ||
| p0 = 0.61460, (p1 = 0.38540) | ||||||
| M2a: Selection | ω0 = 0.07422, ω1 = 1.00 | 0.5014 | 4 | -5526.29 | M1a vs. M2a: | |
| ω2 = 5.44115, p0 = 0.60315 | ||||||
| p1 = 0.38339, (p2 = 0.01346) | ||||||
| M7: Neutral, beta | p = 0.22415, q = 0.35810 | 0.3850 | 2 | -5534.05 | ||
| M8: Selection, beta + ω | p0 = 0.98363, p = 0.23871 | 0.2662 | 4 | -5522.90 | M7 vs. M8: | |
| q = 0.3298, (p1 = 0.01637) | ||||||
| ω = 4.50735 |
The test statistic 2Δl is compared to a χ2 distribution with 2 degrees of freedom, critical values 5.99, 9.21, and 13.82 at 5%, 1%, and 0.1% significance, respectively. Significant results are indicated by asterisks. Values for non-significant comparisons are presented in Additional File 1.
Positively Selected Sites Under Different PAML Site Models Using Bayes Empirical Bayes Analysis.
| Gene | Model | Codon | Domain | Amino Acid | Posterior Probability | Post Mean +- SE for ω |
|---|---|---|---|---|---|---|
| M8: Selection, beta+ ω | 16 | EC | S | 0.632 | 1.536 +- 0.783 | |
| 23 | EC | F | 0.795 | 1.796 +- 0.781 | ||
| 43 | EC | Q | 0.0916 | 1.977 +- 0.729 | ||
| 95 | MB | L | 0.795 | 1.807 +- 0.810 | ||
| 115 | EC | L | 0.623 | 1.535 +- 0.825 | ||
| 183 | EC | K | 0.0926 | 1.986 +- 0.720 | ||
| 187 | EC | V | 0.705 | 1.678 +- 0.831 | ||
| 196 | EC | R | 0.961* | 2.028 +- 0.691 | ||
| 197 | EC | G | 0.862 | 1.910 +- 0.771 | ||
| M2a: Selection | 4 | EC | S | 0.638 | 3.459 +- 2.147 | |
| 5 | EC | L | 0.988* | 4.851 +- 1.496 | ||
| 20 | EC | V | 0.998** | 4.881 +- 1.453 | ||
| 179 | EC | T | 0.919 | 4.581+- 1.731 | ||
| 181 | EC | T | 0.878 | 4.378 +- 1.807 | ||
| M8: Selection, beta+ ω | 4 | EC | S | 0.824 | 2.296 +- 0.904 | |
| 5 | EC | L | 0.988* | 2.590 +- 0.683 | ||
| 20 | EC | V | 0.997** | 2.603 +- 0.667 | ||
| 31 | EC | A | 0.549 | 1.734 +- 0.921 | ||
| 95 | EC | R | 0.681 | 2.000 +- 0.947 | ||
| 96 | EC | G | 0.598 | 1.830 +- 0.940 | ||
| 104 | EC | G | 0.863 | 2.347 +- 0.833 | ||
| 177 | EC | L | 0.528 | 1.688 +- 0.902 | ||
| 179 | EC | E | 0.550 | 1.748 +- 0.967 | ||
| 181 | EC | T | 0.962* | 2.545 +- 0.729 |
Asterisks indicate posterior probability P > 95% (*) and P > 99%(**). For codon position, the amino acid number is given followed by an abbreviation for the domain in which the amino acid is found: EC = extracellular, MB = membrane bound, IC = intracellular.
Figure 1Location of CCR2 Positively Selected Codon Sites in Extracellular Receptor Protein Domains. Stars indicate the location of positively selected sites in CCR2: Green stars indicate positively selected sites with a Bayes Empirical Bayes posterior probability ≤ 95%, blue stars indicate positively selected sites with posterior probability ≥95%. Diagram created with RbDe online software application [62].
Figure 2Location of CCR3 Positively Selected Codon Sites in Extracellular Receptor Protein Domains. Stars indicate the location of positively selected sites in CCR3: Green stars indicate positively sites with a Bayes Empirical Bayes posterior probability ≤ 95%, blue stars indicate positively sites with posterior probability ≥95%, red stars indicate sites under positive selection with posterior probability of ≥99%. Diagram created with RbDe online software application [62].