| Literature DB >> 28989790 |
Khushboo Gulati1, Minal Jamsandekar1, Krishna Mohan Poluri1,2.
Abstract
Chemokines are chemotactic cytokines involved in leucocyte trafficking to infected tissue. Growth-related oncogene (GRO) chemokines namely CXCL1, CXCL2 and CXCL3 are neutrophil activating chemokines sharing a conserved three-dimensional structure, but encompassing functional diversity due to gene duplication and evolutionary events. However, the evolutionary mechanisms including selection pressures involved in diversification of GRO genes have not yet been characterized. Here, we performed comprehensive evolutionary analysis of GRO genes among different mammalian species. Phylogenetic analysis illustrated a species-specific evolution pattern. Selection analysis evidenced that these genes have undergone concerted evolution. Seventeen positively selected sites were obtained, although the majority of the protein is under purifying selection. Interestingly, these positively selected sites are more concentrated on the C-terminal/glycosaminoglycan (GAG) binding and dimerization segment compared to receptor binding domain. Substitution rate analysis confirmed the C-terminal domain of GRO genes as the highest substituted segment. Further, structural analysis established that the nucleotide alterations in the GAG binding domain are the source of surface charge modulation, thus generating the differential GAG binding surfaces and multiple binding sites as per evolutionary pressure, although the helical surface is primordial for GAG binding. Indeed, such variable electrostatic surfaces are crucial to regulate chemokine gradient formation during a host's defence against pathogens and also explain the significance of chemokine promiscuity.Entities:
Keywords: chemokines; glycosaminoglycans; growth-related oncogenes; molecular evolution; neutrophil trafficking
Year: 2017 PMID: 28989790 PMCID: PMC5627130 DOI: 10.1098/rsos.171059
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 1.(a) Amino acid sequences of GRO family of chemokines from human (H), horse (S) and murine (M). Residues involved in canonical α-helical surface GAG binding are marked in blue and those specific to β-sheet surface GAG binding are marked in green; charge alteration among these residues is highlighted with red. (b) Structural elements in the monomeric structure of ELR-CXC chemokines. (c) Dimeric structure of HCXCL1. (d) Backbone H-bonding of β1–β1′ residues of dimer interface. (e) Essential dimer interface contacts between C-terminal helices and with β-strands. (f) Surface structure of HCXCL1 monomer marked with receptor binding residues (red) and GAG binding residues forming α-domain (pink) and β-domain (green).
Figure 2.Phylogenetic tree of GRO family chemokines among different mammalian species. Scale bar represents distance that corresponds to 5% amino acid differences between the sequences.
Selection analysis of GRO chemokines using different codon-based maximum-likelihood methods. Amino acids identified by more than one method are underlined.
| GRO genes | no. species | 2Δ | PAML | SLACa | FELa | RELb | iFELa | MEMEa | FUBARc | total no. sites | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CXCL1 | 37 | −2994.58 | −2994.24 | 0.54 | n.a. | 4, 15, | 20, 69, 72 | |||||
| CXCL2 | 22 | −1654 | −1649.52 | 8.97* | — | 3, | — | 4, 5, 15, 55, 64 | ||||
| CXCL3 | 24 | −1913.98 | −1909.55 | 8.85* | 5, | — | 3, 15, 22, 33, | 46, | 50, 55, 72 | |||
| combined dataset | 83 | −5007.58 | −5003.71 | 7.7* | n.a. | 4, 5, 15, 20, 28, 29, 30, 50, 55,58, 60, 64, 69, 70, 71, 72, 73 |
aAmino acids with significance values less than 0.1.
bAmino acids with Bayes factor greater than 95.
cAmino acids with significance values greater than or equal to 0.9.
*p < 0.05.
Figure 3.(a) Posterior mean ω at each amino acid site across the GRO genes; blue, purifying selection; red, positive selection. The horizontal line represents cutoff value for positive selection. Asterisks mark indicate the sites detected as positively selected by DATAMONKEY methods. (b) Positively selected residues are marked as spheres on both the monomeric units of HCXCL1 homodimer. (c) Significant tertiary interactions observed among the residues forming coevolved pairs are shown on HCXCL1 structure.
Calculated AMI scores for the coevolving positively selected sites. Number of asterisks represents the significance of the coevolving interaction.
| coevolving amino acid pairs | AMI score | ||
|---|---|---|---|
| T5–S30 | 9.43 (**) | ||
| T5–A55 | 7.29 (*) | ||
| S30–P20 | 6.91 (*) | ||
| S30–K29 | 10.36 (***) | ||
| S30–A55 | 11.56 (***) | ||
| S69–D70 | 9.24 (**) | ||
| S69–K71 | 10.35 (***) | ||
| S69–S72 | 10.91 (***) | ||
| K71–K72 | 10.4 (***) |
Pairwise values of dS (below diagonal) and dN (above diagonal) observed between GRO sequences (CXCL1/CXCL2/CXCL3 denoted as 1/2/3) from H, human; O, orangutan; S, horse; B, cow; M, mouse; R, rat.
| H1 | H2 | H3 | O1 | O2 | O3 | S1 | S2 | S3 | B1 | B2 | B3 | M1 | M2 | M3 | R1 | R2 | R3 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| H1 | 0.06 | 0.09 | 0.01 | 0.05 | 0.08 | 0.16 | 0.16 | 0.15 | 0.16 | 0.16 | 0.14 | 0.22 | 0.20 | 0.26 | 0.19 | 0.25 | 0.29 | |
| H2 | 0.08 | 0.08 | 0.06 | 0.01 | 0.07 | 0.11 | 0.12 | 0.12 | 0.13 | 0.13 | 0.11 | 0.21 | 0.19 | 0.23 | 0.18 | 0.22 | 0.27 | |
| H3 | 0.15 | 0.15 | 0.09 | 0.08 | 0.01 | 0.12 | 0.12 | 0.11 | 0.12 | 0.12 | 0.10 | 0.24 | 0.18 | 0.27 | 0.21 | 0.21 | 0.29 | |
| O1 | 0.04 | 0.08 | 0.14 | 0.05 | 0.08 | 0.16 | 0.16 | 0.15 | 0.14 | 0.14 | 0.14 | 0.24 | 0.20 | 0.25 | 0.21 | 0.24 | 0.28 | |
| O2 | 0.08 | 0.02 | 0.06 | 0.08 | 0.06 | 0.12 | 0.13 | 0.13 | 0.13 | 0.13 | 0.12 | 0.21 | 0.18 | 0.23 | 0.19 | 0.22 | 0.28 | |
| O3 | 0.16 | 0.10 | 0.06 | 0.11 | 0.08 | 0.11 | 0.12 | 0.10 | 0.11 | 0.11 | 0.09 | 0.23 | 0.17 | 0.26 | 0.20 | 0.21 | 0.28 | |
| S1 | 0.28 | 0.41 | 0.41 | 0.29 | 0.35 | 0.34 | 0.04 | 0.05 | 0.11 | 0.11 | 0.07 | 0.20 | 0.20 | 0.25 | 0.17 | 0.21 | 0.26 | |
| S2 | 0.26 | 0.32 | 0.32 | 0.26 | 0.23 | 0.26 | 0.06 | 0.06 | 0.10 | 0.10 | 0.08 | 0.22 | 0.18 | 0.24 | 0.17 | 0.19 | 0.24 | |
| S3 | 0.33 | 0.42 | 0.42 | 0.27 | 0.30 | 0.36 | 0.14 | 0.12 | 0.10 | 0.10 | 0.08 | 0.24 | 0.22 | 0.30 | 0.21 | 0.24 | 0.31 | |
| B1 | 0.46 | 0.39 | 0.46 | 0.43 | 0.36 | 0.46 | 0.36 | 0.28 | 0.40 | 0.00 | 0.07 | 0.26 | 0.21 | 0.28 | 0.22 | 0.23 | 0.28 | |
| B2 | 0.42 | 0.36 | 0.42 | 0.39 | 0.33 | 0.42 | 0.33 | 0.25 | 0.36 | 0.02 | 0.07 | 0.26 | 0.21 | 0.28 | 0.22 | 0.23 | 0.28 | |
| B3 | 0.51 | 0.48 | 0.55 | 0.56 | 0.48 | 0.55 | 0.38 | 0.29 | 0.47 | 0.33 | 0.30 | 0.20 | 0.19 | 0.25 | 0.17 | 0.20 | 0.27 | |
| M1 | 0.52 | 0.53 | 0.64 | 0.55 | 0.53 | 0.56 | 0.54 | 0.39 | 0.48 | 0.49 | 0.52 | 0.59 | 0.25 | 0.29 | 0.04 | 0.25 | 0.27 | |
| M2 | 0.59 | 0.61 | 0.70 | 0.64 | 0.57 | 0.61 | 0.59 | 0.51 | 0.53 | 0.60 | 0.65 | 0.72 | 0.40 | 0.13 | 0.22 | 0.06 | 0.15 | |
| M3 | 0.70 | 0.69 | 0.67 | 0.66 | 0.68 | 0.59 | 0.54 | 0.55 | 0.45 | 0.42 | 0.56 | 0.72 | 0.51 | 0.24 | 0.24 | 0.13 | 0.09 | |
| R1 | 0.51 | 0.62 | 0.56 | 0.54 | 0.57 | 0.49 | 0.54 | 0.43 | 0.52 | 0.70 | 0.65 | 0.72 | 0.20 | 0.46 | 0.58 | 0.22 | 0.29 | |
| R2 | 0.72 | 0.67 | 0.79 | 0.78 | 0.65 | 0.69 | 0.63 | 0.55 | 0.65 | 0.64 | 0.69 | 0.71 | 0.52 | 0.23 | 0.24 | 0.53 | 0.16 | |
| R3 | 0.96 | 0.88 | 0.90 | 1.04 | 0.68 | 0.79 | 0.65 | 0.66 | 0.71 | 0.76 | 0.82 | 0.84 | 0.54 | 0.36 | 0.16 | 0.68 | 0.20 |
Figure 4.Substitution rates for (a) complete GRO sequences, and individual domains, namely: (b) N-terminal, (c) central and (d) C-terminal domain. H, human; O, orangutan; S, horse; B, cow; R, rat; M, mouse; 1/2/3 represent CXCL1/CXCL2/CXCL3. The brown horizontal line represents the lower threshold of substitution rate.
Figure 5.Pairwise comparison of omega values for different domains of GRO genes in different families. Cω, Mω and Nω represent the ω values for C-terminal, central and N-terminal domains, respectively.
Figure 6.Comparative nucleotide/amino acid analysis of GRO chemokines; H, human; S, horse; M, murine; 1/2/3 represent CXCL1/CXCL2/CXCL3. Alterations in the nucleotide/amino acid sequences are highlighted in red.
Figure 7.Electrostatic surface potential maps for the dimeric GRO proteins. (a–c) H, human; S, horse; M, murine chemokines, respectively. Both α-helical and β-sheet surfaces are presented. Pink arrows represent the plausible mode of GAG binding on β-sheet surfaces.