| Literature DB >> 25133778 |
Sílvia Gomes1, Patrícia I Marques2, Rune Matthiesen3, Susana Seixas1.
Abstract
A series of duplication events led to an expansion of clade B Serine Protease Inhibitors (SERPIN), currently displaying a large repertoire of functions in vertebrates. Accordingly, the recent duplicates SERPINB3 and B4 located in human 18q21.3 SERPIN cluster control the activity of different cysteine and serine proteases, respectively. Here, we aim to assess SERPINB3 and B4 coevolution with their target proteases in order to understand the evolutionary forces shaping the accelerated divergence of these duplicates. Phylogenetic analysis of primate sequences placed the duplication event in a Hominoidae ancestor (∼30 Mya) and the emergence of SERPINB3 in Homininae (∼9 Mya). We detected evidence of strong positive selection throughout SERPINB4/B3 primate tree and target proteases, cathepsin L2 (CTSL2) and G (CTSG) and chymase (CMA1). Specifically, in the Homininae clade a perfect match was observed between the adaptive evolution of SERPINB3 and cathepsin S (CTSS) and most of sites under positive selection were located at the inhibitor/protease interface. Altogether our results seem to favour a coevolution hypothesis for SERPINB3, CTSS and CTSL2 and for SERPINB4 and CTSG and CMA1. A scenario of an accelerated evolution driven by host-pathogen interactions is also possible since SERPINB3/B4 are potent inhibitors of exogenous proteases, released by infectious agents. Finally, similar patterns of expression and the sharing of many regulatory motifs suggest neofunctionalization as the best fitted model of the functional divergence of SERPINB3 and B4 duplicates.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25133778 PMCID: PMC4136820 DOI: 10.1371/journal.pone.0104935
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Origin of SERPINB3 and SERPINB4 duplicates.
A) The organization of SERPINB3 and SERPINB4 loci in human and eight non-human primates. Relative position to telomere (Tel) and centromere (Cen) is shown. Solid boxes represent functional genes; open boxes represent pseudogenes. B) Phylogenetic tree of SERPINB3 and SERPINB4 genes with the bootstrap percentages shown at interior nodes and the alignment of RCL regions (P17-P4′). The canonical scissile bond is marked by an arrow and a standard P1 and P1′ nomenclature is used to number amino acid positions N- and C-terminal outward from the scissile bond. AncB3/B4: ancestral SERPINB3/B4 gene.
Maximum likelihood estimates of positive selection for SERPINB3/B4 phylogeny.
| Phylogeny | N | M1avs. M2a | M7vs. M8 | Proportion of sites ω>1 | Positively selected sites a |
|
| 9 | 88.97 | 89.35 | ω = 7.12, p = 0.01 | 17Q, |
Likelihood ratio tests (−2Δl) comparing a null and positive selection models (M1a vs M2a, M7 vs M8); N, number of primate species with sequences in alignment; p: proportion of sites under positive selection in M8 model; ω: estimate the dN/dS of the sites under selection in M8 model; a Amino acid sites found to be under positive selection with posterior probabilities greater than 90% (blank), 95% (underlined) or 99% (bold) in the BEB analysis. The reference sequence is human SERPINB3.
** Significance with p<0.001.
Likelihood ratio test for branch-site model for SERPINB3/B4 phylogeny.
| Phylogeny | Parameter estimates Foreground vs. Background | −2Δ | Positively selected sites |
|
| p0 = 0.612, p1 = 0.376, p2a = 0.008, p2b = 0.005, ω0 = 0.018, ω1 = 1.000, ω2 = 19.207 | 6.10 | 327G, 351G, 352F |
|
| p0 = 0.630, p1 = 0.369, p2a = 0.000, p2b = 0.000, ω0 = 0.023, ω1 = 1.000, ω2 = 1.000 | 0 | NA |
−2ΔL, likelihood ratio test to detect positive selection with 1 degree of freedom; Foreground 1: H. Sapiens B3, P. Troglodytes B3 and G. Gorilla B3 lineages; Foreground 2: H. Sapiens B4, P. Troglodytes B4 and G. Gorilla B4 lineages. Amino acid sites found to be under positive selection with posterior probabilities greater than 80% (blank) are displayed; NA, not applicable because the neutral model fits better than positive selection.
** Significance with p<0.01.
Figure 2X-ray structure of SERPINB3 and predicted structure of SERPINB4.
The A β-sheet (shutter) is in orange, B β-sheet (breach) is in red and C β-sheet (gate) is in blue. Helices are shown in green. RCL: reactive center loop. Sites under positive selection are in black.
Phylogenetic tests of positive selection for target proteases.
| Target | M0 ωa | M0vs. Free-ratio | M1avs. M2a | M7vs. M8 | Proportion of sites ω>1 | Positively selected sitesc | |
|
|
| 0.22 | 22.35 | 2.72 (2d.f.) | 2.74 (2d.f.) | - | NA |
|
| 0.35 | 17.93 | 1.74 (2d.f.) | 1.91 (2d.f.) | - | NA | |
|
| 0.43 | 9.71 | 14.95 | 15.13 | ωb = 5.66 p = 0.02 |
| |
|
| 0.19 | 18.89 | 3.64 (2d.f.) | 4.76 (2d.f.) | - | NA | |
|
|
| 0.98 | 11.06 | 45.82 | 46.11 | ωb = 7.12 p = 0.02 | 25R, |
|
| 0.60 | 21.80 | 12.58 | 12.62 | ωb = 4.47 p = 28.58 | 12L, |
−2ΔL, likelihood ratio test to detect positive selection; p: proportion of sites under positive selection for M8 model; ωa: dN/dS estimate for M0; ωb: dN/dS estimate for M8 model; cPositively selected sites identified by M8 model: amino acid sites found to be under positive selection with posterior probabilities greater than 90% (blank), 95% (underlined) or 99% (bold) in the BEB analysis. a The reference sequence is human SERPINB3. NA, not applicable because the neutural model fits better than positive selection.
** Significance with p<0.001.
* Significance with p<0.05.
Likelihood ratio test for branch-site model for target proteases using H. sapiens, P. troglodytes and G. gorilla lineage as foreground.
| Gene | Parameter estimates Foreground vs. Background | -2ΔlnL | Positively selected sites |
|
| p0 = 0.802, p1 = 0.188, p2a = 0.008, p2b = 0.002, ω0 = 0.001, ω1 = 1.000, ω2 = 48.657 | 5.38* | 255R |
|
| p0 = 0.678, p1 = 0.321, p2a = 0.000, p2b = 0.000, ω0 = 0.051, ω1 = 1.000, ω2 = 1.000 | 0 | NA |
|
| p0 = 0.679, p1 = 0.320, p2a = 0.000, p2b = 0.000, ω0 = 0.000, ω1 = 1.000, ω2 = 1.000 | 0 | NA |
|
| p0 = 0.549, p1 = 0.066, p2a = 0.343, p2b = 0.041, ω0 = 0.000, ω1 = 1.000, ω2 = 1.000 | 0 | NA |
|
| p0 = 0.421, p1 = 0.540, p2a = 0.017, p2b = 0.021, ω0 = 0.000, ω1 = 1.000, ω2 = 7.081 | 0.98 | NA |
|
| p0 = 0.331, p1 = 0.200, p2a = 0.292, p2b = 0.177, ω0 = 0.000, ω1 = 1.000, ω2 = 2.196 | 0.17 | NA |
−2ΔL, likelihood ratio test to detect positive selection; Foreground: H. sapiens, P. troglodytes and G. gorilla lineage. *Significance with p<0.05; Positively selected sites, amino acid sites found to be under positive selection with a posterior probabilities greater 90%; NA, not applicable because the neutral model fits better than positive selection.
Inhibitor protein complexes tested by docking analysis.
| Model | HADDOCK score | i-RMSD | l-RMSD |
| SERPINB3/CTSK | −88.3+/−2.3 | 0.58+/−0.39 | 1.30+/−0.87 |
| SERPINB3/CTSL1 | −62.0+/−15.7 | 1.04+/−0.91 | 3.09+/−3.18 |
| SERPINB3/CTSL2 | −75.0+/−4.2 | 0.38+/−0.25 | 0.778+/−0.59 |
| SERPINB3/CTSS | −96.1+/−4.4 | 0.43+/−0.30 | 1.06+/−0.73 |
| SERPINB4/CTSS | −74.1+/−7.0 | 16.23+/−0.35 | 35.47+/−1.05 |
| SERPINB4/CMA1 | −85.8+/−7.5 | 0.52+/−0.36 | 1.48+/−1.07 |
| SERPINB4/CTSG | −74.5+/−5.6 | 0.47+/−0.32 | 1.02+/−0.75 |
i-RMSD: interfacial root mean square deviation; l-RMSD: ligand root mean square deviation; HADDOCK score is weighted sum of van der Waals, electrostatic, desolvation and restrained violation energies together with buried surface area.
Figure 3The best docking for SERPINB3 (green)/CTSS (blue) and SERPINB4 (green)/CTSG (blue) complex models generated using HADDOCK software.
Amino acids under positive selection at the SERPIN/protease interface are in black. Amino acids at the inhibitor scissile bond and forming the proteases catalytic triad are depicted in red. Arrows point the location of β-sheet A (SA), β-sheet B (SB) and β-sheet C (SC). Binding regions are enlarged for a more detailed view (left panel).
Figure 4Expression patterns of SERPINB3 and SERPINB4 in human tissues.
GAPDH amplification was used as a control. NC: negative control.