| Literature DB >> 23204305 |
Patrícia Isabel Marques, Rui Bernardino, Teresa Fernandes, Eric D Green, Belen Hurle, Victor Quesada, Susana Seixas.
Abstract
The kallikrein (KLK) gene family comprises the largest uninterrupted locus of serine proteases in the human genome and represents a notable case for studying the evolutionary fate of duplicated genes. In primates, a recent duplication event gave rise to KLK2 and KLK3, both encoding essential proteins for the cascade of seminal plasma liquefaction. We reconstructed the evolutionary history of KLK2 and KLK3 by comparative analysis of the orthologous sequences from 22 primate species, calculated d(N)/d(S) ratios, and addressed the hypothesis of coevolution with their substrates, the semenogelins (SEMG1 and SEMG2). Our findings support the placement of the KLK2-KLK3 duplication in the Catarrhini ancestor and unveil the frequent loss of KLK2 throughout primate evolution by different genomic mechanisms, including unequal crossing-over, deletions, and pseudogenization. We provide evidences for an adaptive evolution of KLK3 toward an expanded enzymatic spectrum, with an effect on the hydrolysis of semen coagulum. Furthermore, we found associations between mating system, the number of SEMG repeat units, and the number of functional KLK2 and KLK3, suggesting complex evolutionary dynamics shaped by reproductive biology.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23204305 PMCID: PMC3542562 DOI: 10.1093/gbe/evs111
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FPhylogenetic analysis of KLK2 and KLK3 in primates. (A) Phylogenetic tree showing primate divergence times (Hedges et al. 2006) and functional status of KLK2 and KLK3. The criteria to define a nonfunctional KLK gene were the identification of at least one disrupting mutation. Gray square indicates a duplication event. The ancestral KLK3 branch is indicated (ancKLK3). (B) Alignment of exons IV–V for KLK2 and KLK3 in Catarrhini. The corresponding human genomic positions for these regions are represented at the top. Positions conserved with Gorilla gorilla (left panel) or Nomascus leucogenys (right panel) are in orange. Nonconserved positions are in blue. Sites conserved in all species were omitted.
Identified KLK2 Deleterious Mutations
| Species | Deleterious Mutations |
|---|---|
| None | |
| D120A | |
| G51R, | |
| V247fsX337 | |
| M1I, W47X, IVS3+1G>A, | |
| M1I, | |
| M1L, I25T, C184fsX189, and IVS4-2A>T |
aMutations are displayed according to the recommended nomenclature for the description of human sequence variations (den Dunnen and Antonarakis 2000).
bCatalytic triad mutation (Clark and Swanson 2005).
cPossible damaging as predicted by Polyphen2 (Adzhubei et al. 2010).
dPolymorphic site.
eSplice site mutation.
fPreviously identified mutations (Olsson et al. 2004).
Parameter Estimates and Likelihood Scores under Different Branch Models
| Model | Parameters for Branches | Likelihood ( |
|---|---|---|
| One ratio | ω | −4,390.93 |
| Two ratios | ω | −4,390.90 |
| ω | ||
| Three ratios | ω | −4,386.40 |
| ωp | ||
| ω | ||
| Four ratios | ω | −4,384.35 |
| ωp | ||
| ω | ||
| ωanc |
Note.—ω, ω for all KLK2 and KLK3 lineages; ω, ω for all KLK2 lineages; ω, ω for all KLK3 lineages; ωp, ω for KLK2 pseudogene lineages; ωanc, ω for the ancestral KLK3 lineage; df - degrees of freedom.
*Significant P < 0.05.
**Significant P < 0.01.
Model Comparisons of Variable ω Ratios among Sites
| Models Compared | −2Δ | Parameter Estimates under Selection | Positively Selected Sites |
|---|---|---|---|
| M1 vs. M2 | 6.56 | 109 | |
| M7 vs. M8 | 9.78 | ||
| M1 vs. M2 | 5.59 (df = 2) | 45, | |
| M7 vs. M8 | 6.37 | ||
| Branch-site (MA) | 13.06 | 13, 41, | |
Note.—ωbg, ω for background branches; ωfg, ω for foreground branch (ancestral KLK3 branch).
aSites with posterior probabilities >0.85 are indicated in regular type; P values > 0.90 are underlined and P values > 0.95 are in bold.
*Significant P < 0.05.
**Significant P < 0.01.
FPositive selected sites in biologically relevant regions. (A) Human KLK2 three-dimensional model showing amino acid replacements predicted to be under positive selection (Q109, H177, and G210). (B) Human KLK3 three-dimensional model showing D207S substitution predicted to be under positive selection in the ancestral branch. The catalytic triad is represented in light blue (H65, D120, and S213) and the binding sites in orange (S228, G230, and D207 in KLK2 or S207 in KLK3).
FEvolution of primate KLK2 and KLK3 related to mating factors. (A) Correlation of residual testis size (Anderson et al. 2004; Dixson and Anderson 2004; Wlasiuk and Nachman 2010) with the combined SEMG repeat units (Jensen-Seaman and Li 2003; Hurle et al. 2007). (B) Correlation between the number of SEMG1 and SEMG2 repeat units (Jensen-Seaman and Li 2003; Hurle et al. 2007) and the presence of functional KLK2 and KLK3. *P < 0.05. (C) Correlation between the mating system (Wlasiuk and Nachman 2010) and the presence of functional KLK2 and KLK3. UM, unimale; MM, multimale. *P < 0.05. (•), monoandrous; (▪), polyandrous; and (▴), ambiguous.