| Literature DB >> 20113469 |
Akira Kato1, Alejandro P Rooney, Yutaka Furutani, Shigehisa Hirose.
Abstract
BACKGROUND: Trappin is a multifunctional host-defense peptide that has antiproteolytic, antiinflammatory, and antimicrobial activities. The numbers and compositions of trappin paralogs vary among mammalian species: human and sheep have a single trappin-2 gene; mouse and rat have no trappin gene; pig and cow have multiple trappin genes; and guinea pig has a trappin gene and two other derivative genes. Independent duplications of trappin genes in pig and cow were observed recently after the species were separated. To determine whether these trappin gene duplications are restricted only to certain mammalian lineages, we analyzed recently-developed genome databases for the presence of duplicate trappin genes.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20113469 PMCID: PMC2831891 DOI: 10.1186/1471-2148-10-31
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1WAP domain of the . WAP domains of SLPI (A), trappin-related genes (B), trappin-2 (C), and the other species-specific trappin paralogs (D) are shown. Conserved and semiconserved residues among trappin-2 genes are indicated by light gray. Eight conserved Cys residues constituting the WAP motif signature sequence are shaded in black. The asterisk indicates the catalytically important Met residue. The variable region, which is thought to determine the specificity of WAP motifs, is boxed.
Figure 2Phylogenetic analyses of . (A) Phylogeny of trappin, SLPI, and trappin-related genes. The WAP-coding region of each gene was used for the analyses. (B) Phylogeny of trappin genes. Noncoding region of the trappin genes that was used for the analyses. The phylogenetic trees were constructed using neighbor joining (NJ), maximum parsimony (MP), and maximum likelihood (ML) methods. The NJ trees (left) with bootstrap values for both the NJ and MP methods and ML trees (right) are shown. Trappin genes with short nucleotide sequences were not included in the analyses, because inclusion of short sequences reduces the reliability of the analyses when we removed all the sites containing missing data and alignment gaps prior to the calculation (called the complete deletion in MEGA software). Bars indicate 10% replacement per site. Species- or lineage-specific trappin multigenes are shaded in gray.
Figure 3History of the evolution of . Phylogeny of mammalian species with the time scale was generated based on the works by Kumar and Hedges [31], Hasegawa et al. [48], Nishihara et al. [24], Hallstrom and Janke [49], and Arnason et al. [50]. Ranges of the estimated dates for gene duplications are indicated by open boxes. Bayesian estimations of duplications using the nucleotide and the amino acid sequences of trappin genes are indicated by arrows and arrow heads, respectively. The dates of duplication estimated by MEGA software using single reference point are indicated by asterisks. The dates of duplications individually estimated by MEGA software for each species are indicated by double asterisks. Mya, million years ago.
Purifying selection of trappin-2 genes.
| Trappin-2 | Other | All | |
|---|---|---|---|
| dn | 0.190 ± 0.041 | 0.455 ± 0.092 | 0.369 ± 0.071 |
| ds | 0.333 ± 0.057 | 0.388 ± 0.066 | 0.377 ± 0.054 |
| Dn/ds | 0.57 | 1.17 | 0.98 |
Average non-synonymous and synonymous distances were calculated on 150 bp WAP-coding regions among trappin-2 genes of various species, the other trappin paralogs, and all trappin genes.
Accelerated evolution of trappin multigenes.
| Line | Region | Armadillo | Cow | Pig | Elephant | Hyrax | Guinea pig |
|---|---|---|---|---|---|---|---|
| A1 | 5' flanking region | 0.059 ± 0.009 | 0.090 ± 0.008 | 0.010 ± 0.003 | 0.610 ± 0.051 | 0.935 ± 0.039 | ND |
| A2 | exon 1 (signal peptide) | 0.010 ± 0.007 | 0.066 ± 0.026 | 0.433 ± 0.099 | 0.708 ± 0.158 | 0.318 ± 0.063 | |
| A3 | intron 1 | 0.060 ± 0.006 | 0.101 ± 0.009 | 0.017 ± 0.004 | 0.914 ± 0.073 | 0.807 ± 0.046 | 0.397 ± 0.026 |
| A4 | exon 2 (TGS and WAP) | 0.430 ± 0.049 | 0.576 ± 0.066 | ND*** | |||
| A5 | intron 2 | 0.045 ± 0.010 | 0.057 ± 0.011 | 0.750 ± 0.089 | 0.645 ± 0.081 | 0.403 ± 0.040 | |
| A6 | exon 3 (non coding) | 0.077 ± 0.014 | 0.026 ± 0.009 | 0.004 ± 0.004 | 0.420 ± 0.069 | 0.662 ± 0.105 | 0.417 ± 0.052 |
| A7 | 3' flanking region | 0.026 ± 0.006 | 0.038 ± 0.006 | 0.016 ± 0.003 | 0.615 ± 0.049 | 0.862 ± 0.034 | 0.363 ± 0.035 |
| B1 | entire gene except coding region | 0.054 ± 0.004 | 0.053 ± 0.003 | 0.019 ± 0.002 | 0.716 ± 0.041 | 1.540 ± 0.024 | 0.376 ± 0.022 |
| C1 | pre | 0.012 ± 0.009 | 0.064 ± 0.022 | 0.271 ± 0.074 | 0.591 ± 0.141 | 0.318 ± 0.063 | |
| C2 | pre (non synonymous) | 0.009 ± 0.009 | 0.043 ± 0.029 | 0.226 ± 0.079 | 0.489 ± 0.152 | 0.287 ± 0.060 | |
| C3 | pre (synonymous) | 0.020 ± 0.020 | 0.120 ± 0.058 | 0.384 ± 0.196 | 0.942 ± 0.530 | 0.405 ± 0.155 | |
| dn/ds | 0.5 | 0.4 | 0.9 | 0.6 | 0.5 | 0.7 | |
| D1 | TGS | 0.118 ± 0.023* | 0.195 ± 0.031** | 0.604 ± 0.101 | 0.977 ± 0.168 | 0.474 ± 0.038 | |
| D2 | TGS (non synonymous) | 0.147 ± 0.036* | 0.211 ± 0.046** | 0.724 ± 0.165 | 1.097 ± 0.322 | 0.452 ± 0.049 | |
| D3 | TGS (synonymous) | 0.057 ± 0.027 | 0.153 ± 0.056 | 0.355 ± 0.126 | 0.726 ± 0.203 | 0.533 ± 0.086 | |
| dn/ds | 2.6 | 1.4 | 1.2 | 2.0 | 1.5 | 0.8 | |
| E1 | WAP | 0.337 ± 0.061 | 0.439 ± 0.073 | 0.601 ± 0.087 | |||
| E2 | WAP (non synonymous) | 0.310 ± 0.093 | 0.366 ± 0.096 | 0.609 ± 0.133 | |||
| E3 | WAP (synonymous) | 0.091 ± 0.039 | 0.460 ± 0.152 | 0.632 ± 0.233 | 0.733 ± 0.249 | ||
| dn/ds | 1.0 | 1.8 | 1.9 | 0.7 | 0.6 | 0.8 | |
Average JC (upper) and TN (lower) distances of each region among the trappin multigenes for each species (A1--E3) and estimated times of gene duplication (F1--F2) are shown. For each species, the accelerated evolution of each region (A1--A7, C1--E3) was assessed by making comparisons against the average distances of non-coding regions (B2). The bold letters indicate the distances of regions which evolved faster than the non-coding regions of the same genes. * P < 0.05, ** P < 0.01. *** In guinea pig, there is only one trappin gene that has both the TGS and WAP domains. ND, no data; pre, signal peptide (pre-sequence).
Figure 4Evolutional relationship between . (A) Schematic illustration of conserved synteny around trappin(s). Illustrations of human, mouse, rat, and dog loci were generated based on the works by Clauss et al. [20] and Hurle et al. [26]. Black and gray arrows indicate genes and pseudo genes, respectively. KCNS, potassium voltage-gated channel, member 1 (B) The WAP domains of WFDC12 and the WFDC12-like peptide encoded by intron 1 region of trappin-18. Conserved and semiconserved residues are indicated by light gray. Eight conserved Cys residues constituting the WAP motif signature sequence are shaded in black. (C) Harr plot analyses of the hyrax trappin-18 in comparison with the hyrax trappin-2 and human WFDC12. (D) Harr plot analyses of the elephant trappin-18 in comparison with the elephant trappin-2 and human WFDC12. Exons are indicated by boxes. WAP-coding regions are represented by black boxes. Black circles indicate regions encoding WFDC12 like peptide.
Positive selection of species-specific trappin paralogs.
| pTr-2 | |||||
|---|---|---|---|---|---|
| pTr-8 | - | pTr-8 | |||
| pTr-1 | 1.5 | 1.3 | pTr-1 | ||
| pTr-3 | 3.3* | 3.8 | 3.0 | pTr-3 | |
| pTr-9 | 4.9* | 5.2* | 2.4 | 2.5 | pTr-9 |
| pTr-7 | 1.5 | 1.5 | 2.3 | 1.5 | 2.1 |
| bTr-2 | |||||
| bTr-5 | 1.1 | bTr-5 | |||
| bTr-19 | 1.2 | 3.9 | bTr-19 | ||
| bTr-6 | 1.2 | 3.8 | - | bTr-6 | |
| bTr-20 | 2.6 | 5.8 | 3.0 | 1.3 | |
| aTr-2 | |||||
| aTr-15 | 1.0 | aTr-15 | |||
| aTr-16 | 0.7 | 0.7 | aTr-16 | ||
| aTr-17 | 1.6 | 0.9 | 0.9 | aTr-17 | |
| aTr-13 | 1.4 | 1.3 | 1.4 | 1.4 | aTr-13 |
| aTr-14 | 1.9 | 1.8 | 1.7 | 1.6 | 1.6 |
Pairwise comparison matrixes of dn/ds rates are shown. p, pig; b, bovine; a, armadillo; Tr, trappin; * P < 0.05.
Nucleotide substitutions between known cDNA and corresponding genomic sequences
| Armadillo | Rabbit | Cat | Elephant | Cow | Human | |
|---|---|---|---|---|---|---|
| 1/899 | 14/543 | 1/976 | 5/1222 | ND | 3/1650 | |
| 2/713 | 2/306 | 7/878 | 1/626 | 0/835 | 0/912 | |
| 0/876 | 2/1237 | 4/1072 | 0/926 | 0/950 | 0/1335 | |
| 6/413 | ND | 2/683 | 0/542 | 1/674 | 0/691 | |
| 0/476 | 1/468 | 0/586 | ND | 1/551 | 0/591 | |
| 0/646 | 0/672 | 2/564 | 2/677 | 2/653 | 0/673 | |
| 4/477 | 0/666 | 0/740 | 1/1077 | 7/1064 | 1/1114 | |
| total | 13/4500 | 19/3892 | 16/5499 | 9/5070 | 11/4727 | 4/6966 |
| substitution rate (%) | 0.29 | 0.49 | 0.29 | 0.10 | 0.23 | 0.06 |
| identity (%) | 99.7 | 99.5 | 99.7 | 99.9 | 99.8 | 99.9 |
Numbers show the nucleotide substitutions per the length of the sequences used for the analysis. The scaffold, contig, or accession numbers of the sequences are shown in Supplementary Table S1 (additional file 1). ND, no data.
Figure 5Schematic representations of TGS, . (A) The three-exon structure of mammalian TGS and trappin genes and the four-exon structure of mammalian SLPI gene are shown. (B) Structure of the trappin-related genes of chicken and opossum (upper) and platypus SLPI gene (lower). The homologous regions are shown by dotted lines. The exons are shown by boxes. TGS- and WAP-coding exons are indicated by light and dark gray, respectively. The signal-peptide-coding regions and noncoding regions are indicated by black and white boxes, respectively. S, signal-peptide-coding region.