| Literature DB >> 32178380 |
Kento Inamaru1, Ami Takeuchi1, Marie Maeda1, Hiroki Shibata2, Yasuyuki Fukumaki2, Naoko Oda-Ueda3, Shosaku Hattori4, Motonori Ohno1, Takahito Chijiwa1.
Abstract
Small serum proteins (SSPs) are low-molecular-weight proteins in snake serum with affinities for various venom proteins. Five SSPs, PfSSP-1 through PfSSP-5, have been reported in Protobothrops flavoviridis ("habu", Pf) serum so far. Recently, we reported that the five genes encoding these PfSSPs are arranged in tandem on a single chromosome. However, the physiological functions and evolutionary origins of the five SSPs remain poorly understood. In a detailed analysis of the habu draft genome, we found a gene encoding a novel SSP, SSP-6. Structural analysis of the genes encoding SSPs and their genomic arrangement revealed the following: (1) SSP-6 forms a third SSP subgroup; (2) SSP-5 and SSP-6 were present in all snake genomes before the divergence of non-venomous and venomous snakes, while SSP-4 was acquired only by venomous snakes; (3) the composition of paralogous SSP genes in snake genomes seems to reflect snake habitat differences; and (4) the evolutionary emergence of SSP genes is probably related to the physiological functions of SSPs, with an initial snake repertoire of SSP-6 and SSP-5. SSP-4 and its derivative, SSP-3, as well as SSP-1 and SSP-2, appear to be venom-related and were acquired later.Entities:
Keywords: Protobothrops flavoviridis; comparative genomics; evolution; gene array; small serum proteins
Mesh:
Substances:
Year: 2020 PMID: 32178380 PMCID: PMC7150969 DOI: 10.3390/toxins12030177
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Figure 1Alignment of the deduced amino acid sequences encoded by the open reading frames (ORFs) of six PfSSPs and human PSP94. Position numbers refer to amino acid residues of the mature proteins. Signal peptide sequences are in lower case letters. The cysteines are shaded. Abbreviations: Pf, P. flavoviridis. References: PfSSP-6 (this study); PfSSP-1 (AB360906.1); PfSSP-2 (AB360907.1); PfSSP-5 (AB360910.1); PfSSP-3 (AB360908.1); PfSSP-4 (AB360909.1) [31]; human PSP94 (NP_002434.1) [41]. Numerals in parentheses show percent identities of PfSSPs and human PSP94 with PfSSP-6.
Figure 2Schematic representation of the 16,048 bp genome segment containing PfSSP-6 in the 5′ region upstream of the array of five PfSSPs. Bold arrows indicate the areas and transcription directions of the genes in the segment.
Figure 3Schematic configurations of nucleotide sequences in the introns of six PfSSPs. Gray bars represent exons. Half-closed, hatched, open, and closed ellipses represent fragments of L1, CR1, L2, and Gypsy LINEs, respectively. Since the fragment of L2 LINE in the third intron of PfSSP-6 differs from those of L2 LINE in the third introns of PfSSP-1, 2, and 5, the ellipse of PfSSP-6 is shown in gray. Open and closed stars represent the fragments of Mariner and hAT DNA transposons. Positions of the corresponding fragments are linked with dashed lines. Positions of repetitive sequences (TAAAA and AATAA) are indicated with carets and numbers of repetitions are indicated as subscripts.
Figure 4Schematic comparison of the arrays of SSPs of eight snake taxa. Abbreviations: Cv: C. viridis; Da: D. acutus; Oh: O. hannah; Pb: P. bivittatus; Pf: P. flavoviridis; Pm: P. mucrosquamatus; Ts: T. sirtalis; Vb: V. berus. Bold arrows indicate the areas and transcription directions of the genes. Orthologs of SSP-6, SSP-5, and SSP-4 in each snake genome are linked with gray. Inverted genome segments of O. hannah, D. acutus, P. mucrosquamatus, and P. flavoviridis are linked with dashed lines. Double slashes indicate interruptions of the nucleotide sequences.
KA/KS ratios estimated between the ORFs of C. viridis SSPs.
|
|
|
| |
|---|---|---|---|
|
| 0.749 | 0.879 | |
|
| 0.306 | ||
|
|
KA/KS ratios estimated between the ORFs of D. acutus SSPs.
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| 1.61 | 0.934 | 0.823 | 0.832 | |
|
| 0.849 | 0.705 | 1.03 | ||
|
| 1.77 | 0.878 | |||
|
| 0.830 | ||||
|
|
KA/KS ratios estimated between the ORFs of O. hannah SSPs.
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| 0.931 | 0.875 | 0.700 | 0.919 | |
|
| 0.836 | 0.719 | 0.666 | ||
|
| 0.832 | 0.685 | |||
|
| 0.509 | ||||
|
|
KA/KS ratios estimated between the ORFs of P. bivittatus SSPs.
|
|
|
| |
|---|---|---|---|
|
| 1.594 | 0.343 | |
|
| 0.296 | ||
|
|
KA/KS ratios estimated between the ORFs of P. mucrosquamatus SSPs.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
|
| 1.49 | 0.891 | 0.821 | 0.639 | 0.694 | |
|
| 0.825 | 0.662 | 0.476 | 0.620 | ||
|
| 1.35 | 0.479 | 0.780 | |||
|
| 0.586 | 0.889 | ||||
|
| 0.370 | |||||
|
|
KA/KS ratios estimated between the ORFs of P. flavoviridis SSPs.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
|
| 1.80 | 0.660 | 0.790 | 0.597 | 0.792 | |
|
| 0.808 | 0.781 | 0.504 | 0.891 | ||
|
| 1.40 | 0.599 | 0.990 | |||
|
| 0.670 | 1.07 | ||||
|
| 0.626 | |||||
|
|
KA/KS ratios estimated between the ORFs of T. sirtalis SSPs.
|
|
|
|
| |
|---|---|---|---|---|
|
| 1.06 | 0.603 | 0.651 | |
|
| 0.675 | 0.447 | ||
|
| 0.659 | |||
|
|
KA/KS ratios estimated between the ORFs of V. berus SSPs.
|
|
| |
|---|---|---|
|
| 0.604 | |
|
|
KN values estimated between the introns of C. viridis SSPs.
|
|
|
| |
|---|---|---|---|
|
| 0.319 | 0.358 | |
|
| 0.372 | ||
|
|
KN values estimated between the introns of D. acutus SSPs.
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| 0.0227 | 0.248 | 0.253 | 0.249 | |
|
| 0.247 | 0.251 | 0.246 | ||
|
| 0.0050 | 0.285 | |||
|
| 0.288 | ||||
|
|
KN values estimated between the introns of O. hannah SSPs.
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| 0.331 | 0.340 | 0.324 | 0.530 | |
|
| 0.0615 | 0.0801 | 0.277 | ||
|
| 0.0857 | 0.226 | |||
|
| 0.275 | ||||
|
|
KN values estimated between the introns of P. mucrosquamatus SSPs.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
|
| 0.154 | 0.338 | 0.342 | 0.339 | 0.374 | |
|
| 0.281 | 0.284 | 0.262 | 0.316 | ||
|
| 0.0397 | 0.293 | 0.346 | |||
|
| 0.296 | 0.349 | ||||
|
| 0.295 | |||||
|
|
KN values estimated between the introns of P. flavoviridis SSPs.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
|
| 0.0317 | 0.251 | 0.258 | 0.248 | 0.231 | |
|
| 0.254 | 0.261 | 0.253 | 0.229 | ||
|
| 0.0283 | 0.261 | 0.279 | |||
|
| 0.270 | 0.287 | ||||
|
| 0.267 | |||||
|
|
Primers used to acquire the nucleotide sequences from the genome domain encompassing PfSSP-6 to PfSSP-4. The symbols (f) or (r) after the position numbers indicate the directions of the primers. Forward or reverse denote whether the direction of elongation was the same or opposite to that of transcription. Nucleotide positions refer to nucleotide sequences reported in this study (LC518073).
| Name | Positions | Nucleotide Sequence (GC Content: %, Tm: °C) |
|---|---|---|
| SSP6-5UTR-1 | 1–20 (f) | 5′- ggC gTC CCT CCT TCT CCT Tg -3′ (65, 66) |
| SSP6ex3-2 | 1431–1453 (r) | 5′- CTC gCA TTC CAT ACA ATT ggC Tg-3′ (48, 68) |
| SSP6ex3-1 | 1365–1385 (f) | 5′- TgT ggC CAA CCA AAT gCg Tgg -3′ (57, 66) |
| SSP6-3UTR-2 | 3581–3604 (r) | 5′- ACA TgA gAg ATT TAT TCC AgT gTg-3′(38, 66) |
| SSP6-3flank-1 | 3705–3727 (r) | 5′- CAg CTA TgC ATg CCT TAT ATC AC -3′ (43, 66) |
| Ireg64-1 | 3254–3275 (f) | 5′- CTC CAT gCA AAg gAg gAT TTC C -3′ (50, 66) |
| Ireg64-6 | 10,950–10,970 (r) | 5′- TAg gCC TTg ACA CAT gAT ggC -3′ (52, 64) |
| Ireg64-5 | 8449–8469 (f) | 5′- CAT TgT TgA gCA ACC CTT ggC -3′ (52, 64) |
| Ireg64-8 | 13,709–13,731 (r) | 5′- ggA CTA TTA AgC AgT ggA ATg gC -3′ (48, 68) |
| Ireg64-9 | 13,277–13,297 (f) | 5′- ggC CCT CTT CCA Agg ACA AgC -3′ (62, 68) |
| Ireg64-10 | 16,228–16,247 (r) | 5′- ACC TCg TTC CTC CAg CCA CT -3′ (60, 64) |
Primers utilized to determine nucleotide sequences of genome fragments containing SSPs of P. mucrosquamatus and O. hannah. The symbols (f) or (r) after the position numbers indicate the directions of the primers. Forward or reverse indicate whether the direction of elongation was the same or opposite to that of transcription. Nucleotide positions refer to the nucleotide sequences reported in this study. Abbreviations: Oh: O. hannah; Pm: P. mucrosquamatus.
| Name | Scaffold | Nucleotide Sequence (GC Content: %, Tm: °C) |
|---|---|---|
| PmSSP34-5UTR (f) | 5′- CAA ggg TTg gTC TTg gTT TTT g -3′ (45, 64) | |
| PmSSP3ex2-R (r) | 5′- ggT AgA gAA AAg CCC CCA AAg -3′ (52, 64) | |
| PmSSP3-F (f) | 5′- TgC TTT ggg ggC TTT TCT C -3′ (47, 56) | |
| PmSSP34-R (r) | 5′- CTT gAC TgA GAC TgA AgT TCC -3′ (45, 62) | |
| PmSSP4ex2-R (r) | 5′- CgT TTC Agg TAA Agg AAT ACT C -3′ (41, 62) | |
| PmSSP4-F (f) | 5′- gAg TAT TCC TTT ACC TgA AAC g -3′ (41, 62) | |
| OhSSPs-5UTR (f) | 5′- ATA AAT Tgg Agg AgC RgA TTC CT -3′ (43, 66) | |
| OhSSP5-ex2-R (r) | 5′- CTC AgC TTC AAA gCC CCA gg -3′ (60, 64) | |
| OhSSP5-F (f) | 5′- gAg CAT gCT TTA CCT ggg gC -3′ (60, 64) | |
| OhSSP5-R (r) | 5′- TCC ATg TgT AgA gAT CAA ACA Cg -3′ (43, 66) | |
| OhSSP2-ex2-R (r) | 5′- CTC AgC TTC AAA gAg CCC TCT -3′ (52, 64) | |
| OhSSP2-F (f) | 5′- gAg CAT gCT ATA gAg ggC TCT -3′ (52, 64) | |
| OhSSP2-R (r) | 5′- gAT CAA ACA TCA CAg CgC TgC -3′ (52, 64) | |
| OhSSP1-ex2-R (r) | 5′- TTA Agg AAC ACT CCA AAg CAC C -3′ (52, 64) | |
| OhSSP1-F (f) | 5′- gAg ggT gCT TTg gAg TgT TCC -3′ (45, 64) | |
| OhSSP1-R (r) | 5′- gAT CAg ACA CCA CAg CTg Tgg -3′ (57, 66) | |
| OhSSP6-ex2-R (r) | 5′- TAA ACT gAg gTT TAA AgA gAT CCA -3′ (33, 64) | |
| OhSSP6-F (f) | 5′- gCA gCA TgC TTC ATg gAT CTC -3′ (52, 64) | |
| OhSSP6-R (r) | 5′- CCg TgT gAA AAg NTC AgA CAT C -3′ (50, 66) |