| Literature DB >> 33527140 |
Rafael Eiji Iwama1,2, Michael Tessler3,4,5, Mark E Siddall, Sebastian Kvist1,2.
Abstract
Bloodfeeding is employed by many parasitic animals and requires specific innovations for efficient feeding. Some of these innovations are molecular features that are related to the inhibition of hemostasis. For example, bloodfeeding insects, bats, and leeches release proteins with anticoagulatory activity through their salivary secretions. The antistasin-like protein family, composed of serine protease inhibitors with one or more antistasin-like domains, is tightly linked to inhibition of hemostasis in leeches. However, this protein family has been recorded also in non-bloodfeeding invertebrates, such as cnidarians, mollusks, polychaetes, and oligochaetes. The present study aims to 1) root the antistasin-like gene tree and delimit the major orthologous groups, 2) identify potential independent origins of salivary proteins secreted by leeches, and 3) identify major changes in domain and/or motif structure within each orthologous group. Five clades containing leech antistasin-like proteins are distinguishable through rigorous phylogenetic analyses based on nine new transcriptomes and a diverse set of comparative data: the trypsin + leukocyte elastase inhibitors clade, the antistasin clade, the therostasin clade, and two additional, unnamed clades. The antistasin-like gene tree supports multiple origins of leech antistasin-like proteins due to the presence of both leech and non-leech sequences in one of the unnamed clades, but a single origin of factor Xa and trypsin + leukocyte elastase inhibitors. This is further supported by three sequence motifs that are exclusive to antistasins, the trypsin + leukocyte elastase inhibitor clade, and the therostasin clade, respectively. We discuss the implications of our findings for the evolution of this diverse family of leech anticoagulants.Entities:
Keywords: anticoagulants; antistasin; bloodfeeding; leeches; protein evolution
Year: 2021 PMID: 33527140 PMCID: PMC7851590 DOI: 10.1093/gbe/evaa242
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
List of Transcriptomes Used in the Present Study and Their Respective Statistics
| Species | SRR | TSA | TSA Used in This Study | Reference | Raw Reads | Contigs | ORFs | N50 |
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| SRR10997429 | — | — |
| 30,043,367 | 95,949 | 74,793 | 1,543 |
|
| SRR9668457 | — | — |
| 22,053,563 | 150,477 | 108,372 | 2,250 |
|
| SRR10997419 | — | — |
| 18,614,401 | 83,045 | 49,988 | 1,260 |
|
| SRR5353276 | — | — |
| 22,692,862 | 113,512 | 58,490 | 725 |
|
| SRR5353263 | — | — |
| 29,866,771 | 142,713 | 89,432 | 1,448 |
|
| SRR5353279 | — | — |
| 30,019,522 | 21,555 | 11,349 | 626 |
|
| SRR5353252 | — | — |
| 25,982,583 | 152,918 | 77,737 | 1,042 |
|
| SRR5353275 | — | — |
| 28,843,703 | 66,148 | 37,199 | 1,001 |
|
| SRR5353272 | — | — |
| 8,536,064 | 139,997 | 79,610 | 939 |
|
| SRR5353264 | — | — |
| 21,051,306 | 98,804 | 65,050 | 1,075 |
|
| SRR11434463 | — | — |
| 54,545,318 | 116,168 | 67,198 | 2,168 |
|
| SRR5353257 | — | — |
| 6,930,865 | 58,513 | 41,218 | 1,098 |
|
| SRR5353250 | — | — |
| 25,630,160 | 120,312 | 64,644 | 1,260 |
|
| SRR5353249 | — | — |
| 24,922,104 | 53,921 | 40,186 | 1,494 |
Note.—New transcriptomes are denoted in bold.
Fig. 1Best scoring ML tree resulting from the analysis of the putative antistasin-like proteins (ln L = −62,971.666): (A) best scoring ML tree with Leech antistasins clade collaped; (B) Leech antistasins clade of the best scoring ML tree. Nodes with likelihood bootstrap support values above 75 are denoted by blue circles. Branches are drawn proportionate to change. Archetypal leech bioactive proteins are denoted in bold. Red and black branches represent leech and non-leech sequences, respectively.
Fig. 2Motif distribution on the best scoring ML tree based on the MEME suit. (A) Clade 1; (B) Clade 2; (C) Antistasin clade; (D) TLI clade; (E) Therostasin clade. Each color denotes a different motif. Archetypal leech bioactive proteins are denoted in bold. Red and black branches represent leech and non-leech sequences, respectively
Fig. 3Alignment of antistasin-like proteins isolated from Oligochaeta and Hirudinea: (A) archetypal antistasin (AAA29193) and ghilanten (P16242); (B) archetypal theromin (P82354) and therostasin (Q9NBW4); (C) archetypal poecistasin, piguamerin (P81499), guamerin (AAD09442), hirustasin (P80302), and bdellastasin (P82107); and (D) contig BP524404 from Eisenia andrei, BF422342 from Lumbricus rubellus, therostasin (Q9NBW4), theromin (P82354), guamerin (AAD09442), and antistasin (AAA29193). See text for further discussion.