| Literature DB >> 35190553 |
Neil D Young1, Andreas J Stroehlein2, Tao Wang2, Pasi K Korhonen2, Margaret Mentink-Kane3, J Russell Stothard4, David Rollinson5,6, Robin B Gasser7.
Abstract
Some snails act as intermediate hosts (vectors) for parasitic flatworms (flukes) that cause neglected tropical diseases, such as schistosomiases. Schistosoma haematobium is a blood fluke that causes urogenital schistosomiasis and induces bladder cancer and increased risk of HIV infection. Understanding the molecular biology of the snail and its relationship with the parasite could guide development of an intervention approach that interrupts transmission. Here, we define the genome for a key intermediate host of S. haematobium-called Bulinus truncatus-and explore protein groups inferred to play an integral role in the snail's biology and its relationship with the schistosome parasite. Bu. truncatus shared many orthologous protein groups with Biomphalaria glabrata-the key snail vector for S. mansoni which causes hepatointestinal schistosomiasis in people. Conspicuous were expansions in signalling and membrane trafficking proteins, peptidases and their inhibitors as well as gene families linked to immune response regulation, such as a large repertoire of lectin-like molecules. This work provides a sound basis for further studies of snail-parasite interactions in the search for targets to block schistosomiasis transmission.Entities:
Mesh:
Year: 2022 PMID: 35190553 PMCID: PMC8861042 DOI: 10.1038/s41467-022-28634-9
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Features of the genome (Btru.v1) of Bulinus truncatus.
| Assembly | Btru.v1 |
|---|---|
| Number of scaffolds | 523 |
| Total size of scaffolds | 1,221,777,273 |
| Longest scaffold; shortest scaffold | 36,501,513; 10,033 |
| Number of scaffolds of >100 kb; 1 Mb; 10 Mb | 484; 271; 20 |
| N50 scaffold length; L50 scaffold count | 4,956,851; 68 |
| Scaffold %GC | 36.27 |
| Scaffold %N | 0.44 |
| Number of contigs | 11,176 |
| Longest contig | 1,917,814 |
| Number of contigs of >100 kb; 1 Mb | 3717;28 |
| N50 contig length; L50 contig count | 234,265;1488 |
| Contig %GC | 36.43 |
| Genome completeness and accuracy: | |
| Complete BUSCOsa | 914 (95.8%) |
| Complete single-copy BUSCOs | 822 (86.2%) |
| Complete and duplicated BUSCOs | 92 (9.6%) |
| Fragmented BUSCOs | 8 (0.8%) |
| Missing BUSCOs | 32 (3.4%) |
aNumber of Benchmarking Universal Single-Copy Orthologs (BUSCOs) identified (genome-mode) and percentage of the 954 genes within the metazoa_odb10 dataset.
Comparison of the features of the draft genome of Bulinus truncatus (Btru.v1) with that of Biomphalaria glabrata GCF_000457365.1_ASM45736v1[12].
| Features | ||
|---|---|---|
| Number of genes/mRNA | 26,292/ 26,292 | 25,539, 36,662 |
| Gene lengtha | 11,860 ± 10,842 | 12,166 ± 17061 |
| mRNA length | 1600 ± 1527 | 1925 ± 1795 |
| Coding domain length | 1600 ± 1527 | 1296 ± 1356 |
| Number of exons | 9 ± 12 | 7 ± 8 |
| Exon length | 175 ± 298 | 263 ± 494 |
| Intron length | 1256 ± 2314 | 1603 ± 3933 |
| Protein length | 532 ± 509 | 431 ± 452 |
| Genes with transcriptional support | 19,274 (73.3%) | |
| Completeness: | ||
| Complete BUSCOsb | 905 (95.0%) | 847 (88.8%) |
| Complete single-copy BUSCOs | 737 (77.3%) | 825 (86.5%) |
| Complete and duplicated BUSCOs | 168 (17.6%) | 22 (2.3%) |
| Fragmented BUSCOs | 26 (2.7%) | 62 (6.5%) |
| Missing BUSCOs | 23 (2.4%) | 45 (4.7) |
aLengths (bp); mean ± standard deviation.
bNumber (%) of Benchmarking Universal Single-Copy Orthologs (BUSCOs) identified (in protein mode) using the Metazoa_odb10 dataset (954 genes) for comparison.
Annotation of the protein-encoding genes (n = 26,292) predicted for Bulinus truncatus.
| Description of approach | Genes (% of total) | Unique annotations |
|---|---|---|
| UniProt TrEMBL database annotation | 23,248 (88.4%) | 16,573 |
| UniProt SwissProt database annotation | 18,139 (69.0%) | 11,450 |
| eggNOG database | 20,436 (77.7%) | 12,788 |
| GeneOntology—eggNOG | 13,009 (49.5%) | 7418 |
| Enzymes—eggnog | 5256 (20.0%) | 1129 |
| InterProScan database annotation | 20,943 (79.7%) | 10,618 |
| Pfam annotation | 17,568 (66.8%) | 4933 |
| Gene Ontology | 15,606 (59.4%) | 2755 |
| MEROPS protease database | 1299 (4.9%) | 537 |
| KEGG Orthology (KO) annotation | ||
| KEGG protein families | 15,844 (60.3%) | 6403 |
| KEGG pathways | 9494 (36.1%) | 3464 |
| Annotated by ≥1 method/databasea | 21,951 (83.5%) | |
| Orphan genes/with transcriptional support | 4341/2053 | |
| Proteins predicted to encode a signal peptide domain | 3121 (11.9%) | |
| Proteins predicted to encode one or more transmembrane domains | 7786 (29.6%) | |
| Proteins predicted to be extracellular/secreted | 1096 (4.2%) |
aWithout homology-based matches in the UniProt TrEMBL database (accessed 20 December 2020).
Fig. 1Synteny between genomes.
Synteny and contiguity of the genome (Btru.v1) of Bulinus truncatus with the draft genome of Biomphalaria glabrata and the chromosomal-level reference genomes of each Achatina immaculata and Pecten maximus, respectively. Scaffolds are arranged in circular (circos) plots with reference scaffolds for Ac. immaculata or P. maximus linked to inferred syntenic blocks of the Bu. truncatus genome using distinctly-coloured bars. Linked syntenic blocks between scaffolds of Bu. truncatus and Bi. glabrata are shown (light blue). a Synteny between the reference genomes of Bu. truncatus and Bi. glabrata established based on the positions of 595 syntenic blocks each containing three or more single-copy orthologs (i.e. 4047 of a total of 11,051). b Synteny between the reference genomes of Bu. truncatus and Ac. immaculata, established based on the positions of mapped Bu. truncatus proteins in 243 syntenic blocks each containing six or more single-copy orthologs (i.e. 2234 of a total of 8517). c Synteny between the reference genome of Bu. truncatus and P. maximus established based on the position of 291 syntenic blocks containing three or more single-copy orthologs (i.e. 1139 of a total of 4685).
Fig. 2Orthologous protein groups.
Orthologous groups of one or more protein(s) in gastropod taxa with available gene annotation. Pecten maximus is an outgroup (bivalve). a Pairwise comparison of the orthologous groups common to Bulinus truncatus and Biomphalaria glabrata. b Genes in orthologous groups common to Bu. truncatus and Bi. glabrata. c UpSet plot of the intersections of unique or shared orthologous groups inferred from protein data sets for Bu. truncatus, Bi. glabrata, Aplysia californica, Elysia chlorotica and P. maximus. d Phylogenetic tree inferred from single copy orthologs aligned among selected gastropod taxa, including all available annotated genes. As the topology of the maximum likelihood (ML) and Bayesian inference (BI) trees was the same, the ML tree is displayed and shows nodal support values for both BI (pp) and ML (bootstrap). e Genes (annotated by KEGG) predicted to be unique to Bu. truncatus (coloured), or shared between or among the five molluscan species included here.
Genome-wide synteny comparisons of the genomes of Bulinus truncatus (Btru.v1), Biomphalaria glabrata (1316-R1/ASM1452496v1), Achatina immaculata and Pecten maximus in a pairwise manner.
| Comparison | Species | Genome size | Total no. of scaffolds | Scaffolds aligned (%) | Total no. of scaffolds aligned | Syntenic blocks/length (%) | No. of single-copy orthologs in syntenic blocks | No. of single-copy orthologs |
|---|---|---|---|---|---|---|---|---|
| 1.23 Gb | 523 | 1.21 Gb (99%) | 472 | 594/557.8 Mb (46%) | 6939 | 11,051 | ||
| 852.0 Mb | 927 | 807.7 Mb (95%) | 526 | 594/396.5 Mb (49%) | 6939 | 11,051 | ||
| 1.23 Gb | 523 | 1.19 Gb (97%) | 436 | 243/411.7 Mb (35%) | 2234 | 8517 | ||
| 1.65 Gb | 31 | 1.65 Gb (100%) | 31 | 243/372.6 Mb (23%) | 2234 | 8517 | ||
| 1.23 Gb | 523 | 1.17 Gb (95%) | 398 | 291/404.9 Mb (35%) | 1139 | 4685 | ||
| 918.3 Mb | 3983 | 844.3 Mb (92%) | 19 | 291/534.2 Mb (63%) | 1139 | 4685 |
Molluscan species/strains studied herein and relevant information on their genomes.
| Species (strain) | Genome code | Accession number | Gene annotation | Assembly level | Scaffold N50 | Size | Reference |
|---|---|---|---|---|---|---|---|
| ASM976088v1 | GCA_009760885.1 | No | Chromosome | 56.4 Mb | 1.65 Gb | Unpublished | |
| AplCal3.0 | GCF_000002075.1 | Yes | Scaffold | 917.5 kb | 927.3 Mb | Unpublished | |
| ASM45736v1 | GCA_000457365.1 | Yes | Scaffold | 48.1 kb | 916.4 Mb | [ | |
| ASM1452496v1 | GCA_014524965.1 | No | Scaffold | 2.60 Mb | 852.0 Mb | [ | |
| ElyChl2.0 | GCA_003991915.1 | Yes | Scaffold | 442.0 kb | 557.5 Mb | [ | |
| xPecMax1.1 | GCF_902652985.1 | Yes | Chromosome | 44.8 Mb | 918.3 Mb | [ |
Key protein groups in Bulinus truncatus and proposed roles and pathway associations – supported by published information.
| Protein group | Number of proteins predicted for | Number of ortho-groups | Known or proposed roles | Pathway associations | References |
|---|---|---|---|---|---|
| Guadeloupe resistance complex (GRC) | 11 (15)a | 8 | Reduces susceptibility to schistosome infection | Cellular processes: lysosome, adherens junction, endocytosis; metabolism: glycosaminoglycan degradation | [ |
| Polymorphic transmembrane cluster 2 (PTC2) | 8 (11)a | 5 | Reduces susceptibility to schistosome infection | Glycan biosynthesis and metabolism; glycosphingolipid biosynthesis | [ |
| BIRs/IAPs | 117 | 49 | Drug response, apoptosis, innate immune responses | Apoptosis, ubiquitin mediated proteolysis, NF-kappa beta signalling, | [ |
| Toll-/IL-1-related proteins | 123 | 49 | Reduces susceptibility to schistosome infection, immune response | Toll-like receptor signalling, necroptosis, NF-kappa beta signalling and HIF-1 signaling pathway | [ |
| Cathepsins | 21 | 7 | Reduces susceptibility to schistosome infection, excretory/secretory product | Transport and catabolism (lysosome, phagosome, autophagy), apoptosis, antigen processing and presentation | [ |
| Chitinases | 103 | 18 | Reduces susceptibility to schistosome infection, excretory/secretory product | Amino sugar and nucleotide sugar metabolism | [ |
| Calmodulins | 42 | 28 | Stress response, drug susceptibility | Immune system: C-type lectin receptor signaling pathway | [ |
| Lectins | 101 | 33 | Immune response | Immune system; C-type lectin receptor signalling pathway | [ |
| Fibrinogen-related proteinsb | 130 | 29 | Susceptibility to schistosome infection; immune response | Signalling molecules and interaction, ECM-receptor interaction, focal adhesion | [ |
aNumber of proteins inferred for Biomphalaria glabrata.
bA distinct group of lectins with a characteristic C-terminal fibrinogen domain.
Fig. 3Gene expansions in key protein groups.
Expansion of protein ortho-groups in Bulinus truncatus predicted to relate to snail-schistosome interactions, based on published information (see Table 6). Cluster dendograms showing orthogroups for Bu. truncatus with the largest gene expansions inferred from protein data sets of Bu. truncatus, Biomphalaria glabrata, Aplysia californica, Elysia chlorotica and Pecten maximus as defined using OrthoFinder. The locations of genes linked to the Guadeloupe resistance complex (GRC) or polymorphic transmembrane cluster 2 (PTC2) are indicated (boxed).
Fig. 4Comparison of select fibrinogen-related proteins (FReDs) of Bulinus truncatus and Biomphalaria glabrata.
a Phylogenetic relationship of a set of curated FReD-like proteins encoded in the genomes of Bu. truncatus and Bi. glabrata. Each branch tip is labelled with an existing FReD classification system for Bi. glabrata[36] (shape) or species (colour). Posterior probability (pp) values are indicated, and groups are numbered (1 to 7). Proteins with predicted signal peptide (SP) domains are indicated with cyan boxes. A distinct colour represents each FReD class, with gene accession numbers boxed. #Bu. truncatus ortho-groups inferred from proteins of Bu. truncatus, Bi. glabrata, Aplysia californica, Elysia chlorotica and Pecten maximus (cf. Figure 3). b Tertiary structure models for FReDs of Bu. truncatus employed for enhanced classification (A–F). C-terminal domain (*), as well as IgSF-like and α-helix and/or β-sheet structures, are shown.