| Literature DB >> 29045404 |
Sarah K Buddenborg1, Lijing Bu1, Si-Ming Zhang1, Faye D Schilkey2, Gerald M Mkoji3, Eric S Loker1.
Abstract
BACKGROUND: Biomphalaria pfeifferi is highly compatible with the widespread human-infecting blood fluke Schistosoma mansoni and transmits more cases of this parasite to people than any other snail species. For these reasons, B. pfeifferi is the world's most important vector snail for S. mansoni, yet we know relatively little at the molecular level regarding the interactions between B. pfeifferi and S. mansoni from early-stage sporocyst transformation to the development of cercariae. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2017 PMID: 29045404 PMCID: PMC5685644 DOI: 10.1371/journal.pntd.0005984
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Samples used for the study with total read numbers and the percent of reads mapping to the S. mansoni genome that were filtered prior to de novo assemblies.
| Field-collected samples | Replicate | Abbreviation | Paired-end reads mapping to | Paired-End Reads/Sample (post- quality filtering) |
|---|---|---|---|---|
| 1 | control-R1 | 0.07% | 28,903,992 | |
| 2 | control-R2 | 0.08% | 34,318,971 | |
| 3 | control-R3 | 0.04% | 27,557,936 | |
| 1 | 1d-R1 | 0.1% | 36,450,649 | |
| 2 | 1d-R2 | 1.5% | 33,634,117 | |
| 3 | 1d-R3 | 1.9% | 30,932,207 | |
| 1 | 3d-R1 | 4.1% | 30,648,913 | |
| 2 | 3d-R2 | 0.1% | 26,445,297 | |
| 3 | 3d-R3 | 13.2% | 31,159,822 | |
| 1 | shedding-R1 | 3.7% | 32,200,842 | |
| 2 | shedding-R2 | 8.2% | 33,570,583 | |
| 3 | shedding-R3 | 0.5% | 27,569,638 | |
| 1 | 35,289,769 | |||
| 2 | 34,450,509 | |||
| 3 | 25,652,418 | |||
| 1 | 30,587,208 | |||
| 2 | 35,071,339 | |||
| 3 | 28,843,961 |
*Samples used in the assembly but expression results not discussed in this paper
‡ See Methods for explanation of S. mansoni read mapping
Fig 1Overview of novel bioinformatics pipeline developed to isolate and analyze B. pfeifferi transcriptomic expression from dual RNA-Seq data.
Illumina sequencing and B. pfeifferi de novo transcriptome assembly summary metrics.
| Number of paired-end reads sequenced | 563,288,171 |
| Number of reads sequenced | 1,126,576,342 |
| Reads surviving quality filtering and trimming | 1,120,661,048 |
| Reads surviving | 1,048,936,142 |
| Trinity | 201,573 |
| Trinity | 225,929 |
| Genome-guided Trinity | 62,682 |
| Genome-guided Trinity | 71,199 |
| Additional 454 reads | 1,244,113 |
| Okay + Okay alternate coding sequences (CDS) | |
| % GC | 44.24 |
| N50 | 654 |
| Longest CDS length | 28,302 |
| Median CDS length | 447 |
| Average CDS length | 634.57 |
| Clusters > = 1Kb | 24,802 |
| % positive strand orientation | 53.2% |
| % negative strand orientation | 46.8% |
| Total predicted ORFs (minimum length = 100 aa) | 166,921 |
| Longest ORF length (aa) | 9,434 |
| Median ORF length (aa) | 157 |
| Average ORF length (aa) | 232.07 |
| Average ORF size of 1,000 longest CDS | 2014.1 |
CDS and predicted protein annotations using publicly available databases.
| Public Database | Annotation Summary |
|---|---|
| BLASTp x nr | 140,484 CDSs (72.3%) |
| BLASTn x nt | 128,028 CDSs (65.9%) |
| InterProScan | 137,778 (70.9%) |
| Gene Ontology (GO) | 50,870 CDSs (26.2%) |
| Unique Molecular Function | 3,246 |
| Unique Cellular Component | 1,618 |
| Unique Biological Process | 8,282 |
| KEGG | 145,197 CDSs (74.7%) |
| Unique KEGG orthologous groups | 3,824 |
| Unique KEGG pathways | 387 |
| Unique KEGG classes | 46 |
| Unique KEGG categories | 6 |
| Cellular Processes | 13,845 |
| Environmental Information Processing | 16,093 |
| Genetic Information Processing | 13,722 |
| Human Diseases | 32,748 |
| Metabolism | 41,022 |
| Organismal Systems | 27,767 |
Fig 2Identification of the innate immune recognition receptors TLRs in B. pfeifferi.
Partial CDS counts had a BLAST hit against a known TLR but all necessary domains could not be confidently determined by InterProScan5.
Fig 3Identification of the innate immune recognition receptors VIgLs in B. pfeifferi with initial BLAST annotation and then verification of protein domains in InterProScan5.
Number of polypeptides queried in various molluscs and matches with B. pfeifferi TransDecoder-predicted ORFs.
| Reference | # Reference polypeptides | Download location | |
|---|---|---|---|
| 36,675 | 127,626 | ||
| 27,591 | 99,884 | ||
| 188,590 | 74,494 | ||
| 31,477 | 77,341 | ||
| 45,406 | 80,505 | ||
| 38,585 | 71,395 |
*Genome is publicly available at link provided
Fig 4Identification of all de novo assembled transcripts after S. mansoni read filtering.
Fig 5Sum of non-B. pfeifferi de novo assembled CDS for each replicate. CDS were counted as present if read count >0.
Fig 6Pie charts of unique CDS found to be differentially expressed in 3v3, 3v2, and 3v1 EBSeq analyses.
Highly up- or down-regulated B. pfeifferi CDS whose response may be required for maintaining a patent S. mansoni infection.
| Annotation | Log2FC 3v3 | Log2FC 3v2 | Log2FC 3v1 | |
|---|---|---|---|---|
| evgTRINITY_DN89401_c6_g1_i1 | GD13313-like | 9.14 | 9.58 | 6.75 |
| evgTRINITY_DN19832_c0_g1_i1 | deleted in malignant brain tumors 1 protein-like | 6.65 | 7.09 | 4.48 |
| evgTRINITY_DN95353_c0_g1_i1 | collagen alpha-3(VI) chain-like | 6.41 | 6.80 | |
| evglcl|G0WVJSS02FGR88 | cAMP-dependent prot kinase catalytic subunit-like | 6.17 | 6.60 | 4.15 |
| evgTRINITY_DN84392_c3_g1_i1 | galactocerebrosidase-like | 5.41 | 5.53 | 5.14 |
| evgTRINITY_DN104940_c0_g1_i1 | cAMP-dependent prot kinase catalytic subunit-like | 5.32 | 5.79 | 3.06 |
| evgTRINITY_DN84179_c0_g1_i1 | uncharacterized transporter slc-17.2-like | 5.31 | 5.21 | 5.20 |
| evgTRINITY_DN16840_c0_g1_i1 | papilin-like | 5.05 | 5.55 | |
| evgTRINITY_GG_14665_c0_g1_i2 | ctenidin-3-like | -5.43 | -4.89 | |
| evgTRINITY_DN92655_c9_g2_i1 | deoxyribonuclease-1-like | -4.84 | -4.63 | |
| evgTRINITY_DN68720_c0_g1_i1 | testisin-like | -4.69 | -4.18 |
Highlights of general, reproductive, and immune responses of B. pfeifferi in response to S. mansoni infection.
| | phospholipase A2s | Na and Cl dependent glycine transporter 2-like | dermatopontins |
| endoglucanases | neuropeptide Y receptor type 5-like | ficolin-like proteins | |
| Proteases and protease inhibitors | DBH-like monooxygenase protein 1 | macrophage man rec 1-like isoform X1 | |
| Guanine nucleotide-binding protein-like 3 | C-type lectin -6 member A-like | ||
| Translationally-controlled tumor protein | acidic mammalian chitinase-like | ||
| chitinase-3-like protein 1-like | |||
| hemocytin | |||
| laccase-15-like | |||
| laccase-1-like | |||
| tyrosinase-1-like | |||
| | FMRF-amide receptor-like | Cu, Zn superoxide dismutase | |
| Tyrosinase-like protein tyr-1 | GTPase IMAP family members 4 and 7 | ||
| beta 1,3 glucan-bind protein-like precursor | |||
| complement C1q-like protein | |||
| fibrinogen-related protein 2 (FREP2) | |||
| | ATP synthase FO6 | macrophage expressed gene-1 | |
| spermine oxidase | |||
| glutathione-S-transferase | |||
| laccase-2-like | |||
| | glyceraldehyde-3-phosphate dehydrogenase | ovipostatin 2 | FREP12 and its precursors |
| respiratory pigment hemoglobin | tyramine/dopamine β-hydroxylase-like | toll-like receptor 8 | |
| insulin-like peptide 7 –modestly down | FMRF-amide isoform X2 –modestly down | cytidine deaminase | |
| pedal peptide 2 | PTSP-like molecule | zinc metalloproteinase /disintegrin-like | |
| Na dependent nutrient aa transporter1-like | pheromone Alb-1 | ||
| enterin | type 1 serotonin receptor | ||
| FMRF-amide isoform X2 | schistosomin | ||
| cytidine deaminase | |||
| soma ferritins | |||
| collagens | |||
| acidic mammalian chitinase-like proteins | |||
| cytochrome c oxidases | |||
| Mucins | |||
| Cytochrome P450 family members | |||
| Multidrug resistance proteins | |||
| Some heat shock proteins | |||
| | phospholipase A2s | Na and Cl dependent glycine transporter 2-like | GTPase IMAPs- complex, but mostly up |
| endoglucanases | temptin-like | beta-1,3-glucan binding proteins | |
| Proteases and protease inhibitors | kynurenine 3-monooxygenase-like | complement C1q-like proteins | |
| 17-beta hydroxysteroid dehydrogenase type 6 | probable serine carboxypeptidases (1–5) | ||
| betaine homocysteine-methyltransferase 1-like | glutathione S-transferases | ||
| translationally controlled tumor protein homolog | laccase-2-like | ||
| hemoglobin type 1 | |||
| | insulin-related peptide-3-like | Tyrosinase-like protein tyr-1 | dermatopontins |
| cytochrome b | Na- and Cl-dependent taurine transporter-like | ficolins | |
| serine proteases alpha and beta | dopamine receptor 2-like | Cu-Zn superoxide dismutases | |
| ADP, ATP carrier-like protein | C-type lectin domain family 6, A-like | ||
| heparinase-like isoform X! | chitinase-3-like protein | ||
| serpin B6-like | chitotriosidase-1-like | ||
| aplysianin-like proteins | |||
| FREP2, FREP5 | |||
| macrophage-expressed gene 1 protein-like | |||
| laccase-15-like | |||
| tyrosinase-1-like | |||
| | profilin | ovipostatin 6 | hemocytin |
| cathepsin B and L1-like | yolk ferritin precursor | hemagglutinin/amoebocyte aggreg factor-like X1 | |
| neuroglobinase-like | DBH-like monooxygenase protein 1 | G-type lysozyme | |
| chymotrypsin-like elastase family member | sialate–O-acetylesterase-like protein | ||
| histone transcription factor | peroxidase-like protein | ||
| fibrinogen-like protein A | |||
| FREP 7 | |||
| peptidoglycan-recognition proteins SC2-like | |||
| LRR-containing 15-like, toll-like receptor 13 | |||
| | glyceraldehyde-3-phosphate dehydrogenase | FMRF-amide-like isoform X2 –modestly down | caveolin-1-like |
| aryl hydrocarbon recep nucl translocator-like | tyramine/dopamine β-hydroxylase-like | disintegrin/metalloprot containing prot t17-like | |
| FMRF-amide isoform X2 –modestly down | FREP12 precursors | ||
| PTSP-like molecule | LRR contain G-prot coupled rec 5-like | ||
| pheromone Alb-1 | alpha-crystalline B chain-like | ||
| type 1 serotonin receptor | toll-like receptors 4 and 8 | ||
| schistosomin | cytidine deaminase | ||
| | tyramine/dopamine β-hydroxylase-like | ||
| ornithine decarboxylase | |||
| actins | |||
| collagens | |||
| tubulins | |||
| mucins | |||
| cytochrome P450 members- mostly up | |||
| multidrug resistance proteins | |||
| heat shock proteins | |||
| thioredoxins | |||
| annexins | |||
| putative copper-containing amine oxidases | |||
| soma ferritins | |||
| | FMRF-amide receptor-like—modestly up | dopamine beta hydroxylase-like | pcrotocadherein Fat 3 or 4-like |
| small cardioactive peptides | FMRF-amide receptor-like | ADAM family mig-17-like | |
| phospholipases A2s | ovipostatin 5 | zinc metalloproteinase nas 13- & 14-like | |
| arginase-1-like isoform X2 | DBH-like monooxygenase protein | ficolins | |
| reverse transcriptase | |||
| protease inhibitors BPTI Kunitz-domain class | |||
| ubiquitin ISG15 | |||
| Angiopoietin–1 receptor | |||
| Angiopoietin-related 2-like | |||
| mucins—complex but most are up | |||
| soma ferritins | |||
| | endonuclease G mitochondrial-like | yolk ferritin-like and snail yolk ferritin molecules | aplysianin-A-like |
| zinc carboxypeptide A 1-like | Neuropeptide Y receptor type 5-like | mammal ependymin-related prot 1- like | |
| serpinB3-like protease inhibitor | zinc carboxypeptidase A1-like | ||
| cystatin protease inhibitor | beta-1,3-glucan binding protein-like | ||
| putative amine-oxidases (copper containing) | FREP 2, 7 and 14 | ||
| | multiple epidermal growth factor-like domains | macrophage-expressed gene | |
| serine/threonine-protein kinase mos-like | C-type lectin -6 member A-like | ||
| chitinase-3-like-protein | |||
| LRR and Ig domain containing protein | |||
| toll-like receptor 3 | |||
| | insulin-like gr fact protein acid labile subunit | ovipostatin 2 | toll-like receptor 7 |
| pedal peptide 2 | dopamine beta-hydroxylase-like | galectin-6 | |
| profilin-like isoform X1 | probable serine carboxypeptidase CPVL | ||
| neuroglobin-like | |||
| calreticulin-like | |||
| tyrosinase tyr-3 | |||
| | hemoglobin | dihydropyrimidinase | |
| collagen-related | macrophage man recep1-like protein | ||
| tyrosinase-3-like | |||
| | tyrosinase-1-like | ||
| collagens, mixed but mostly down | |||
| cathepsins | |||
| tubulins | |||
| cytochromes | |||
| ankyrins | |||
| Rho GTPase-activity protein 1-like | |||
| cytochrome P450 family members | |||
| multidrug resistance proteins | |||
| glutathione-S-transferases | |||
| dermatopontins | |||
| GTPase IMAP family members | |||
| thioredoxins—but mostly up | |||
Fig 7Multidimensional scaling (MDS) plots of pairwise comparisons of control versus 1d, 3d, and shedding replicates used for differential expression analyses.
Fig 8Biomphalaria pfeifferi differential expression profiles in 1d, 3d, and shedding snails.
(A) Overall expression profiles for up- and down-regulated B. pfeifferi CDS in the 3v3 DE analysis with proportions shown for CDS with annotation known (white) and without annotation (gray) from one of the 5 databases searched (Table 3). Numbers by bars refer to numbers of up- and down-regulated features. (B) Heat map of differentially expressed B. pfeifferi CDS. (C) Up- and down-regulated B. pfeifferi CDS shared between 1d, 3d, and shedding snail groups in the 3v3 DE analysis are shown.
Fig 9Biomphalaria pfeifferi CDS identified as neuropeptides, hormones, or involved in reproduction that are differentially expressed in 1d, 3d, and shedding snails.
Note that the 3v3 comparison includes all 3 infected snails within a time point, whereas 3v2 includes the two infected snails with the most S. mansoni reads and the 3v1 includes only the infected snail with the fewest S. mansoni reads.
Fig 10Differential expression of Biomphalaria pfeifferi defense-related CDS in 1d, 3d, and shedding snails.
(A) Defense CDS in the 3v3 DE analysis. (B) Pie charts of proportions of CDS found to be DE in 3v3, 3v2, and 3v1 analyses. (C) Heat maps show expression levels from each of the three DE analyses highlighting the most relevant biological functional groups. Note that the 3v3 comparison includes all 3 infected snails within a time point, whereas 3v2 includes the two infected snails with the most S. mansoni reads and the 3v1 includes only the infected snail with the fewest S. mansoni reads.
Fig 11qPCR results validate Illumina RNA-Seq differential expression results.
(A) Quantitative real-time PCR verifies Illumina trends among biological replicates in 1d, 3d, and shedding samples. (B) Corresponding Illumina DE results for the four genes tested. Asterisks indicate genes that are significantly DE.