| Literature DB >> 34781332 |
Sarah D Davey1, Iain W Chalmers1, Narcis Fernandez-Fuentes1, Martin T Swain1, Dan Smith2, Syed M Abbas Abidi3, Mohammad K Saifullah3, Muthusamy Raman4, Gopalakrishnan Ravikumar4, Paul McVeigh5, Aaron G Maule5, Peter M Brophy1, Russell M Morphew1.
Abstract
Fasciola gigantica is one of the aetiological trematodes associated with fascioliasis, which heavily impacts food-production systems and human and animal welfare on a global scale. In the absence of a vaccine, fascioliasis control and treatment is restricted to pasture management, such as clean grazing, and a limited array of chemotherapies, to which signs of resistance are beginning to appear. Research into novel control strategies is therefore urgently required and the advent of 'omics technologies presents considerable opportunity for novel drug and vaccine target discovery. Here, interrogation of the first available F. gigantica newly excysted juvenile (NEJ) transcriptome revealed several protein families of current interest to parasitic flatworm vaccine research, including orthologues of mammalian complement regulator CD59 of the Ly6 family. Ly6 proteins have previously been identified on the tegument of Schistosoma mansoni and induced protective immunity in vaccination trials. Incorporating the recently available F. gigantica genome, the current work revealed 20 novel Ly6 family members in F. gigantica and, in parallel, significantly extended the F. hepatica complement from 3 to 18 members. Phylogenetic analysis revealed several distinct clades within the family, some of which are unique to Fasciola spp. trematodes. Analysis of available proteomic databases also revealed three of the newly discovered FhLy6s were present in extracellular vesicles, which have previously been prioritised in studying the host-parasite interface. The presentation of this new transcriptomic resource, in addition to the Ly6 family proteins here identified, represents a wealth of opportunity for future vaccine research.Entities:
Mesh:
Year: 2022 PMID: 34781332 PMCID: PMC8763315 DOI: 10.1039/d1mo00254f
Source DB: PubMed Journal: Mol Omics ISSN: 2515-4184
Key assembly statistics for the F. gigantica NEJ transcriptome, generated using ‘assembly-stats’ JavaScript and the OmicsBox functional analysis workflow
| Metric |
|
|---|---|
| Total Sequences (Contigs) | 16 551 |
| Mean sequence length (bp) | 799 |
| Longest transcript (bp) | 4239 |
| Contig N50 length (bp) | 988 |
| Contig N90 length (bp) | 442 |
| GC content (%) | 45.14 |
| AT content (%) | 54.86 |
|
| 0.00 |
Top 20 identified InterPro domains in the NEJ F. gigantica transcriptome according to number of sequences per domain. Example helminth vaccine or drug candidates for each domain type given for reference, where possible. Full domain annotations are provided in the supplementary files
| InterPro ID (IPR-). | Domain. | Number of NEJ transcripts. | Example vaccines or drug candidates |
|---|---|---|---|
| 002048 | EF-hand | 109 | Tegument-allergen like (TAL) proteins.[ |
| 000504 | RNA recognition motif | 102 | — |
| 000719 | Protein kinase | 98 | ePKs (conventional protein kinases)[ |
| 017986 | WD40-repeat-containing | 73 | — |
| 005225 | Small GTP-binding protein | 63 | — |
| 001841 | Zinc finger, RING-type | 45 | — |
| 013087 | Zinc finger, C2H2-type | 44 | — |
| 020683 | Ankyrin repeat-containing | 36 | — |
| 013026 | Tetratricopeptide repeat-containing | 33 | — |
| 001650 | Helicase, C terminal | 32 | — |
| 008139 | Saposin B type | 31 | Saposins.[ |
| 000477 | Reverse transcriptase | 30 | — |
| 000608 | Ubiquitin-conjugating enzyme E2 | 29 | — |
| 001452 | SH3 | 29 | — |
| 014001 | Helicase superfamily 1/2, ATP binding | 29 | — |
| 017452 | GPCR, rhodopsin-like, 7TM | 28 | G-coupled protein receptors[ |
| 001781 | Zinc finger, LIM-type | 28 | — |
| 001623 | DnaJ | 28 | — |
| 004088 | K homology, type 1 | 27 | — |
| 001478 | PDZ | 26 | Syntenin[ |
Fig. 1Multiple sequence alignment of domain region protein sequences (C1XXC2 motif to GPI anchor, N-terminal signal peptide and C terminal hydrophobic region removed) for the twenty novel FgCD59 members. Reference sequences from S. mansoni (SmLy6B and SmLy6G) and F. hepatica (FhCD59-1.1) are included as denoted by the black bar to the left of the alignment. Conserved motifs are indicated above the alignment, with the ten cysteines highlighted by the black boxes. GPI anchors are denoted in grey. Predicted cysteine bonds are indicated by the grey brackets above the alignment. Alignments produced in CLUSTAL Omega and annotated in Jalview.
Summary of all FgLy6s identified by BLAST similarity to known F. hepatica and S. mansoni sequences (at E < 1 × 10−5) and presence of uPAR-domain features. FgLy6s were identified across three databases (NEJ transcriptome, adult transcriptome and F. gigantica genome). Signal peptides were predicted using Signal P 5.0. Transmembrane domains were predicted using TmPred. Protein sequence data corresponding with the NEJ transcript IDs are also available in Supplementary Data
| Ly6 | GenBank accession | Sequence length (amino acids) | Signal peptide | C-terminal transmembrane domain | NEJ | Transcriptomic support in adult |
|
|
|---|---|---|---|---|---|---|---|---|
| A | TPP67930 | 127 | + | + | 01777 | + | + | +b |
| B | TPP57504 | 122 | + | + | 05428 | + | + | − |
| C | TPP61687 | 132 | + | + | 04026 | − | + | +b |
| D | TPP58533 | 121 | + | + | 11947 | + | + | +ab |
| E | — | 109 | + | − | 11274 | − | − | −* |
| F | TPP58767 | 122 | + | + | 05429 | − | + | +b |
| G | TPP66235 | 122 | + | + | 11065 | − | + | +b |
| H | TPP66236 | 120 | + | + | 14360 | − | + | +b |
| I | TPP67438 | 150 | + | + | 10885 | + | + | +ab |
| J | TPP56171 | 143 | + | + | 10181 | + | + | +ab |
| K | TPP58682 | 152 | + | + | 11200 | + | + | +ab |
| L | TPP59839 | 138 | + | + | 01142 | + | + | +ab |
| M | TPP61345 | 153 | + | + | 09765 | + | + | +ab |
| N | — | 117 | + | + | 02813 | + | + | +ab |
| O | TPP67399 | 189 | + | + | 06228 | + | + | +ab |
| P | TPP57978 | 127 | − | + | 02241 | + | + | − |
| Q | TPP67483 | 129 | + | + | − | + | + | +b |
| R | TPP67398 | 197 | + | + | − | + | + | −* |
| S | TPP57976 | 166 | − | − | − | + | + | +a |
| T | TPP67484 | 121 | + | + | − | − | + | −* |
Fig. 2De novo tertiary structural predictions for the twenty novel FgLy6s, produced using Rosetta. Beta strands, loops and helices are represented in yellow, green and red, respectively. Note the conserved central fold and presence of a defined ‘three finger’ motif, primarily characterised by three protruding loops at the top of the protein structure. C and N terminals are also annotated, where visible. The strand breaks visible in FgLy6-B, -I and -J (blue boxes) are artefacts of uniform snapshot scaling and are not present in the full model.
Fig. 3Phylogenetic analysis of all known CD59 family members of F. gigantica, F. hepatica and S. mansoni. Sequences were processed to remove the N-terminal sequence prior to the C1XXC2 motif and C-terminal post GPI anchor before alignment using Clustal Omega. SmLy6D, a double-domain CD59, was further divided into two (D1 and D2). Maximum likelihood analysis was performed in MegaX to 2000 bootstraps with JTT substitution. Life stage-specific expression profiles are indicated by boxes in addition to in vitro expression data where applicable. Clades were defined with an ancestral support cut-off of ≥ 40.