| Literature DB >> 21408104 |
Neil D Young1, Aaron R Jex, Cinzia Cantacessi, Ross S Hall, Bronwyn E Campbell, Terence W Spithill, Sirikachorn Tangkawattana, Prasarn Tangkawattana, Thewarach Laha, Robin B Gasser.
Abstract
Fasciola gigantica (Digenea) is an important foodborne trematode that causes liver fluke disease (fascioliasis) in mammals, including ungulates and humans, mainly in tropical climatic zones of the world. Despite its socioeconomic impact, almost nothing is known about the molecular biology of this parasite, its interplay with its hosts, and the pathogenesis of fascioliasis. Modern genomic technologies now provide unique opportunities to rapidly tackle these exciting areas. The present study reports the first transcriptome representing the adult stage of F. gigantica (of bovid origin), defined using a massively parallel sequencing-coupled bioinformatic approach. From >20 million raw sequence reads, >30,000 contiguous sequences were assembled, of which most were novel. Relative levels of transcription were determined for individual molecules, which were also characterized (at the inferred amino acid level) based on homology, gene ontology, and/or pathway mapping. Comparisons of the transcriptome of F. gigantica with those of other trematodes, including F. hepatica, revealed similarities in transcription for molecules inferred to have key roles in parasite-host interactions. Overall, the present dataset should provide a solid foundation for future fundamental genomic, proteomic, and metabolomic explorations of F. gigantica, as well as a basis for applied outcomes such as the development of novel methods of intervention against this neglected parasite.Entities:
Mesh:
Year: 2011 PMID: 21408104 PMCID: PMC3051338 DOI: 10.1371/journal.pntd.0001004
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Characteristics of the transcriptome of the adult stage of Fasciola gigantica.
|
| |
| Sequence reads before assembly | 21.9 million, each 76 bp in length |
| Total unique sequences after assembly | 30,525 (524±524; 201–18,098) |
| Possible host or other contaminant sequences quarantined | 12 |
| GC content (%) | 46.0±4.2 |
| Illumina reads mapped to sequences | 5.7 million (26.20%) |
| Coverage (number of reads mapped to each sequence) | 187.8±469.0 (3 – 17,118) |
a Summarized as number of sequences (average sequence length ± standard deviation; minimum and maximum sequence lengths).
b Summarized as number of sequences (proportion of total sequences used for the analysis).
c Based on homology to sequences within NCBI non-redundant, Clonorchis sinensis, Fasciola hepatica, Opisthorchis viverrini, Schistosoma mansoni and S. japonicum sequence databases.
Protein sequence homology inferred between or among Fasciola gigantica, other parasitic trematodes, and selected eukaryotic organisms.
|
| Number of sequences with homology (%) | ||
| <1E−05 | <1E−15 | <1E−30 | |
|
| 27,354 (89.6) | 25,390 (83.2) | 20,798 (68.2) |
|
| 13,762 (45.1) | 10,135 (33.2) | 6,410 (21) |
|
| 12,913 (42.3) | 9,252 (30.3) | 5,543 (18.2) |
|
| 11,005 (36.1) | 6,859 (22.5) | 3,773 (12.4) |
|
| 10,752 (35.2) | 6,841 (22.4) | 3,900 (12.8) |
| Sequences with homology to at least one member of Class Trematoda | 27,745 (90.9) | 25,725 (84.3) | 21,100 (69.2) |
|
| 9,222 (30.2) | 5,969 (19.6) | 3,323 (10.9) |
|
| 9,226 (30.2) | 5,910 (19.4) | 3,262 (10.7) |
|
| 9,196 (30.1) | 5,969 (19.6) | 3,321 (10.9) |
|
| 9,101 (29.8) | 5,850 (19.2) | 3,225 (10.6) |
|
| 8,761 (28.7) | 5,596 (18.3) | 3,085 (10.1) |
|
| 8,702 (28.5) | 5,563 (18.2) | 3,046 (10) |
|
| 8,267 (27.1) | 5,241 (17.2) | 2,854 (9.4) |
|
| 7,249 (23.8) | 4,372 (14.3) | 2,353 (7.7) |
|
| 4,175 (13.7) | 2,248 (7.4) | 1,175 (3.9) |
| Sequences with homology to at least one non-trematode | 10,364 (34) | 6,623 (21.7) | 3,641 (11.9) |
a All amino acid sequences conceptually translated from sequence data were searched against protein databases using BLASTx employing permissive (E-value of 1E−05), moderate (E-value of 1E−15) and stringent (E-value of 1E−30) search strategies.
b Sequence database contains available transcriptomic data.
c Sequence database contains translated proteins from entire genomic sequence data.
Comparative genomic analysis between or among the Trematodaa.
|
| Number of sequences with homology (%) | ||
| <1E−05 | <1E−15 | <1E−30 | |
| Fasciolidae | 27,354 (89.6) | 25,390 (83.2) | 20,798 (68.2) |
| Fasciolidae and Schistosomatidae | 11,678 (38.3) | 8,328 (27.3) | 4,940 (16.2) |
| Fasciolidae and Opisthorchiidae | 8,179 (26.8) | 4,372 (14.3) | 2,120 (6.9) |
| Fasciolidae and Opisthorchiidae and Schistosomatidae | 6,583 (21.6) | 3,476 (11.4) | 1,641 (5.4) |
| Fasciolidae and Opisthorchiidae and Schistosomatidae and ENSEMBL | 5,878 (19.3) | 2,995 (9.8) | 1,370 (4.5) |
| Sequences common to all trematodes and not present in ENSEMBL | 705 (2.3) | 481 (1.6) | 253 (0.8) |
a Including representatives of the Fasciolidae (Fasciola gigantica and Fasciola hepatica), Opisthorchiidae (Clonorchis sinensis and Opisthorchis viverrini) and Schistosomatidae (Schistosoma mansoni and S. japonicum).
b All amino acid sequences conceptually translated from sequence data were searched against protein databases using BLASTx employing permissive (E-value of 1E−05), moderate (E-value of 1E−15) and stringent (E-value of 1E−30) search strategies.
c Inferred proteins homologous to those of eukaryotic model organisms (cf. Table 2).
Biological pathways inferred from the transcriptome of the adult stage of Fasciola gigantica.
| Parent KEGG pathway | Sequences | Top KEGG pathway terms |
|
| ||
| Behavior | 1 (0.0) | Circadian rhythm [ko04710] |
| Cell communication | 510 (11.4) | Focal adhesion [ko04510]; Tight junction [ko04530] |
| Cell growth and death | 274 (6.1) | Cell cycle [ko04110]; Cell cycle - yeast [ko04111] |
| Cell motility | 151 (3.4) | Regulation of actin cytoskeleton [ko04810] |
| Development | 92 (2.1) | Axon guidance [ko04360]; Dorso-ventral axis formation [ko04320] |
| Endocrine system | 481 (10.8) | Insulin signaling pathway [ko04910]; Melanogenesis [ko04916] |
| Immune system | 290 (6.5) | Antigen processing and presentation [ko04612]; Leukocyte transendothelial migration [ko04670] |
| Nervous system | 144 (3.2) | Long-term potentiation [ko04720]; Long-term depression [ko04730] |
| Sensory system | 42 (0.9) | Olfactory transduction [ko04740]; Taste transduction [ko04742] |
|
| ||
| Membrane transport | 154 (3.4) | ABC transporters [ko02010]; Other ion-coupled transporters [ko00000] |
| Signal transduction | 798 (17.9) | MAPK signaling pathway [ko04010]; Calcium signaling pathway [ko04020] |
| Signaling molecules and interaction | 97 (2.2) | ECM-receptor interaction [ko04512]; Cell adhesion molecules (CAMs) [ko04514] |
|
| ||
| Folding, sorting and degradation | 278 (6.2) | Ubiquitin mediated proteolysis [ko04120]; Proteasome [ko03050] |
| Replication and repair | 97 (2.2) | Other replication, recombination and repair proteins [ko00000]; DNA polymerase [ko03030] |
| Transcription | 83 (1.9) | RNA polymerase [ko03020]; Basal transcription factors [ko03022] |
| Translation | 207 (4.6) | Ribosome [ko03010]; Aminoacyl-tRNA biosynthesis [ko00970] |
|
| ||
| Cancers | 535 (12.0) | Colorectal cancer [ko05210]; Prostate cancer [ko05215] |
| Infectious diseases | 47 (1.1) | Epithelial cell signaling in |
| Metabolic disorders | 36 (0.8) | Type II diabetes mellitus [ko04930]; Type I diabetes mellitus [ko04940] |
| Neurodegenerative disorders | 105 (2.4) | Huntington's disease [ko05040]; Alzheimer's disease [ko05010] |
|
| ||
| Amino acid metabolism | 486 (10.9) | Lysine degradation [ko00310]; Glutamate metabolism [ko00251] |
| Biosynthesis of polyketides and nonribosomal peptides | 8 (0.2) | Biosynthesis of vancomycin group antibiotics [ko01055]; Polyketide sugar unit biosynthesis [ko00523] |
| Biosynthesis of secondary metabolites | 75 (1.7) | Limonene and pinene degradation [ko00903]; Alkaloid biosynthesis II [ko00960] |
| Carbohydrate metabolism | 630 (14.1) | Starch and sucrose metabolism [ko00500]; Glycolysis/Gluconeogenesis [ko00010] |
| Energy metabolism | 221 (4.9) | Oxidative phosphorylation [ko00190]; Carbon fixation [ko00710] |
| Glycan biosynthesis and metabolism | 182 (4.1) | N-Glycan biosynthesis [ko00510]; O-Glycan biosynthesis [ko00512] |
| Lipid metabolism | 324 (7.3) | Glycerophospholipid metabolism [ko00564]; Glycerolipid metabolism [ko00561] |
| Metabolism of cofactors and vitamins | 191 (4.3) | Folate biosynthesis [ko00790]; Nicotinate and nicotinamide metabolism [ko00760] |
| Metabolism of other amino acids | 96 (2.1) | Selenoamino acid metabolism [ko00450]; Glutathione metabolism [ko00480] |
| Nucleotide metabolism | 253 (5.7) | Purine metabolism [ko00230]; Pyrimidine metabolism [ko00240] |
| Xenobiotics biodegradation and metabolism | 138 (3.1) | Benzoate degradation via CoA ligation [ko00632]; Caprolactam degradation [ko00930] |
a Pathway mapping was based on homology to annotated proteins in the Kyoto encyclopedia of genes and genomes (KEGG) database.
b Within parentheses is the percentage of the total number of sequences predicted to be homologous to KEGG orthologues.
c The two most frequently reported KEGG pathways.
Figure 1A summary of conserved groups of protein inferred from the transcriptome of the adult stage of Fasciola gigantica.
Absolute numbers and percentages are given in parentheses. Category 1 (A) and category 2 (B) proteins were inferred based on homology to proteins in the Kyoto encyclopedia of genes and genomes (KEGG) database. Protein kinases (C) and proteases (D) were inferred based on homology to proteins in the EMBL Sarfari kinase and/or MEROPS databases. Kinase-like molecules were grouped within the: cyclin-dependent, mitogen-activated protein, glycogen synthase and CDK-like serine/threonine kinases (CMGC); Ca2+/calmodulin-dependent serine/threonine kinases (CAMK); cAMP-dependent, cGMP-dependent and protein kinase C serine/threonine kinases (AGC); serine/threonine protein kinases associated with the mitogen-activated protein kinase cascade (STE); tyrosine kinase (TK); tyrosine kinase-like (TKL); and other unclassified kinases (Other).