| Literature DB >> 30325945 |
Devendra Kumar Biswal1, Tanmoy Roychowdhury2, Priyatama Pandey2, Veena Tandon3.
Abstract
Many trematode parasites cause infection in humans and are thought to be a major public health problem. Their ecological diversity in different regions provides challenging questions on evolution of these organisms. In this report, we perform transcriptome analysis of the giant intestinal fluke, Fasciolopsis buski, using next generation sequencing technology. Short read sequences derived from polyA containing RNA of this organism were assembled into 30,677 unigenes that led to the annotation of 12,380 genes. Annotation of the assembled transcripts enabled insight into processes and pathways in the intestinal fluke, such as RNAi pathway and energy metabolism. The expressed kinome of the organism was characterized by identifying all protein kinases. A rough draft genome assembly for Fasciolopsis buski is also reported herewith with SRA accessions for crosschecking the findings in the analyzed transcriptome data. Transcriptome data also helped us to identify some of the expressed transposable elements. Though many Long Interspersed elements (LINEs) were identified, only two Short Interspersed Elements (SINEs) were visible. Overall transcriptome and draft genome analysis of F. buski helped us to characterize some of its important biological characteristics and provided enormous resources for development of a suitable diagnostic system and anti-parasitic therapeutic molecules.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30325945 PMCID: PMC6191129 DOI: 10.1371/journal.pone.0205570
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1COG classification of F. buski transcriptome.
The X-axis and Y-axis represent different COG categories and number of unigenes associated with each COG category respectively.
Fig 2Gene ontology classification of F. buski transcriptome.
GO terms assigned to Unigenes were classified into three major functional classes; Cellular component, Biological process and Molecular function.
Fig 3Venn diagram showing a number of annotated unigenes using sequence based and domain based comparisons.
A) Unigenes were compared with NCBI NR, Uniprot and Nembase4 for sequence-based annotation. Circle intersections represent number of unigenes found in more than one databases. B) Unigenes were compared against COG and InterPro databases for domain-based annotation. C) Circle intersections represent number of unigenes annotated using sequence based and domain based comparisons respectively as well as number of shared unigenes in both categories. The numbers outside circles represent number of unannotated sequences after employing all these comparisons.
Comparative sequence analysis of F. buski transcriptome with different trematodes.
| Organism | 1E-05 | 1E-15 | 1E-30 |
|---|---|---|---|
| 15,538 (50.6%) | 12,376 (40.3%) | 9,880 (32.2%) | |
| 11,635 (37.9%) | 9,250 (30.1%) | 7,289 (23.7%) | |
| 11,188 (36.4%) | 8,858 (28.8%) | 6,984 (22.7%) | |
| 11,004 (35.8%) | 7,207 (23.4%) | 4,695 (15.3%) | |
| 9,742 (31.7%) | 7,146 (23.2%) | 4,984 (16.2%) | |
| 8,634 (28.1%) | 6,433 (20.9%) | 4,685 (15.2%) |
Fig 4Venn diagram showing number of homologs found in F. buski transcriptome against transcriptome/EST datasets of major trematode families of Opisthorchiidae (Opisthorchis viverrini and Clonorchis sinensis), Fasciolidae (Fasciola hepatica and Fasciola gigantica) and Schistosomatidae (Schistosoma mansoni and Schistosoma japonicum).
The number (14,046) outside the circles represents the count of unmatched genes from a total of 30,677 predicted genes in the F. buski transcriptome.
Fig 5Bayesian phylogenetic relationship among the representative helminth species based on 12 PCGs from their mitochondrial DNA.
Phylogenetic analyses of concatenated nucleotide sequence datasets for all 12 PCGs were performed using four MCMC chains in bayesian analysis run for 1,000,000 generations, sampled every 1,000 generations. Bayesian posterior probability (BPP) values were determined after discarding first 25% of trees as burn-in. Posterior support values appear at nodes. Species representing Nematoda (Ascaridida) were taken as outgroup.
KEGG pathways identified from F. buski transcriptome using KAAS.
| Metabolic Pathways | Unique KO terms | Top KEGG pathway terms |
|---|---|---|
| Carbohydrate metabolism | 196 | Glycolysis/Gluconeogenesis[ko00010] |
| Energy metabolism | 111 | Oxidative phosphorylation [ko00190] |
| Lipid metabolism | 105 | Glycerophospholipid metabolism [ko00564] |
| Nucleotide metabolism | 111 | Purine metabolism [ko00230] |
| Amino acid metabolism | 131 | Valine, leucine, isoleucine degradation [ko00280] |
| Metabolism of other amino acids | 37 | Glutathione metabolism [ko00480] |
| Glycan biosynthesis and metabolism | 91 | N-Glycan biosynthesis [ko00510] |
| Metabolism of cofactor and vitamins | 69 | Porphyrin and chlorophyll metabolism [ko00860] |
| Metabolism of terpenoids and polyketides | 18 | Terpenoid backbone biosynthesis [ko00900] |
| Biosynthesis of other secondary metabolites | 14 | Isoquinoline alkaloid biosynthesis [ko00950] |
| Xenobiotics biodegradation and metabolism | 35 | Drug metabolism–other enzymes [ko00983] |
| Transcription | 142 | Spliceosome [ko03040] |
| Translation | 293 | RNA transport [ko3013] |
| Folding, sorting and degradation | 256 | Protein processing in endoplasmic reticulum [ko04141] |
| Replication and repair | 130 | Nucleotide excision repair [ko03420] |
| Membrane transport | 12 | ABC transporters [ko02010] |
| Signal transduction | 310 | PI3K-AKt signaling pathway |
| Signaling molecules and interaction | 21 | Neuroactive ligand-receptor interaction |
| Transport and catabolism | 154 | Endocytosis [ko04144] |
| Cell motility | 39 | Regulation of actin cytoskeleton [ko04810] |
| Cell growth and death | 184 | Cell cycle [ko04681] |
| Cell communication | 103 | Focal adhesion [ko04510] |
| Immune system | 167 | Chemokine signaling pathway [ko04062] |
| Endocrine system | 128 | Insulin signaling pathway [ko04910] |
| Circulatory system | 34 | Vascular smooth muscle contraction [ko04270] |
| Digestive system | 86 | Pancreatic secretion [ko04972] |
| Excretory system | 54 | Vasopressin-regulated water reabsorption [ko04962] |
| Nervous system | 209 | Neurotrophin signaling pathway [ko04722] |
| Sensory system | 19 | Phototransduction—fly [ko04745] |
| Development | 46 | Axon guidance [ko04360] |
| Environmental adaptation | 37 | Circadian entrainment [ko04713] |
| Cancers | 334 | Pathways in cancer [ko05200] |
| Immune diseases | 25 | Rheumatoid arthritis [ko05323] |
| Neurodegenerative diseases | 226 | Huntington’s disease [ko05016] |
| Substance dependence | 58 | Alcoholism [ko05034] |
| Cardiovascular diseases | 24 | Viral myocarditis [ko05416] |
| Endocrine and metabolic diseases | 10 | Type–II diabetes mellitus [ko04930] |
| Infectious diseases | 419 | Epstein–Barr virus infection [ko05169] |
Fig 6A pie chart displaying a number of significant matches found against nine different protein kinase classes (according to EMBL kinase database) in F. buski transcriptome.
The protein kinase groups are represented by: (i) CMGC- cyclin-dependent, mitogen activated, glycogen synthase and CDK-like serine/threonine kinases; (ii) CAMK—Calcium/Calmodulin-dependent serine/threonine kinases; (iii) TK–Tyrosine kinases; (iv) TKL—Tyrosine kinase-like; (v) AGC—cAMP-dependent, cGMP-dependent and protein kinase C serine/threonine kinases; (vi) STE–serine/threonine protein kinases associated with MAP kinase cascade; (vii) CK1—Casein kinases and close relatives; (viii) RGC—Receptor guanylate cyclase kinases: represented by a single protein, receptor guanylate cyclase kinase and (ix) other unclassified kinases.
Fig 7Protease and protease inhibitors (MEROPS terms) found in F. buski transcriptome.
Numbers in brackets indicate the count of MEROPS terms.
Proteins of RNAi pathway identified from F. buski transcriptome.
| Small RNA biosynthesis | dcr-1, drh-1/3, drsh-1, pash-1, rde-4, xpo-1/2/3 | dcr-1, drh-1/3, drsh-1, pash-1, xpo-1/2/3 |
| dsRNA uptake and spreading | sid-1/2, rsd-3/6 | rsd-3 |
| siRNA amplification | rsd-2, ego-1, rrf-1/3, smg-2/5/6 | smg-2/6 |
| Argonautes | alg-1/2/3/4, csr-1, ergo-1, nrde-3, ppw-1/2, prg-1/2, rde-1, sago-1/2 | alg-1/2 |
| RISC proteins | ain-1/2, tsn-1, vig-1 | tsn-1 |
| RNAi inhibitors | adr-1/2, eri-1/3/5/6/7, lin-15b, xrn-1/2 | eri-1/7, xrn-1/2 |
| Nuclear RNAi effectors | cid-1, ekl-1/4/5/6, gfl-1, mes-2/3/6, mut-2/7/16, rde-2, rha-1,zfp-1 | cid-1, ekl-4/6, gfl-1, mes-2/6, rha-1, zfp-1 |
Number of different repeat elements identified from genome and transcriptome of F. buski.
| Number of elements in Genome | Number of elements in Transcriptome | |
|---|---|---|
| Retroelements | 3,556 | 3,720 |
| SINE | 0 | 2 |
| Penelope | 50 | 129 |
| LINEs: | 3,155 | 3,477 |
| L2/CR1/Rex | 94 | 96 |
| R2/R4/NeSL | 6 | 11 |
| RTE/Bov-B | 2,808 | 2,975 |
| LTR elements | 401 | 241 |
| BEL/Pao | 14 | 6 |
| Gypsy/DIRS1 | 387 | 235 |
| DNA transposons | 0 | 0 |
| Simple repeats | 51,778 | 19,541 |
| Low complexity | 425 | 1,443 |