Won Gi Yoo, Dae-Won Kim, Jung-Won Ju, Pyo Yun Cho, Tae Im Kim, Shin-Hyeong Cho, Sang-Haeng Choi, Hong-Seog Park, Tong-Soo Kim, Sung-Jong Hong.
Abstract
Clonorchis sinensis is the causative agent of a life-threatening disease endemic to China, Korea, and Vietnam. It is estimated that about 15 million people are infected with this fluke. C. sinensis provokes inflammation, epithelial hyperplasia, and periductal fibrosis in bile ducts, and may cause cholangiocarcinoma in chronically infected individuals. Accumulation of a large amount of biological information about the adult stage of this liver fluke in recent years has advanced our understanding of the pathological interplay between this parasite and its hosts. However, no developmental gene expression profiles of C. sinensis have been published. In this study, we generated gene expression profiles of three developmental stages of C. sinensis by analyzing expressed sequence tags (ESTs). Complementary DNA libraries were constructed from the adult, metacercaria, and egg developmental stages of C. sinensis. A total of 52,745 ESTs were generated and assembled into 12,830 C. sinensis assembled EST sequences, and these assemblies were then categorized into groups according to biological function and developmental stage. Most of the genes that were differentially expressed in the different stages were consistent with the biological and physical features of the particular developmental stage: high-energy metabolism, motility, and reproduction genes were differentially expressed in adults; minimal metabolism and final host adaptation genes were differentially expressed in metacercariae; and embryonic genes were differentially expressed in eggs. The higher expression of glucose transporters, proteases, and antioxidant enzymes in the adults accounts for active uptake of nutrients and defense against host immune attacks. The types of ion channels present in C. sinensis are consistent with its parasitic nature and phylogenetic placement in the tree of life. We anticipate that the transcriptomic information on essential regulators of development, bile chemotaxis, and physico-metabolic pathways in C. sinensis presented in this study will guide further studies to identify novel drug targets and diagnostic antigens.
Year: 2011 PMID: 21738807 PMCID: PMC3125140 DOI: 10.1371/journal.pntd.0001208
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Transcriptome feature of three developmental stages from Clonorchis sinensis.
| | Adults | Metacercariae | Eggs | All |
| | 30,144 | 20,256 | 10,368 | 60,768 |
| | 27,070 | 15,872 | 9,803 | 52,745 |
| | 528 | 524 | 569 | 522 |
| | 7,779 | 5,398 | 2,660 | 12,830 |
| | 3,921 | 2,728 | 1,523 | 7,184 |
| | 3,858 | 2,670 | 1,137 | 5,646 |
| | 720 | 671 | 758 | 724 |
| | 3,993 | 2,718 | 1,630 | 6,413 |
| | 3,488 | 2,427 | 1,504 | 5,269 |
| | 1,674(48%) | 1,138(47%) | 868(58%) | 2,356(45%) |
| | 1,814(52%) | 1,289(53%) | 636(42%) | 2,913(55%) |
| | 3,786 | 2,680 | 1,030 | 6,417 |
+: Total analyzed reads were determined using stringent quality filtering and comprise reads with a minimum of 100 bases and a Phred quality ≥20.
*: Total known genes were determined by BLASTX homology search, requiring homology ≥25% and a minimum exact match length of ≥30 amino acids.
‡: Each stage (column) was generated from its own dataset. Independently, the data for all stages were assembled from the combination of adults, metacercariae, and eggs.
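The read-filtering criterion in the first footnote (at least 100 bases, Phred quality of at least 20) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function name `passes_filter` is hypothetical, and the sketch assumes the Phred threshold applies to the mean quality of a read, which the footnote does not specify.

```python
from statistics import mean
from typing import Sequence

MIN_LENGTH = 100  # minimum read length in bases (from the footnote)
MIN_PHRED = 20    # minimum Phred quality (from the footnote)


def passes_filter(sequence: str, qualities: Sequence[int]) -> bool:
    """Return True if a read meets both thresholds.

    Assumption: the Phred threshold is applied to the read's mean quality;
    a per-base criterion would need a different check.
    """
    if len(sequence) != len(qualities):
        raise ValueError("sequence and qualities must have the same length")
    if len(sequence) < MIN_LENGTH:
        return False
    return mean(qualities) >= MIN_PHRED


# Toy reads for illustration only, not data from the study.
good_read = passes_filter("A" * 120, [30] * 120)
short_read = passes_filter("A" * 80, [30] * 80)
low_quality_read = passes_filter("A" * 120, [15] * 120)
```

Only the first toy read passes; the second is too short and the third falls below the quality threshold.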
Figure 1. Predicted functions of C. sinensis transcripts based on gene ontology analysis.
Distribution of molecular functional categories (a) and biological process categories (b) according to homology to genes in Uniprot with a gene ontology classification.
SNPs of C. sinensis identified using AutoSNP software.
| No. of sequences in each contig | No. of contigs with SNPs | No. of total SNPs | Total consensus length (bp) | SNP frequency (per 100 bp) |
| 2 | 950 | 2,398 | 689,883 | 0.35 |
| 3 | 760 | 2,396 | 604,004 | 0.40 |
| 4 | 520 | 2,267 | 451,608 | 0.50 |
| 5 | 128 | 305 | 112,313 | 0.27 |
| 6 | 123 | 300 | 117,116 | 0.26 |
| 7–10 | 108 | 212 | 96,884 | 0.22 |
| 11–20 | 108 | 279 | 108,014 | 0.26 |
| 21–30 | 59 | 171 | 78,977 | 0.22 |
| 31–50 | 64 | 226 | 104,073 | 0.22 |
| >50 | 76 | 523 | 112,516 | 0.46 |
| Total | 2,896 | 9,077 | 2,475,388 | 0.37 |
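The last column of the SNP table follows from the two columns before it: SNP frequency per 100 bp is the number of SNPs divided by the total consensus length, multiplied by 100. The sketch below recomputes it from the values in the table; the variable and function names are illustrative only.

```python
# (sequences per contig, total SNPs, total consensus length in bp), copied from the table
SNP_ROWS = [
    ("2", 2398, 689883),
    ("3", 2396, 604004),
    ("4", 2267, 451608),
    ("5", 305, 112313),
    ("6", 300, 117116),
    ("7-10", 212, 96884),
    ("11-20", 279, 108014),
    ("21-30", 171, 78977),
    ("31-50", 226, 104073),
    (">50", 523, 112516),
]


def snp_frequency_per_100bp(snps: int, length_bp: int) -> float:
    """SNPs per 100 bp of consensus sequence."""
    return 100.0 * snps / length_bp


frequencies = {
    label: round(snp_frequency_per_100bp(snps, length_bp), 2)
    for label, snps, length_bp in SNP_ROWS
}
total_snps = sum(snps for _, snps, _ in SNP_ROWS)
total_length = sum(length_bp for _, _, length_bp in SNP_ROWS)
total_frequency = round(snp_frequency_per_100bp(total_snps, total_length), 2)

for label, freq in frequencies.items():
    print(f"{label:>6}: {freq:.2f} SNPs per 100 bp")
print(f" Total: {total_frequency:.2f} SNPs per 100 bp")
```

The column sums reproduce the Total row (9,077 SNPs over 2,475,388 bp), so the overall frequency is a length-weighted value rather than an average of the per-row frequencies.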
Comparison of the transcriptomes of C. sinensis and selected eukaryotes.
| | Unigene (%) | | | | | |
| Homology | | | | | | All organisms |
| | 19(0.2%) | 18(0.2%) | 16(0.2%) | 18(0.2%) | 16(0.2%) | 37(0.3%) |
| | 158(1.6%) | 159(1.6%) | 137(1.4%) | 114(1.3%) | 205(2.0%) | 346(3.0%) |
| | 905(8.6%) | 906(8.7%) | 733(7.6%) | 564(6.2%) | 1,125(11.2%) | 1,674(14.7%) |
| | 1,696(16.5%) | 1,687(16.4%) | 1,325(14.4%) | 1,078(12.3%) | 1,936(20.4%) | 2,850(26.0%) |
| | 2,305(22.9%) | 2,302(23.0%) | 1,821(20.5%) | 1,468(17.8%) | 2,424(26.6%) | 3,654(33.9%) |
| | 3,110(31.7%) | 3,092(31.8%) | 2,396(28.2%) | 1,946(24.9%) | 2,910(33.3%) | 4,643(43.6%) |
| | 3,477(35.4%) | 3,412(35.2%) | 2,669(31.4%) | 2,193(28.2%) | 3,028(35.0%) | 5,269(49.1%) |
+: EST data were searched against the entire protein database by BLASTX.
*: More than 25% homology over at least 30 amino acid residues deduced from CsAEs.
%: The percentage of the total 12,830 CsAEs (matched clusters) that matched the protein database of model eukaryotes.
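The matching criterion in the footnotes (more than 25% homology over at least 30 amino acid residues) can be expressed as a simple filter over BLASTX hits. The sketch below is an assumption-laden illustration: the `Hit` record and its field names are hypothetical, "homology" is treated as percent identity, and the hits are toy values rather than results from the study.

```python
from typing import Iterable, NamedTuple, Set


class Hit(NamedTuple):
    """One BLASTX hit (hypothetical record layout)."""
    query_id: str
    percent_identity: float   # assumed stand-in for "homology"
    alignment_length_aa: int  # aligned length in amino acid residues


def is_match(hit: Hit, min_identity: float = 25.0, min_length: int = 30) -> bool:
    """Apply the footnote's thresholds: >25% identity over >=30 residues."""
    return hit.percent_identity > min_identity and hit.alignment_length_aa >= min_length


def matched_queries(hits: Iterable[Hit]) -> Set[str]:
    """Return the set of query sequences with at least one qualifying hit."""
    return {hit.query_id for hit in hits if is_match(hit)}


# Toy hits for illustration only.
toy_hits = [
    Hit("q1", 42.0, 120),  # passes both thresholds
    Hit("q2", 24.0, 200),  # identity too low
    Hit("q3", 80.0, 25),   # alignment too short
]
matched = matched_queries(toy_hits)
```

Counting queries rather than hits keeps a sequence with several qualifying hits from being counted more than once.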
Ultraconserved contigs of C. sinensis.
| Accession No. | Descriptions | | | | | | All |
| CAO79607.1 | Beta-tubulin | ○ | ○ | ○ | ○ | ○ | ○ |
| AAQ16109.1 | Elongation factor 1-alpha | ○ | ○ | ○ | ○ | ○ | ○ |
| ABS52704.1 | Heat shock protein 70 | ○ | ○ | ○ | ○ | - | ○ |
| NP_001086877.1 | Translation elongation factor 2 | ○ | ○ | ○ | ○ | - | ○ |
| ABS81352.1 | Phosphoglucose isomerase | ○ | ○ | ○ | ○ | - | ○ |
| AAH83344.1 | Tubulin, alpha 1A | ○ | ○ | ○ | ○ | ○ | ○ |
| CAQ13492.1 | Propionyl coenzyme A carboxylase, beta polypeptide | ○ | ○ | - | ○ | - | ○ |
| AAW27581.1 | SJCHGC09453 protein | ○ | ○ | ○ | ○ | ○ | ○ |
| AAI61792.1 | Unknown | ○ | ○ | ○ | ○ | - | ○ |
| AAS93901.1 | Glycogen phosphorylase | ○ | ○ | ○ | ○ | ○ | ○ |
| AAM69406.1 | Heat shock protein HSP60 | ○ | ○ | ○ | ○ | ○ | ○ |
| BAB84579.1 | Actin 2 | ○ | ○ | ○ | ○ | ○ | ○ |
| AAW27659.1 | SJCHGC00820 protein | ○ | ○ | ○ | ○ | ○ | ○ |
| XP_001118968.1 | Glucan (1,4-alpha-), branching enzyme 1 | ○ | ○ | ○ | ○ | - | ○ |
| XP_001107041.1 | Oxoglutarate dehydrogenase-like isoform 2 | ○ | ○ | ○ | ○ | - | ○ |
| XP_001089681.1 | ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit | ○ | ○ | ○ | ○ | - | ○ |
| XP_001627515.1 | Predicted protein | - | ○ | ○ | - | - | ○ |
| AAW27320.1 | SJCHGC06322 protein | - | - | - | - | ○ | ○ |
| AAW27782.1 | SJCHGC00653 protein | - | - | - | - | ○ | ○ |
| AAW26140.1 | SJCHGC02536 protein | - | - | - | - | ○ | ○ |
| AAX27366.2 | SJCHGC05847 protein | - | - | - | - | ○ | ○ |
| AAW27345.1 | SJCHGC09272 protein | - | - | - | - | ○ | ○ |
| AAW26056.1 | SJCHGC05577 protein | - | - | - | - | ○ | - |
| AAW27129.1 | SJCHGC06305 protein | - | - | - | - | ○ | - |
| XP_001177707.1 | Gag-pol polyprotein | - | - | - | - | - | ○ |
| XP_001628202.1 | Predicted protein | - | - | - | - | - | ○ |
| XP_001952362.1 | Zinc finger protein | - | - | - | - | - | ○ |
| ABI26619.1 | Enolase | - | - | - | - | - | ○ |
*Cut-off value <1e−200.
Figure 2. Global relative similarity between C. sinensis and other species analyzed at the whole-transcriptome scale.
Each C. sinensis contig and singlet was searched against the whole transcriptome using TBLASTX (score cut-off ≥50). Similarity comparison of parasitic organisms with a free-living flatworm (A) or with a free-living nematode (B). Square tiles indicate genes, colored by their highest TBLASTX score against each of the databases: red ≥300; yellow ≥200; green ≥150; blue ≥100; and purple <100.
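The color scheme in the Figure 2 caption amounts to binning each gene's highest TBLASTX score. A minimal sketch of that binning follows; the function name `tile_color` is hypothetical, and returning `None` for scores below the cut-off of 50 is an assumption about how sub-threshold hits are handled.

```python
from typing import Optional

SCORE_CUTOFF = 50  # TBLASTX score cut-off stated in the caption

# (lower bound, color) pairs from the caption, checked from highest to lowest
COLOR_BINS = [
    (300, "red"),
    (200, "yellow"),
    (150, "green"),
    (100, "blue"),
]


def tile_color(score: float) -> Optional[str]:
    """Map a gene's highest TBLASTX score to its tile color.

    Scores below the cut-off get no tile (assumption); scores from the
    cut-off up to, but not including, 100 are purple.
    """
    if score < SCORE_CUTOFF:
        return None
    for lower_bound, color in COLOR_BINS:
        if score >= lower_bound:
            return color
    return "purple"


example_colors = [tile_color(s) for s in (350, 200, 199, 120, 75, 40)]
print(example_colors)
```

Checking the bins from the highest threshold downward means each score lands in exactly one color class.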
Developmental expression of ion-channels and transporters in C. sinensis.
| Category | Description | No. of reads (Adult) | No. of reads (Metacercaria) | No. of reads (Egg) |
|
| Potassium channel protein | 10 | 4 | 2 |
|
| High voltage-gated Ca2+ channel, beta subunit, CavB | 2 | 2 | 1 |
| High voltage-activated Ca2+ channel, CavA | 0 | 1 | 0 | |
|
| Amiloride-sensitive sodium channel-related protein | 1 | 0 | 0 |
|
| Chloride channel protein 7 | 7 | 4 | 1 |
|
| Solute carrier family 17 (sodium phosphate) | 2 | 0 | 0 |
| Solute carrier family 5 (sodium/glucose cotransporter) | 77 | 0 | 0 | |
| Sodium/sialic acid cotransporter, putative | 2 | 0 | 0 | |
| Sodium/bile acid cotransporter | 3 | 14 | 0 | |
| Potassium-dependent sodium-calcium exchanger | 1 | 0 | 0 | |
| Sodium/hydrogen exchanger 7, 9 | 1 | 0 | 0 | |
| Sodium/myo-inositol cotransporter | 1 | 0 | 0 | |
| Sodium/proton exchanger 3 | 1 | 0 | 0 | |
|
| Na+/K+-transporting ATPase beta | 4 | 0 | 0 |
| Na+/K+-transporting ATPase beta-2 chain | 1 | 0 | 0 | |
|
| Glucose transporter | 47 | 0 | 20 |
| Sodium/glucose co-transporter | 77 | 0 | 0 | |
|
| Amino acid transporter | 22 | 2 | 9 |
|
| Zinc transporter | 15 | 5 | 4 |
|
| Glycerol-3-phosphate transporter | 2 | 2 | 3 |
| Phosphate transporter | 2 | 0 | 1 | |
|
| Fatty acid transporter, member 1 | 16 | 18 | 2 |
Figure 3. Developmental expression of proteases and protease inhibitors, antioxidant enzymes, and heat shock proteins in C. sinensis.
Relative reads refer to the number of reads calculated in proportion to the read size for each developmental stage (adult:metacercaria:egg = 2.76:1.62:1). (A) Proteases, (B) protease inhibitors, (C) antioxidant enzymes, (D) stress response proteins. Dyp, dye-decolorizing peroxidase; GST, glutathione S-transferase; SOD, superoxide dismutase; GPX, glutathione peroxidase; GRX, glutaredoxin; PRX, peroxiredoxin; TRXR, thioredoxin reductase; TRX, thioredoxin; HSP, heat shock protein.
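The stage ratio quoted in the Figure 3 caption is consistent with the per-stage read counts in the transcriptome table (27,070 adult, 15,872 metacercaria, and 9,803 egg reads). The sketch below derives the ratio and applies it to one row of raw counts. Dividing raw reads by the stage ratio is an assumption about how "relative reads" were calculated, and the function names are illustrative.

```python
from typing import Dict

# Per-stage read counts taken from the transcriptome table
ANALYZED_READS = {"adult": 27070, "metacercaria": 15872, "egg": 9803}


def stage_ratios(reads: Dict[str, int], reference: str = "egg") -> Dict[str, float]:
    """Express each stage's read count relative to the reference stage."""
    return {stage: count / reads[reference] for stage, count in reads.items()}


def relative_reads(raw: Dict[str, int], ratios: Dict[str, float]) -> Dict[str, float]:
    """Scale raw read counts by library size (assumed normalization)."""
    return {stage: raw[stage] / ratios[stage] for stage in raw}


ratios = stage_ratios(ANALYZED_READS)

# Raw counts for the glucose transporter row of the ion-channel and transporter table
glucose_transporter = {"adult": 47, "metacercaria": 0, "egg": 20}
normalized = relative_reads(glucose_transporter, ratios)

print({stage: round(value, 2) for stage, value in ratios.items()})
print({stage: round(value, 1) for stage, value in normalized.items()})
```

Under this assumed normalization the adult count of 47 scales to about 17, below the egg count of 20, which shows why library size matters when raw read counts are compared across stages.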
Expression of genes associated with apoptosis, cell proliferation, and cancer development in C. sinensis.
| Category | Description | No. of reads (Adult) | No. of reads (Metacercaria) | No. of reads (Egg) |
|
| Inhibitor of apoptosis protein | 9 | 8 | 0 |
| Apoptosis-linked gene 2 protein | 1 | 3 | 9 | |
| Cell-cycle and apoptosis regulatory protein 1 | 14 | 8 | 0 | |
|
| Granulin | 7 | 4 | 2 |
| Granulin precursor (Proepithelin) (PEPI) | 36 | 1 | 17 | |
| EGF | 1 | 2 | 0 | |
| EGF/Laminin | 9 | 2 | 13 | |
| Multiple EGF-like-domains 6 | 0 | 2 | 0 | |
| TGF-beta receptor interacting protein 1 | 2 | 3 | 0 | |
|
| c-Jun N-terminal kinase | 23 | 11 | 0 |
| Catenin (cadherin-associated protein) | 5 | 15 | 8 | |
| Axin 1 | 0 | 3 | 0 | |
| Cell division control protein 42 | 2 | 2 | 3 | |
| Cyclin-dependent kinase | 11 | 4 | 3 | |
| Death-associated protein kinase | 2 | 0 | 0 | |
| DNA mismatch repair protein | 3 | 2 | 0 | |
| DNA repair protein | 10 | 0 | 0 | |
| Growth factor receptor-binding protein | 1 | 1 | 4 | |
| Histone deacetylase | 12 | 9 | 9 | |
| Integrin beta | 12 | 3 | 0 | |
| Laminin | 8 | 4 | 13 | |
| MFS transporter, SP family, solute carrier family | 77 | 0 | 0 | |
| Mitogen-activated protein kinase kinase | 6 | 0 | 2 | |
| RING-box protein 1 | 9 | 0 | 0 | |
| Serine/threonine-protein kinase | 32 | 29 | 23 | |
| Transcription elongation factor | 8 | 1 | 2 | |
| Transcription factor | 27 | 20 | 15 | |
| Tropomyosin | 30 | 38 | 12 | |
*Refer to Reference No. 25.
Putative serodiagnostic antigens.
| EST ID | Description |
| CL1051Contig1 | Male sterility protein |
| CSA22042 | Cathepsin L-like cysteine proteinase A |
| CL1502Contig1 | Protein disulfide isomerase-related protein P5 precursor |
| CL2119Contig1 | LAG1 (longevity assurance homolog) |
| CL2671Contig1 | Flagelliform silk protein |
| CL11Contig2 | Cryptosporidial mucin, large Thr stretch, signal peptide sequence |
| CSM15703 | TGF-beta receptor interacting protein 1 |
| CSA19952 | DJ-1_PfpI |
| CL3175Contig1 | Eukaryotic translation initiation factor 4A2 isoform 1 |
| CL1573Contig1 | Predicted protein |
| CL3180Contig1 | Peptidyl-prolyl cis-trans isomerase B precursor (PPIase B) (Cyclophilin B) |
| CL6354Contig1 | UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 7 |
| CL6614Contig1 | Oxysterol binding protein |
| CL5658Contig1 | Vesicular mannose-binding lectin |
| CL222Contig1 | Glycoprotein X precursor |
| CSA21481 | Stromal interaction molecule 2 |
| CSA01410 | Calreticulin precursor (SM4 protein) |
| CL31Contig2 | Melibiase family protein |
| CSA17420 | Group 14 allergen protein |