| Literature DB >> 27619014 |
Ben-Wen Li1, Samantha N McNulty2, Bruce A Rosa2, Rahul Tyagi2, Qing Ren Zeng3, Kong-Zhen Gu3, Gary J Weil1, Makedonka Mitreva4,5.
Abstract
BACKGROUND: Paragonimiasis is an important and widespread neglected tropical disease. Fifteen Paragonimus species are human pathogens, but two of these, Paragonimus westermani and P. skrjabini, are responsible for the bulk of human disease. Despite their medical and economic significance, there is limited information on the gene content and expression of Paragonimus lung flukes.Entities:
Keywords: Adult stage; Comparative transcriptomics; Lung fluke; Paragonimus; Paragonimus skrjabini; Paragonimus westermani; RNA-Seq
Mesh:
Substances:
Year: 2016 PMID: 27619014 PMCID: PMC5020434 DOI: 10.1186/s13071-016-1785-x
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 3.876
Sequencing, assembly and annotation of the transcriptomes of adult P. westermani and P. skrjabini
|
|
| |
|---|---|---|
| Sequence data | ||
| Raw read pairs | 46,468,226 | 49,816,749 |
| Clean read pairs | 34,096,586 | 38,071,235 |
| Raw transcript assembly | ||
| Unigenes | 54,488 | 90,091 |
| Transcripts | 71,317 | 126,745 |
| Filtered transcript assembly | ||
| Unigenes | 21,586 | 25,825 |
| Transcripts | 27,842 | 35,312 |
| Mean unigene length (bp) | 813.0 ± 598.5 | 772.4 ± 570.0 |
| Mean transcript length (bp) | 853.9 ± 600.1 | 834.8 ± 595.9 |
| Fragmentation rate | 24.3 % | 26.7 % |
| Predicted proteins | ||
| Unique protein translations | 26,431 | 32,706 |
| Unigenes | 21,585 | 25,822 |
| Transcripts | 27,838 | 35,305 |
| Mean protein length (aa) | 271.0 ± 199.5 | 257.5 ± 190.0 |
| Annotation (functional terms / unigenes) | ||
| Unique InterPro domains | 4,190 / 8,853 | 3,263 / 7,152 |
| Unique GO terms | 1,197 / 6,964 | 1,024 / 5,460 |
| Unique KEGG orthologous groups | 3,618 / 13,257 | 3,605 / 12,168 |
| Unique KEGG pathways | 313 / 8,081 | 313 / 7,426 |
| Unique KEGG pathway modules | 218 / 3,189 | 220 / 2,897 |
Fig. 1Unigene length distribution for Paragonimus westermani (a) and Paragonimus skrjabini (b)
Fig. 2Paragonimus-specific proteins from Paragonimus westermani and Paragonimus skrjabini. Predicted proteins from 6,513 P. westermani unigenes and 10,171 P. skrjabini unigenes found no significant match to non-Paragonimus sequences in NR. Of the unigenes with no BLAST match in NR, 1,217 from each assembly were matched to a non-hit unigene in the other assembly
BLASTP comparisons of P. westermani and P. skrjabini proteins with selected trematode species. The number of transcripts/unigenes with BLASTP match (e-value < 1e-05) to subject proteins is indicated. Average percent identity was calculated based on the top hit to the longest isoform of each unigene
|
|
| |
|---|---|---|
|
| 18,144 / 14,164 (62.3 %) | 19,271 / 14,441 (64.0 %) |
|
| 18,489 / 14,423 (61.7 %) | 20,186 / 14,974 (63.6 %) |
|
| 17,246 / 13,403 (59.6 %) | 17,848 / 13,308 (62.2 %) |
|
| 19,529 / 15,075 (85.5 %) | 22,493 / 16,296 (86.4 %) |
|
| – | 22,496 / 16,290 (85.8 %) |
|
| 18,890 / 14,693 (84.3 %) | – |
KEGG categorization of assembled unigenes
|
|
| |
|---|---|---|
| Cellular processes | 1,709 | 1,645 |
| Cell communication | 504 | 512 |
| Cell growth and death | 561 | 495 |
| Cell motility | 239 | 220 |
| Transport and catabolism | 795 | 787 |
| Environmental information processing | 1,455 | 1,372 |
| Membrane transport | 72 | 74 |
| Signal transduction | 1,306 | 1,220 |
| Signaling molecules and interaction | 159 | 157 |
| Genetic information processing | 3,303 | 3,011 |
| Folding, sorting and degradation | 1,142 | 1,087 |
| Replication and repair | 460 | 361 |
| Transcription | 775 | 706 |
| Translation | 1,208 | 1,138 |
| Metabolism | 2,407 | 2,223 |
| Amino acid metabolism | 425 | 395 |
| Biosynthesis of other secondary metabolites | 59 | 54 |
| Carbohydrate metabolism | 539 | 514 |
| Energy metabolism | 428 | 402 |
| Glycan biosynthesis and metabolism | 316 | 304 |
| Lipid metabolism | 386 | 353 |
| Metabolism of cofactors and vitamins | 246 | 216 |
| Metabolism of other amino acids | 185 | 169 |
| Metabolism of terpenoids and polyketides | 102 | 90 |
| Nucleotide metabolism | 391 | 375 |
| Xenobiotics biodegradation and metabolism | 87 | 104 |
Fig. 3Distribution of InterPro domains and KEGG orthologous groups among selected food-borne trematodes. InterPro protein domains and KEGG orthologous groups were assigned to proteins from the complete genomes of Clonorchis sinensis and Fasciola hepatica and to proteins predicted from the Paragonimus westermani and Paragonimus skrjabini transcriptome assemblies, and intersections were determined. Abbreviations: Pw, P. westermani; Cs, C. sinensis; Fh, F. hepatica; Ps, P. skrjabini
Fig. 4Estimated expression of Paragonimus westermani and P. skrjabini assembly unigenes. Corresponding P. westermani and P. skrjabini unigenes identified by bi-directional blast search. The expression levels of each unigene (FPKM) were estimated, and the expression of matched unigenes were compared. a The average fold change plus or minus 1.96 times the standard deviation (corresponding to the top 5th percentile of differential regulation) was used as a cutoff to select unigenes differentially expressed between the two species. b Expression levels of corresponding unigenes were plotted, and differentially expressed unigenes are colored
Top 30 differentially expressed unigenes of P. westermani and P. skrjabini
|
|
| Top non- |
|
| Fold change ( |
|---|---|---|---|---|---|
| comp24563_c2 | comp79108_c0 |
| 39,412.97 | 12.39 | 11.64 |
| comp20235_c3 | comp97009_c0 |
| 4,150.67 | 1.72 | 11.24 |
| comp18122_c0 | comp67960_c0 |
| 4,831.75 | 2.47 | 10.93 |
| comp26350_c1 | comp73392_c0 |
| 3,368.34 | 1.79 | 10.88 |
| comp22100_c0 | comp49569_c0 |
| 3,992.12 | 2.35 | 10.73 |
| comp23043_c0 | comp67491_c0 |
| 2,567.33 | 1.55 | 10.69 |
| comp22520_c0 | comp77222_c0 |
| 2,189.89 | 1.59 | 10.43 |
| comp26308_c0 | comp83382_c0 |
| 16,728.14 | 13.71 | 10.25 |
| comp23905_c0 | comp85010_c1 |
| 29,015.14 | 25.29 | 10.16 |
| comp24308_c0 | comp80461_c0 |
| 3,638.33 | 3.32 | 10.10 |
| comp19972_c0 | comp81354_c0 |
| 2,334.18 | 2.22 | 10.04 |
| comp25450_c1 | comp80216_c0 |
| 4,984.93 | 5.99 | 9.70 |
| comp26673_c2 | comp57972_c0 |
| 2,399.15 | 3.70 | 9.34 |
| comp19193_c2 | comp66866_c0 |
| 2,431.20 | 5.89 | 8.69 |
| comp27091_c0 | comp77961_c0 |
| 286.23 | 2.07 | 7.11 |
| comp16826_c0 | comp85805_c0 |
| 2.13 | 130.95 | -5.94 |
| comp19146_c1 | comp87478_c0 |
| 6.19 | 407.90 | -6.04 |
| comp13982_c0 | comp90793_c1 |
| 2.03 | 139.37 | -6.10 |
| comp29521_c0 | comp86043_c1 | – | 3.25 | 255.19 | -6.29 |
| comp17946_c0 | comp90932_c0 | – | 1.58 | 150.89 | -6.58 |
| comp19442_c0 | comp86832_c0 |
| 4.02 | 395.13 | -6.62 |
| comp65706_c0 | comp89911_c1 | – | 1.36 | 174.31 | -7.00 |
| comp18789_c0 | comp86285_c0 |
| 1.82 | 346.66 | -7.57 |
| comp63230_c0 | comp84630_c0 | – | 4.22 | 1,369.68 | -8.34 |
| comp7414_c0 | comp80611_c0 |
| 7.50 | 2,512.29 | -8.39 |
| comp29913_c0 | comp84815_c0 | – | 4.75 | 4,443.78 | -9.87 |
| comp6439_c0 | comp78453_c1 |
| 3.18 | 4,014.38 | -10.30 |
| comp14255_c0 | comp79740_c0 | – | 3.97 | 6,073.01 | -10.58 |
| comp14876_c0 | comp83945_c0 |
| 4.09 | 6,662.22 | -10.67 |
| comp16223_c0 | comp82281_c1 | – | 3.83 | 8,683.69 | -11.15 |
Fig. 5Alignment of myoglobin orthologs from Paragonimus species and other trematodes. Although assembly fragmentation resulted in a truncated sequence from P. skrjabini, it had greater 90 % similarity with Paragonimus myoglobin (at the amino acid level), with much less similarity to myoglobins from other trematodes. Abbreviations: Pk, Pk34178_txpt1 [16]; Pw, comp20873_c0_seq2; Ps, comp80973_c0_seq3; Cs, C. sinensis gi:349998765; Ov, Opisthorchis viverrini gi: 663047528; Fh, F. hepatica gi:159461074; Sm, S. mansoni gi:256084837; Sj, S. japonicum gi:226487206