| Literature DB >> 31730621 |
Awtum M Brashear1,2, Wanlapa Roobsoong3, Faiza A Siddiqui2, Wang Nguitragool4, Jetsumon Sattabongkot3, Margarita M López-Uribe1, Jun Miao2, Liwang Cui1,2.
Abstract
Plasmodium ovale accounts for a disproportionate number of travel-related malaria cases. This parasite is understudied since there is a reliance on clinical samples. We collected a P. ovale curtisi parasite isolate from a clinical case in western Thailand and performed RNA-seq analysis on the blood stage transcriptomes. Using both de novo assembly and alignment-based methods, we detected the transcripts for 6628 out of 7280 annotated genes. For those lacking evidence of expression, the vast majority belonged to the PIR and STP1 gene families. We identified new splicing patterns for over 2500 genes, and mapped at least one untranslated region for over half of all annotated genes. Our analysis also detected a notable presence of anti-sense transcripts for over 10% of P. ovale curtisi genes. This transcriptomic analysis provides new insights into the blood-stage biology of this neglected parasite.Entities:
Mesh:
Year: 2019 PMID: 31730621 PMCID: PMC6881071 DOI: 10.1371/journal.pntd.0007850
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Sequencing performance.
*For assembly data, the genome and transcriptome had different assembly and post-assembly methodology. For these metrics report, we used all contigs.
| Genome | Poly-A Selected | Not Selected | |
|---|---|---|---|
| 2x250bp | 150bp | 150bp | |
| 17,518,533 | 12,917,696 | 12,808,819 | |
| 13,048,280 (74.5%) | 11,381,290 (88.1%) | 9,976,224 (77.9%) | |
| 150x | 57x | 10x | |
| 25,122 | 35,916 | NA | |
| 6,046 | 1,178 | ||
| NA | 8,377,744 (64.9%) | 8,805,493 (68.7%) |
Fig 1Gene expression within the P. ovale curtisi field sample.
(A) Overlap between assembly and alignment-based methodology. Genes identified by peptide prediction on transcripts and transcript fragments are outlined in dark blue while the genes identifiable by transcription at TPM greater than or equal to 1 are in lighter blue. The red outer circle displays all annotated genes, including 83 of which were not identified by either means. (B) Proportion of genes annotated from the existing genome (PocGH01, PlasmoDB.org) compared to those found by alignment at 1 or more TPM in each of the statistically significant Gene Ontology groupings (Bonferroni-adjusted P-value < 0.05). (C) Distribution of TPM (Log transformed) per gene grouping with significantly high or low transcription in the P. ovale field sample. The line through the center represents the median of all genes. (D) Gene family repertoires. Bars represent genes above TPM 1, while points within the study bar represent whole peptides, defined as those with both a beginning and end present in the ORF.
Fig 2Examples of different splicing patterns in P. ovale clinical isolate.
(A) New transcript identified with additional exon toward the center of PocGH01_14030000. (B) A missing intron in some transcripts of PocGH01_0804470. (C) A shift in splicing point resulting in a shorter intron for PocGH01_11024900. (D) A shift in splicing point resulting in a longer exon in PocGH01_11032600. For each the first panel represents read density, the second represents transcript structure.
Fig 3Trends in UTR length and anti-sense transcripts.
(A) A histogram depicting the number of UTRs reaching each length (in bp) from all ORF predictions of assembled transcripts. To denote distance from coding section of the gene, five prime UTRs (left) were given negative values while three prime UTRs (right) were given positive values. The gene model on the bottom is depicted merely for visualization. (B) Distribution of proportion anti-sense reads by gene. Dotted lines separate genes into ranges and terms for each group represent terms significantly enriched (P > 0.05, Bonferroni-adjusted) within genes of that group.