| Literature DB >> 33028892 |
Sasha V Siegel1, Lia Chappell1, Jessica B Hostetler1,2,3, Chanaki Amaratunga2,4,5, Seila Suon6, Ulrike Böhme1, Matthew Berriman1, Rick M Fairhurst2,7, Julian C Rayner8,9.
Abstract
Plasmodium vivax gene regulation remains difficult to study due to the lack of a robust in vitro culture method, low parasite densities in peripheral circulation and asynchronous parasite development. We adapted an RNA-seq protocol "DAFT-seq" to sequence the transcriptome of four P. vivax field isolates that were cultured for a short period ex vivo before using a density gradient for schizont enrichment. Transcription was detected from 78% of the PvP01 reference genome, despite being schizont-enriched samples. This extensive data was used to define thousands of 5' and 3' untranslated regions, some of which overlapped with neighbouring transcripts, and to improve the gene models of 352 genes, including identifying 20 novel gene transcripts. This dataset has also significantly increased the known amount of heterogeneity between P. vivax schizont transcriptomes from individual patients. The majority of genes found to be differentially expressed between the isolates lack Plasmodium falciparum homologs and are predicted to be involved in host-parasite interactions, with an enrichment in reticulocyte binding proteins, merozoite surface proteins and exported proteins with unknown function. An improved understanding of the diversity within P. vivax transcriptomes will be essential for the prioritisation of novel vaccine targets.Entities:
Mesh:
Year: 2020 PMID: 33028892 PMCID: PMC7541449 DOI: 10.1038/s41598-020-73562-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The extent of the P. vivax schizont transcriptome. (a) Size distribution of 5′ UTRs (n = 4155). (b) Size distribution of 3′ UTRs (n = 4091). (c) An overview of the extent of transcription from a representative portion of the P. vivax genome sequence (from chromosome 12). The coloured lines in the upper panel represent directional RNA-seq coverage from each of the four patient isolates, while the lower panel includes gene models on both strands of the genome sequence. (d) Overlapping transcripts are found even within a single life stage in P. vivax. The example shown is of a gene pair in a “tail-to-tail” orientation (PVP01_1416800 and PVP01_1416900). The boundaries of the mRNA sequence of the second gene in this pair is contained within the 3′ UTR sequence of the first gene in the pair.
Figure 2Splice sites present in P. vivax schizonts. (a) Thousands of splice sites were detected in the RNA-seq data (left), with splice sites not found in online databases (right) falling into a range of categories for both coding and non-coding regions of RNAs. (b) An exitron was identified in RNA-seq data for the gene PVP01_1461000. This exitron is 132 nt, a multiple of 3 nt, which can be spliced out without changing the reading frame of this protein. The vast majority of the transcripts for this mRNA contain the spliced form of the exitron.
List of the 25 most expressed genes in each sample (RPKM).
| Isolate 1 | Isolate 2 | Isolate 3 | Isolate 4 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene ID | Name | RPKM | Gene ID | Name | RPKM | Gene ID | Name | RPKM | Gene ID | Name | RPKM |
| PVP01_0532300 | Early transcribed membrane protein (ETRAMP) | 16,197 | PVP01_0532300 | Early transcribed membrane protein (ETRAMP) | 16,809.1 | PVP01_0532300 | Early transcribed membrane protein (ETRAMP) | 15,516.8 | PVP01_0532300 | Early transcribed membrane protein (ETRAMP) | 15,250.1 |
| PVP01_0905900 | Histone 2B, putative (H2B) | 6237.89 | PVP01_0905900 | 18S ribosomal RNA | 8562.75 | PVP01_0202900 | 18S ribosomal RNA | 15,217.2 | PVP01_MIT01200 | Unspecified product | 10,247.8 |
| PVP01_0819300 | Histone H2A.Z, putative (H2A.Z) | 5877.47 | PVP01_0819300 | Histone 2B, putative (H2S) | 6220.65 | PVP01_0905900 | Histone 2B, putative (H2S) | 9824.49 | PVP01_0905900 | Histone 2B, putative (H2B) | 9071.38 |
| PVP01_0422600 | Early transcribed membrane protein (ETRAMP 11.2) | 5294.71 | PVP01_MIT01200 | Early transcribed membrane protein (ETRAMP11.2) | 5362.27 | PVP01_0422600 | Early transcribed membrane protein (ETRAMP11.2) | 7638.99 | PVP01_1138700 | Histone H3, putative (H3) | 4486.27 |
| PVP01_0622400 | Antigen UB05, putative | 3966.68 | PVP01_0422600 | 28S ribosomal RNA | 4654 | PVP01_0504500 | 28S ribosomal RNA | 6651 | PVP01_0819300 | Histone H2A.Z, putative (H2A.Z) | 4364.04 |
| PVP01_MIT01200 | Unspecified product | 3839.68 | PVP01_1138700 | Unspecified product | 4538.19 | PVP01_MIT01200 | Unspecified product | 6089.82 | PVP01_1131700 | Histone H2A, putative (H2A) | 4215.99 |
| PVP01_0612400 | Merozoite capping protein 1, putative (nPrx) | 3800.53 | PVP01_1131700 | Merozoite capping protein 1, putative (nPrx) | 3907.91 | PVP01_0612400 | Merozoite capping protein 1, putative (nPrx) | 4223.72 | PVP01_0612400 | Merozoite capping protein 1, putative (nPrx) | 3913.89 |
| PVP01_1138700 | Histone H3, putative (H3) | 3661.46 | PVP01_0612400 | Histone H4, putative (H4) | 3779.44 | PVP01_0905800 | Histone H4, putative (H4) | 4164.03 | PVP01_0422600 | Early transcribed membrane protein (ETRAMP11.2) | 3656.38 |
| PVP01_0300700 | Plasmodium exported protein, unknown function | 3635.15 | PVP01_1446800 | Actin, putative (ACT1) | 3429.77 | PVP01_1463200 | Actin, putative (ACT1) | 3967.17 | PVP01_0734800 | Early transcribed membrane protein (ETRAMP) | 3410.88 |
| PVP01_1131700 | Histone H2A, putative (H2A) | 3156.23 | PVP01_0622400 | Histone H2A, putative (H2A) | 3171.27 | PVP01_1131700 | Histone H2A, putative (H2A) | 3935.14 | PVP01_0300700 | Plasmodium exported protein, unknown function | 3364.47 |
| PVP01_1463200 | Actin, putative (ACT1) | 2874.06 | PVP01_0734800 | Histone H3, putative (H3) | 2978.07 | PVP01_1138700 | Histone H3, putative (H3) | 3828.61 | PVP01_0905800 | Histone H4, putative (H4) | 3234.02 |
| PVP01_1460700 | Translation initiation factor SUI1, putative (SUI1) | 2693.24 | PVP01_1463200 | Histone H2A.Z, putative (H2A.Z) | 2807.36 | PVP01_0819300 | Histone H2A.Z, putative (H2A.Z) | 3757.5 | PVP01_0622400 | Antigen UB05, putative | 3078.14 |
| PVP01_1446800 | Merozoite surface protein 9 (MSP9) | 2624.68 | PVP01_0300700 | Acyl-CoA binding protein, putative (ACBP) | 2764.72 | PVP01_1430300 | Acyl-CoA binding protein, putative (ACBP) | 3317.42 | PVP01_1463200 | Actin, putative (ACT1) | 3055.19 |
| PVP01_0734800 | Early transcribed membrane protein (ETRAMP) | 2478.01 | PVP01_0202900 | Plasmodium exported protein, unknown function | 2590.76 | PVP01_0300700 | Plasmodium exported protein, unknown function | 3097.2 | PVP01_1460700 | Translation initiation factor SUI1, putative | 3035.99 |
| PVP01_0817200 | Translation machinery-associated protein 7, putative (TMA7) | 2450.16 | PVP01_0817200 | Ookinete surface protein P25 (P25) | 2368.29 | PVP01_0616100 | Ookinete surface protein P25 (P25) | 2525.47 | PVP01_1446800 | Merozoite surface protein 9 (MSP9) | 2956.09 |
| PVP01_1423100 | Histone 2B variant, putative (H2B.Z) | 2073.71 | PVP01_0305600 | Inositol-3-phosphate synthase, putative (INO1) | 2341.59 | PVP01_1022200 | Inositol-3-phosphate synthase, putative (INO1) | 2415.42 | PVP01_0305600 | Sexual stage antigen s16, putative | 2756.14 |
| PVP01_0305600 | Sexual stage antigen s16, putative | 2014.9 | PVP01_1460700 | Early transcribed membrane protein (ETRAMP) | 2262.4 | PVP01_0734800 | Early transcribed membrane protein (ETRAMP) | 2225.38 | PVP01_0616100 | Ookinete surface protein P25 (P25) | 2276.45 |
| PVP01_0616100 | Ookinete surface protein P25 (P25) | 2012.9 | PVP01_1022200 | Histone H2B variant, putative (H2B.Z) | 2183.03 | PVP01_1423100 | Histone H2B variant, putative (H2B.Z) | 2128.31 | PVP01_1022200 | Inositol-3-phosphate synthase, putative (INO1) | 1983.83 |
| PVP01_1022200 | Inositol-3-phosphate synthase, putative (INO1) | 1988.91 | PVP01_0905800 | Translation initiation factor SUI1, putative | 2157.64 | PVP01_1460700 | Translation initiation factor SUI1, putative | 2126.72 | PVP01_1423100 | Histone H2B variant, putative (H2B.Z) | 1886.4 |
| PVP01_1023000 | Translationally-controlled tumour protein homolog, putative (TCTP) | 1855.69 | PVP01_0728900 | Merozoite surface protein 1 (MSP1) | 2045.53 | PVP01_0728900 | Merozoite surface protein 1 (MSP1) | 2113.79 | PVP01_1023000 | Translationally-controlled tumor protein homolog, putative (TCTP) | 1721.61 |
| PVP01_0905800 | Histone H4, putative (H4) | 1756.98 | PVP01_1338500 | Merozoite surface protein 9 (MSP9) | 1954.86 | PVP01_1446800 | Merozoite surface protein 9 (MSP9) | 2072.79 | PVP01_0728900 | Merozoite surface protein 1 (MSP1) | 1709.71 |
| PVP01_1430300 | Acyl-CoA binding protein, putative (ACBP) | 1712.13 | PVP01_1023000 | Antigen UB05, putative | 1932.73 | PVP01_0622400 | Antigen UB05, putative | 1843.76 | PVP01_0417600 | Serine-repeat antigen 5 (SERA) | 1696.03 |
| PVP01_0728900 | Merozoite surface protein 1 (MSP1) | 1673.39 | PVP01_1423100 | Rhoptry-associated membrane antigen, putative (RAMA) | 1873.43 | PVP01_0107500 | Rhoptry-associated membrane antigen, putative (RAMA) | 1834.14 | PVP01_1441300 | cAMP-dependent protein kinase regulatory subunit, putative | 1598.73 |
| PVP01_0716300 | Endoplasmic reticulum chaperone BiP, putative | 1611.07 | PVP01_0107500 | Rhoptry-associated protein 1 (RAP1) | 1830.12 | PVP01_1338500 | Rhoptry-associated protein 1 (RAP1) | 1764.16 | PVP01_1245500 | 40S ribosomal protein S28e, putative | 1593.55 |
| PVP01_1245500 | 40S ribosomal protein S28e, putative | 1579.65 | PVP01_1136400 | Translationally-controlled tumor protein homolog, putative (TCTP) | 1578.84 | PVP01_1023000 | Translationally-controlled tumor protein homolog, putative (TCTP) | 1673.08 | PVP01_0216700 | Plasmodium exported protein, unknown function | 1526.09 |
Differentially expressed genes belonging to clusters (in bold) or multigene families in Cambodian isolates (RPKM-adjusted for sequence variation).
| Gene ID | Name | Isolate 1 RPKM | Isolate 2 RPKM | Isolate 3 RPKM | Isolate 4 RPKM | ||
|---|---|---|---|---|---|---|---|
| PVP01_0000170 | Tryptophan-rich protein (TRAG32) | 29.27 | 240.68 | 43.30 | 127.00 | 0.88 | 85.81 |
| PVP01_0004360 | PIR protein | 65.82 | 88.33 | 148.22 | 7.42 | 0.75 | 43.75 |
| PVP01_0004370 | PIR protein | 162.13 | 55.23 | 24.47 | 25.21 | 0.98 | 63.63 |
| Merozoite surface protein 3, putative | 1246.43 | 3.94 | 1.49 | 1.90 | 1.98 | 1234.29 | |
| Merozoite surface protein 3, putative | 88.55 | 59.88 | 268.46 | 367.45 | 0.75 | 110.02 | |
| PVP01_0122100 | PIR protein | 89.09 | 24.60 | 22.38 | 11.90 | 0.95 | 33.44 |
| PVP01_0201800 | PIR protein | 132.21 | 399.29 | 124.40 | 431.42 | 0.61 | 101.71 |
| PVP01_0405200 | Plasmodium exported protein (PHISTc) | 24.31 | 12.49 | 87.81 | 8.58 | 1.11 | 41.02 |
| PVP01_0417400 | Serine-repeat antigen 4 (SERA) | 96.58 | 42.93 | 61.45 | 192.20 | 0.68 | 44.91 |
| PVP01_0423700 | PIR protein | 51.79 | 11.96 | 2.67 | 7.34 | 1.22 | 27.59 |
| Plasmodium exported protein (PHIST) | 102.08 | 54.14 | 157.91 | 38.34 | 0.61 | 32.91 | |
| Plasmodium exported protein | 19.10 | 7.17 | 69.67 | 5.00 | 1.20 | 36.29 | |
| Sporozoite invasion-associated protein 2, putative (SIAP2) | 56.42 | 17.14 | 120.93 | 8.91 | 1.01 | 51.38 | |
| 28S ribosomal RNA | 973.82 | 1698.50 | 9873.71 | 1206.48 | 1.25 | 5380.46 | |
| 18S ribosomal RNA | 80.00 | 202.88 | 963.80 | 84.93 | 1.27 | 541.09 | |
| PVP01_0515900 | Plasmodium exported protein (PHIST) | 67.25 | 29.84 | 418.41 | 18.47 | 1.43 | 273.52 |
| PVP01_0516500 | Plasmodium exported protein (PHIST) | 15.97 | 9.14 | 57.87 | 5.37 | 1.10 | 26.64 |
| PVP01_0523200 | Plasmodium exported protein (PHIST) | 42.97 | 25.14 | 95.39 | 11.34 | 0.84 | 30.99 |
| PVP01_0524100 | Plasmodium exported protein (PHIST) | 33.26 | 13.50 | 83.18 | 14.15 | 0.91 | 29.77 |
| PVP01_0533700 | Plasmodium exported protein (PHIST) | 45.73 | 21.98 | 238.70 | 9.16 | 1.36 | 146.77 |
| PVP01_0534300 | Reticulocyte binding protein 2c (RBP2c) | 62.67 | 41.64 | 177.40 | 14.22 | 0.97 | 69.57 |
| Plasmodium exported protein (PHIST) | 24.48 | 15.11 | 74.78 | 5.13 | 1.04 | 32.09 | |
| Plasmodium exported protein (PHIST) | 54.34 | 30.64 | 161.54 | 15.87 | 1.00 | 66.20 | |
| PVP01_0623800 | Duffy binding protein (DBP) | 90.60 | 32.36 | 277.41 | 20.05 | 1.13 | 134.55 |
| PVP01_0700700 | Tryptophan-rich protein (TRAG34) | 136.11 | 84.55 | 310.05 | 52.34 | 0.79 | 90.46 |
| PVP01_0701200 | reticulocyte binding protein 1a (RBP1a) | 70.02 | 31.49 | 152.90 | 20.69 | 0.87 | 52.25 |
| PVP01_0800700 | Reticulocyte binding protein 2b (RBP2b) | 45.68 | 18.96 | 99.58 | 12.59 | 0.90 | 35.49 |
| PVP01_0808700 | Plasmodium exported protein (PHIST) | 19.84 | 6.29 | 121.89 | 10.69 | 1.39 | 76.52 |
| PVP01_0839300 | PIR protein | 68.46 | 7.34 | 5.55 | 13.42 | 1.27 | 38.08 |
| Merozoite surface protein 3 (MSP3G) | 52.07 | 67.67 | 196.89 | 43.37 | 0.80 | 57.55 | |
| merozoite surface protein 3 (MSP3.5) | 143.78 | 78.00 | 14.38 | 84.49 | 0.66 | 34.92 | |
| Merozoite surface protein 3 (MSP3.3) | 208.61 | 185.68 | 94.54 | 835.10 | 1.03 | 348.59 | |
| Merozoite surface protein 3 (MSP3.1) | 2606.85 | 1761.98 | 1889.06 | 197.99 | 0.63 | 637.85 | |
| PVP01_1033800 | Tryptophan-rich protein (TRAG17) | 402.99 | 155.61 | 853.86 | 195.32 | 0.80 | 255.09 |
| PVP01_1100400 | PIR protein | 50.33 | 167.19 | 25.26 | 55.55 | 0.85 | 53.45 |
| PVP01_1201400 | Plasmodium exported protein (PHIST) | 57.95 | 34.49 | 215.77 | 12.58 | 1.15 | 106.14 |
| MSP7-like protein (MSP7.6) | 179.79 | 177.55 | 496.23 | 76.87 | 0.78 | 142.69 | |
| MSP7-like protein (MSP7.9) | 1032.53 | 2939.70 | 1457.97 | 322.91 | 0.77 | 849.01 | |
| PVP01_1401800 | Tryptophan-rich protein (TRAG21) | 34.54 | 22.27 | 118.27 | 6.70 | 1.10 | 54.72 |
| PVP01_1402400 | Reticulocyte binding protein 2a (RBP2a) | 44.16 | 16.73 | 95.14 | 11.52 | 0.91 | 34.98 |
Figure 3Variation in RNA-seq data between the four patient isolates. (a) RNA-seq coverage data (lines in top panel) for the region on PvP01 chromosome 10, which contains the sequences of multiple MSP3 genes (shown in red on the lower panel). Levels of transcripts detected for each of the genes in this locus vary between each of the patient isolate (see key in figure to identify traces). Drops in coverage within an annotated gene model are likely to represent sequence divergence in the patient isolates from the reference genome. (b) RNA-seq coverage data (lines in top panel) for a region of PvP01 chromosome 10. The four patient isolates show very similar transcript levels for most of the protein coding genes in this region. However, an unannotated ncRNA (highlighted by a box) was only found in isolate 4, opposite to the gene PVP01_1236300.