| Literature DB >> 32550551 |
Manoswini Dash1, Veena Pande2, Abhinav Sinha1.
Abstract
Amidst technical challenges which limit successful culture and genetic manipulation of P. vivax parasites, we used a computational approach to identify a critical target with evolutionary significance. The putative circumsporozoite protein on chromosome 13 of P. vivax (PvpuCSP)is distinct from the well-known vaccine candidate PfCSP. The aim of this study was to understand the role of PvpuCSP and its relatedness to the well-known CSP. The study revealed PvpuCSP as a membrane bound E3 ubiquitin ligase involved in ubiquitination. It has a species-specific tetra-peptide unit which is differentially repeated in various P. vivax strains. The PvpuCSP is different from CSP in terms of stage-specific expression and function. Since E3 ubiquitin ligases are known antimalarial drug targets targeting the proteasome pathway, PvpuCSP, with evolutionary connotation and a key role in orchestrating protein degradation in P. vivax, can be explored as a potential drug target.Entities:
Keywords: Circumsporozoite protein; Computational analysis; Hypothetical protein; Malaria; Plasmodium vivax; Tandem repeat
Year: 2019 PMID: 32550551 PMCID: PMC7285988 DOI: 10.1016/j.gene.2019.100024
Source DB: PubMed Journal: Gene X ISSN: 2590-1583
Fig. 1Schematic diagram of PvpuCSP and multiple sequence alignment of PvpuCSP orthologs. (A) Transmembrane domains (orange), transcription termination factor Rho (green) and RING zinc finger domain (blue) are marked according to their corresponding amino acid position in PvpuCSP. The repeat region (marked with dotted diagonal lines) is predicted to lie within the transcription termination factor Rho. The above figure was prepared using DOG (Domain Graph) version 1.0 (Ren et al., 2009; Liu et al., 2015). (B-i) Multiple sequence alignment of PvpuCSP and its Plasmodium orthologs showing the R[D/C]NA repeat region. (B-ii) Multiple sequence alignment of PvpuCSP from different strains of P. vivax available, showing variation in the number of repeats. Only repeat region of the multiple sequence alignment has been shown in the above figure. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
List of tools available in public resources (and their references) used to predict in and out the orientation of N- and C-terminals, number of transmembrane domains and their corresponding positions in PvpuCSP are mentioned.
| Transmembrane prediction servers and references | Orientation of N-terminal | Orientation of C-terminal | Number of predicted TM domains | The position of TM domains with respect to | ||||
|---|---|---|---|---|---|---|---|---|
| TM1 | TM2 | TM3 | TM4 | TM5 | ||||
| HMMTOP | Inside | Outside | 5 | 239–256 | 277–299 | 316–333 | 338–355 | 362–379 |
| MEMSAT | Outside | Inside | 1 | 237–258 | – | – | – | – |
| OCTOPUS | Inside | Inside | 4 | 237–257 | 274–294 | 311–331 | 345–375 | – |
| PHILLIUS | Inside | Inside | 4 | 237–258 | 273–295 | 314–333 | 351–378 | – |
| PHOBIUS | Outside | Inside | 5 | 240–261 | 273–294 | 314–333 | 340–357 | 363–379 |
| POLYPHOBIUS | Outside | Inside | 5 | 239–261 | 273–295 | 314–333 | 338–357 | 362–379 |
| SCAMPI | Outside | Inside | 5 | 238–258 | 274–294 | 314–334 | 337–357 | 360–380 |
| SPOCTOPUS | Inside | Inside | 4 | 237–257 | 274–294 | 311–331 | 345–375 | – |
| S-TMHMM | Inside | Inside | 4 | 237–257 | 273–293 | 313–334 | 351–378 | – |
| TMHMM | Inside | Outside | 5 | 236–258 | 273–295 | 316–333 | 338–357 | 362–379 |
| TOPCONS | Inside | Inside | 4 | 237–257 | 274–294 | 313–333 | 351–371 | – |
| TOPPRED | Inside | Inside | 4 | 237–257 | 273–293 | 314–334 | 351–371 | – |
Fig. 2Reconstruction of phylogeny using PvpuCSP orthologs sharing similar domain. The orthologous proteins obtained from DELTA-BLAST using PvpuCSP (with >35% identity) as query inferred using maximum likelihood method. Evolutionary distances were computed using the JTT method, with bootstrap support at each join based on 1000 replicates. The size of the circles on the branch represents the bootstrap value.
Fig. 3Phylogenetic tree of PvpuCSP orthologs across species with similar domain architecture (Rho and RING-H2_PA-TM-RING domain-containing protein). The phylogeny was reconstructed using the maximum likelihood method. The tree was designed and color coded using a web-based tool, Interactive Tree Of Life (iTOL https://itol.embl.de/) (Letunic and Bork, 2016).
Fig. 4Expression profiling and PPI network of PvpuCSP (A-i) The Log2 FPKM value of PvpuCSP (PVX_086150) and well-known CSP (PVX_119355) was plotted at 6 time points during the intra-erythrocytic life cycle. The dots in the box plot show the exact Log2FPKM value of each protein at a different time point of erythrocytic life cycle. (A-ii) MOID normalized expression value of known stage-specific proteins (well-known CSP for sporozoite stage; AMA1 for erythrocytic stage and Pvs25 for gametocyte stage) were plotted for all parasitic stages (in human and mosquitoes). The trend lines show the exponential change in expression value of PvpuCSP and well-known CSP. (B) PPI network of PvpuCSP and interacting DEGS, where size and color represent the degree of the node (bigger size and dark color to the nodes with a higher degree). The expression data generated by Westenberger et al. was used for the above network.
List of DEGs co-expressed with PvpuCSP (PVX_086150) obtained from the STRING enrichment analysis in Cytoscape, using expression data generated by Westenberger group.
| PlasmoDB ID | UniProt ID | Description of the protein | Degree | GO term | GO description |
|---|---|---|---|---|---|
| Polyubiquitin 5 | 549 | GO:PM16005087 | Polyubiquitin 5, putative | ||
| DNA repair protein RAD23 | 505 | GO:0020002 | Host cell plasma membrane | ||
| Ubiquitin domain containing protein | 499 | GO:0008104 | Protein localization | ||
| Ubiquitin domain containing protein | 491 | GO:0006511 | Ubiquitin-dependent catabolic process | ||
| DNA-directed RNA polymerase, beta subunit | 464 | GO:GNF0218 | Merozoite development: | ||
| DNA-directed RNA polymerase III subunit | 433 | GO:0043064 | DNA-directed RNA polymerase III subunit, putative | ||
| K02927 large subunit ribosomal protein L40e | 370 | GO:0003735 | Structural constituent of ribosome | ||
| Ubiquitin-conjugating enzyme E2 I | 232 | GO:0004148 | Dihydrolipoyl dehydrogenase activity | ||
| Ubiquitin-conjugating enzyme E2 | 232 | GO:0009308 | Amine metabolic process | ||
| Ubiquitin-conjugating enzyme E2 4 | 232 | GO:0015935 | Ubiquitin-conjugating enzyme E2 4, putative | ||
| Ubiquitin-conjugating enzyme E2 | 232 | NA | NA | ||
| Ubiquitin-conjugating enzyme family protein | 232 | NA | NA | ||
| Ubiquitin conjugating enzyme E2 | 228 | GO:0003735 | Structural constituent of ribosome | ||
| Ubiquitin conjugating enzyme | 225 | GO:0009308 | Amine metabolic process | ||
| Ubiquitin-conjugating enzyme E2 | 216 | NA | NA | ||
| Ubiquitin-conjugating enzyme E2 | 213 | GO:0015935 | Ubiquitin-conjugating enzyme E2, putative | ||
| RING-box protein HRT1 | 170 | GO:PM16005087 | NA | ||
| Hypothetical protein | 158 | NA | NA | ||
| Hypothetical protein | 151 | GO:0006261 | DNA-dependent DNA replication | ||
| Hypothetical protein | 147 | GO:0004221 | Ubiquitin thiolesterase activity | ||
| Hypothetical protein | 146 | GO:0045047 | Protein targeting to ER | ||
| Ubiquitin-like protein | 142 | GO:CCYCL09 | NA | ||
| Guanidine nucleotide exchange factor | 141 | GO:0004540 | Ribonuclease activity | ||
| Hypothetical protein | 140 | NA | NA | ||
| Hypothetical protein | 139 | NA | NA | ||
| Hypothetical protein | 139 | NA | NA | ||
| S-phase kinase-associated protein 1A | 124 | GO:0044085 | Cellular component biogenesis | ||
| Hypothetical protein | 114 | NA | NA | ||
| 93 | GO:PM15774020 | NA | |||
| 62 | NA | NA | |||
| Subtilisin-like protease 2 | 61 | GO:0031124 | mRNA 3′-end processing | ||
| Subtilisin-like 1 serine protease | 57 | GO:PM15032632 | NA | ||
| Subtilisin-like serine protease | 53 | GO:GNF0218 | Merozoite development | ||
| K13989 Derlin-2/3 | 51 | GO:CCYCL09 | NA | ||
| Hypothetical protein | 50 | GO:CCYCL01 | NA | ||
| Circumsporozoite protein | 37 | GO:0009308 | Amine metabolic process | ||
| Hypothetical protein | 16 | NA | NA | ||
| Hypothetical protein | 8 | NA | NA |
GO, Gene Ontology; NA, not available.
Model validation report of PvpuCSP models generated by homology modeling and fold recognition methods.
| Servers | Parameters checked | I-TASSER | Orion | SPARK-X | Modeller |
|---|---|---|---|---|---|
| Verify 3D | Averaged 3D-1D score > 0.2 | 65.22% | 48.57% | 59.13% | 42.22% |
| PROCHECK | Residues in most favoured region | 62.2% | 78.9% | 76.9% | 61.9% |
| Additionally allowed region | 30.3% | 16.7% | 17% | 26.8% | |
| Generously allowed region | 5.2% | 2.3% | 3.5% | 6.6% | |
| Disallowed region | 2.3% | 2.2% | 2.6% | 4.7% | |
| ERRAT | Overall quality factor | 84.583 | 21.216 | 11.44 | 22.8532 |
| ProSA | Z score | −0.0369 | 1.5 | −0.07 | −0.02 |
| Modeller | DOPE score | −58,795.65234 | −43,805.5391 | −46,093.4375 | −36,789.57422 |
| GA-341 score | 1 | 0.041819 | 0.961289 | 0.07887 |
Fig. 5Cartoon representation of the model of PvpuCSP. (A) The backbone of the model of PvpuCSP is rainbow colored from blue to red, from the N- to C-terminal. (B) Domain architecture of PvpuCSP showing five transmembrane domains (236–258, 277–299, 316–333, 338–355 and 362–379 aa), transcription termination factor–Rho (420–616 aa) and RING (Really Interesting New Gene) zinc finger domain (689–733 aa) marked in blue, cyan blue and green respectively. (C) Homology model RING zinc finger domain of PvpuCSP showing characteristics alpha helix, small beta strands, and variable length loops. Zinc atoms are shown as silver sphere and residues interacting with zinc atoms are shown with their corresponding amino acid position. It has C3H2C3 motif to bind with two zinc ions.
Note: The predicted tertiary model carries two long IDRs, tertiary conformation of which couldn't be accurately predicted. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 6(A) Composition profiling analysis of PvpuCSP by comparing against SwissProt 51 as reference data set. Amino acids are arranged according to their hydrophobicity (Kyte-Doolittle) and color coded as per disorder propensity. Blue, red and grey color bar represents order-promoting, disorder-promoting and neutral amino acid residues, respectively. Positive and negative values represent the enriched and depleted amino acids in the PvpuCSP, respectively. (B) Prediction of natural disordered region in PvpuCSP using PONDR-FIT and PONDR VL-XT. PONDR score below 0.5 is considered suggestive of ordered protein. MoRFs prediction using ANCHOR is shown as purple line. Residues with ANCHOR score above 0.5 are predicted as possible binding site for substrates. (C) Charge-hydropathy plot of PvpuCSP with hydropathy value ranging from 0 to 1. It compares the mean net charge and hydropathy. Red and blue color dots represent the disordered and ordered residues, respectively and the PvpuCSP is represented in green. (D) CDF plot of PvpuCSP using cumulative histogram of the PONDR VL-XT score as an input. Black line with dots represents the boundary line and green line shows the cumulative distribution fraction of amino acids in PvpuCSP. Location of PvpuCSP curve below the boundary line suggest it as a disordered protein as a whole. It is to be specially noted that the tertiary model shown in the figure carries two long IDRs, tertiary conformation of which couldn't be accurately predicted. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)