| Literature DB >> 34494883 |
Lisette Meerstein-Kessel1,2, Cas Boshoven3, Selma L van Esveld1,4, Jochem F Baaij1, Konstantin Barylyuk5, Jordy P M Coolen3, Joeri van Strien1, Ronald A J Duim1, Bas E Dutilh1,6, Daniel R Garza1,7, Marijn Letterie3, Nicholas I Proellochs3, Michelle N de Ridder3, Prashanna Balaji Venkatasubramanian1, Laura E de Vries3, Ross F Waller5, Taco W A Kooij3, Martijn A Huynen1,4.
Abstract
Plasmodium species have a single mitochondrion that is essential for their survival and has been successfully targeted by antimalarial drugs. Most mitochondrial proteins are imported into this organelle, and our picture of the Plasmodium mitochondrial proteome remains incomplete. Many data sources contain information about mitochondrial localization, including proteome and gene expression profiles, orthology to mitochondrial proteins from other species, coevolutionary relationships, and amino acid sequences, each with different coverage and reliability. To obtain a comprehensive, prioritized list of Plasmodium falciparum mitochondrial proteins, we rigorously analyzed and integrated eight data sets using Bayesian statistics into a predictive score per protein for mitochondrial localization. At a corrected false discovery rate of 25%, we identified 445 proteins with a sensitivity of 87% and a specificity of 97%. They include proteins that have not been identified as mitochondrial in other eukaryotes but have characterized homologs in bacteria that are involved in metabolism or translation. Mitochondrial localization of seven Plasmodium berghei orthologs was confirmed by epitope labeling and colocalization with a mitochondrial marker protein. One of these belongs to a newly identified apicomplexan mitochondrial protein family that in P. falciparum has four members. With the experimentally validated mitochondrial proteins and the complete ranked P. falciparum proteome, which we have named PlasmoMitoCarta, we present a resource to study unique proteins of Plasmodium mitochondria. IMPORTANCE The unique biology and medical relevance of the mitochondrion of the malaria parasite Plasmodium falciparum have made it the subject of many studies. However, we actually do not have a comprehensive assessment of which proteins reside in this organelle. Many omics data are available that are predictive of mitochondrial localization, such as proteomics data and expression data. Individual data sets are, however, rarely complete and can provide conflicting evidence. We integrated a wide variety of available omics data in a manner that exploits the relative strengths of the data sets. Our analysis gave a predictive score for the mitochondrial localization to each nuclear encoded P. falciparum protein and identified 445 likely mitochondrial proteins. We experimentally validated the mitochondrial localization of seven of the new mitochondrial proteins, confirming the quality of the complete list. These include proteins that have not been observed mitochondria before, adding unique mitochondrial functions to P. falciparum.Entities:
Keywords: Bayesian data integration; Plasmodium; mitochondria
Mesh:
Substances:
Year: 2021 PMID: 34494883 PMCID: PMC8550323 DOI: 10.1128/mSphere.00614-21
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
Input data sets for Bayesian integration
| Dataset | Abbreviation used in figures | Size of data set | Fraction identified (%) | Fraction gold standard identified (%) |
|
|---|---|---|---|---|---|
| Data set based on omics data | |||||
| Coexpression, built using in-house developed WICCA tool (rank ≤ 2400) ( | Coexpression | 5,310 | 45.6 | 83.0 | 2.0E−08 |
| Non-apicoplast proteins ( | Apicoplast | 5,324 | 93.6 | 94.4 | 5.4E−01 |
| Data sets based on ortholog evidence | |||||
| hyperLOPIT (TAGM.MCMC.joint.mitochondrion_max ≥ 4.78E-05) ( | hyperLOPIT | 2,246 | 15 | 95.2 | 9.3E−32 |
| Mitochondrial ortholog | Mito ortholog | 5,280 | 8.7 | 75.5 | 2.6E−32 |
| Evolutionary inference, built using CLIME (NN score ≤ −1.5) ( | CLIME | 4,687 | 9.0 | 38.5 | 6.9E−09 |
| Absence of a | Crypto ortholog | 5,280 | 67.5 | 77.4 | 7.9E−02 |
| Data sets based on amino acid sequence evidence | |||||
| Mitochondrial targeting signal, built using PFMpred tool (SVM ≥ −0.67) ( | Mitochondrial TS | 5,324 | 59.5 | 83.3 | 1.4E−04 |
| Isoelectric point (Patrickios pI ≥ 4.2) ( | pI | 5,324 | 55.1 | 83.3 | 1.1E−05 |
FIG 1Predictive values of the individual data sets and the integrated ranked list. (A) Bar plot of each data set with the fraction of the gold standards on the y axis and the bins on the x axis. The mitochondrial score per bin (green) is given to show the predictive value of each bin. (B) ROC-curves comparing the performance to identify mitochondrial proteins of the prediction (10-fold cross-validated) to the performance of the individual data sets. For comparison, the values for the area under each curve are given in the legend. (C) Box plot that visualizes the distribution of the gold standard positives, negatives, and remainder of the proteome (x axis) over the calculated mitochondrial score of the prediction (y axis).
FIG 2Integration produces a ranked proteome with 445 likely mitochondrial proteins at a 25% cFDR cutoff. (A) Density plot of the mitochondrial score with a colored bar at the bottom indicating the scores of individual gold standards. The arrow indicates the 25% cFDR cutoff. (B) The top 445 proteins falling in the 25% cFDR cutoff were ranked on their mitochondrial score, with color indicating if the protein is part of a gold standard. Black arrows indicate the rankings of the seven candidate proteins that we experimentally confirmed to be mitochondrial (Fig. 3A and B), dark gray arrows indicate the rankings of the three proteins with suggested mitochondrial localization (Fig. 3C), and light gray arrows indicate the rankings of the three proteins with inconclusive results (Fig. S4B). (C) Categorical representation of the 445 proteins identified at the 25% cFDR cutoff. First, genes were separated into categories based on the gold standard; the remainder were separated based on mitochondrial and apicoplast GO terms, and subsequently, the remainder were separated based on gene function annotations.
FIG 3Validation of mitochondrial candidate proteins. Immunofluorescent analysis of tagged candidate proteins of interest (POI) in P. berghei parasites (PBANKA IDs are indicated on the left). POI were stained with anti-HA antibody (red; first columns), mitochondria were stained with anti-GFP antibody binding to the organelle marker (Mito; green; second columns). The DNA was stained with DAPI (blue; merge; third columns). (A) Representative images of candidate proteins tagged with an mOrange-3×HA tag, imaged on a SP8 confocal microscope (Leica). (B and C) Representative images of candidate proteins tagged with a linker-3×HA tag, imaged on an LSM900 confocal microscope (Zeiss). (C) Images with the brightest signal for three candidate proteins with low expression. Pixel intensities of the POI and mitochondrial signal were normalized and plotted over the line indicated in the merge panel. Bar, 2 μm.