| Literature DB >> 28744484 |
S Wuchty1,2,3,4, S V Rajagopala5, S M Blazie5, J R Parrish6, S Khuri1,2, R L Finley6, P Uetz7.
Abstract
The functions of roughly a third of all proteins in Streptococcus pneumoniae, a significant human-pathogenic bacterium, are unknown. Using a yeast two-hybrid approach, we have determined more than 2,000 novel protein interactions in this organism. We augmented this network with meta-interactome data that we defined as the pool of all interactions between evolutionarily conserved proteins in other bacteria. We found that such interactions significantly improved our ability to predict a protein's function, allowing us to provide functional predictions for 299 S. pneumoniae proteins with previously unknown functions. IMPORTANCE Identification of protein interactions in bacterial species can help define the individual roles that proteins play in cellular pathways and pathogenesis. Very few protein interactions have been identified for the important human pathogen S. pneumoniae. We used an experimental approach to identify over 2,000 new protein interactions for S. pneumoniae, the most extensive interactome data for this bacterium to date. To predict protein function, we used our interactome data augmented with interactions from other closely related bacteria. The combination of the experimental data and meta-interactome data significantly improved the prediction results, allowing us to assign possible functions to a large number of poorly characterized proteins.Entities:
Keywords: functional prediction; protein-protein interactions
Year: 2017 PMID: 28744484 PMCID: PMC5513735 DOI: 10.1128/mSystems.00019-17
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1 Characteristics of bacterial interactomes. (A) We schematically show the combined network of S. pneumoniae protein interactions detected by yeast two-hybrid (light gray edges) or microfluidics (dark gray edges). Proteins with known function are colored green; the functions were unknown (red nodes) for 342 (37.2%) of the 918 proteins in the network. (B) To augment our network of S. pneumoniae protein interactions, we utilized interactomes from other bacteria. The numbers of PPIs are shown for interactions where both proteins have (green) or only one protein has (orange) S. pneumoniae orthologs. (C) Considering all genes in S. pneumoniae without known functions, we calculated the number of interaction partners (degree) in the original S. pneumoniae network (left) and in the network augmented with the meta-interactome (right). Proteins with higher degrees mostly benefited from the addition of interologs. (D) As an example, SP_1876 interacted with 6 S. pneumoniae proteins (circles) plus another 6 B. subtilis proteins (squares) in the augmented network. As a result, functions of interaction partners of SP_1876 mostly revolve around transcription, signal transduction, and posttranslational (Posttransl.) modifications (based on EggNOG; see Materials and Methods and Discussion).
Meta-interactome data can improve functional predictions—an example
| SP_1876 interacts with: | Description | Function |
|---|---|---|
| Exonuclease | Replication | |
| Glutamyl-tRNA(Gln) amidotransferase | Translation | |
| Chromosome segregation protein | Cell cycle | |
| Segregation and condensation protein B | Transcription | |
| Orotidine 5′-phosphate decarboxylase | Nucleotide transport | |
| Chaperonin | Posttranslational modification | |
| KinC | Sporulation kinase C | Signal transduction |
| YdeL | HTH-type transcriptional regulator | Transcription |
| YxaD | HTH-type transcriptional regulator | Transcription |
| DegS | Signal transduction histidine-protein kinase/phosphatase | Posttranslational modification |
| YhcY | Sensor histidine kinase | Signal transduction |
| TlpA | Methyl-accepting chemotaxis protein | Inorganic ion transport |
A protein of unknown function interacts with 6 proteins in our primary Y2H data set (bold), but addition of meta-interactions from other species is required for indication of a role in transcription, signal transduction, and posttranslational modifications. Locus and protein names are from UniProt (46) and KEGG (47); annotations and functions are from EggNOG (24).
FIG 2 Impact of the bacterial meta-interactome on protein function prediction. (A) To assess the quality of our classification procedure, we randomly sampled 20% of all functionally annotated proteins in S. pneumoniae and utilized the remainder to predict their functions. To measure prediction quality, we calculated the area under the ROC curve, suggesting that the addition of the bacterial meta-interactome allowed better functional prediction (P < 10−50; Student’s t test). (B) We calculated the fraction of correctly predicted protein functions as a function of the degree in the original protein interaction network of S. pneumoniae. The inset shows the enrichment (enr.) of accuracy (lg2 for the fraction of correctly predicted functions in the original network over the fraction for the augmented network) for each degree, showing that the prediction for proteins with a low degree was improved by adding the meta-interactome. (C) Considering the s-index, predictions of functions appeared more homogeneous with respect to the meta-interactome and increasing degree values. (D) We considered all randomized samples and calculated the mean s-indices of each gene in both the original S. pneumoniae network and the augmented network. In the scatterplot, the homogeneity of the functional prediction of the majority of genes benefitted from inclusion of the bacterial meta-interactome. freq., frequency. (E) In each sample, we determined the most probable function for each gene. Counting the occurrence of transitions between such functions in the original S. pneumoniae network and the augmented network, we largely found that the functions predicted in the original network corresponded to the same functions in the augmented network.
FIG 3 Functional prediction of unknown proteins in S. pneumoniae. Augmenting the network of protein interactions of S. pneumoniae with interactions of other bacteria, we predicted the functions of 299 proteins with unknown or poorly characterized functions (FDR = <0.05). We annotated each protein with the difference in the s-index value, deducting the corresponding value in the original network of interactions in S. pneumoniae and the value in the augmented network. For example, SP_1876 had a 66% chance (FDR = <0.05) of being involved in transcriptional activities. Scores for each protein and the corresponding functional prediction are provided in Table S4.