| Literature DB >> 23825934 |
Yan Zhang1, Hye Kyong Kweon, Christian Shively, Anuj Kumar, Philip C Andrews.
Abstract
Reversible phosphorylation is one of the major mechanisms of signal transduction, and signaling networks are critical regulators of cell growth and development. However, few of these networks have been delineated completely. Towards this end, quantitative phosphoproteomics is emerging as a useful tool enabling large-scale determination of relative phosphorylation levels. However, phosphoproteomics differs from classical proteomics by a more extensive sampling limitation due to the limited number of detectable sites per protein. Here, we propose a comprehensive quantitative analysis pipeline customized for phosphoproteome data from interventional experiments for identifying key proteins in specific pathways, discovering the protein-protein interactions and inferring the signaling network. We also made an effort to partially compensate for the missing value problem, a chronic issue for proteomics studies. The dataset used for this study was generated using SILAC (Stable Isotope Labeling with Amino acids in Cell culture) technique with interventional experiments (kinase-dead mutations). The major components of the pipeline include phosphopeptide meta-analysis, correlation network analysis and causal relationship discovery. We have successfully applied our pipeline to interventional experiments identifying phosphorylation events underlying the transition to a filamentous growth form in Saccharomyces cerevisiae. We identified 5 high-confidence proteins from meta-analysis, and 19 hub proteins from correlation analysis (Pbi2p and Hsp42p were identified by both analyses). All these proteins are involved in stress responses. Nine of them have direct or indirect evidence of involvement in filamentous growth. In addition, we tested four of our predicted proteins, Nth1p, Pbi2p, Pdr12p and Rcn2p, by interventional phenotypic experiments and all of them present differential invasive growth, providing prospective validation of our approach. This comprehensive pipeline presents a systematic way for discovering signaling networks using interventional phosphoproteome data and can suggest candidate proteins for further investigation. We anticipate the methodology to be applicable as well to other interventional studies via different experimental platforms.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23825934 PMCID: PMC3694812 DOI: 10.1371/journal.pcbi.1003077
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Graphical illustration of the filamentous growth pathway in budding yeast from previous studies.
The ellipses are proteins; the rectangles are genes; and the triangles are metabolites. The linkage between shapes: sharp-end arrows indicate stimulation, T-end arrows indicate inhibition, and wavy lines indicate association. The information were extracted from Science Signaling Database of Cell Signaling [20] and KEGG database [27]. The white ellipses are five of the eight kinases selected to be mutated in our experiments.
Figure 2Summary flow chart of the analytical workflow.
Summary of the dataset and subsequent analyses.
| Summary | Number of phosphopeptides | Number of proteins |
| Identifications in the whole dataset | 3,312 | 1,063 |
| Identifications common among all 8 kinase-dead mutants (KDs) | 73 | 66 |
| Identifications common among 4–8 KDs | 882 | 486 |
| Identifications that are significant in at least 1 KD | 863 | 452 |
|
| 28(5 from complete measurements – high-confidence) | 26(5 from complete measurements – high-confidence; 17 have inner connections supported by STRING |
| High-confidence hub proteins identified from the stringent correlation network | - | 19 |
| Proteins known to be involved in filamentous growth from literature mining, and detected in our dataset | - | 20(15 of them are significant in at least 1 KD) |
Figure 3Correlation heat map of the kinase-dead mutants (log2 ratios adopted).
The hierarchical clustering tree using Spearman correlation as the similarity metric is drawn along the left side of the heatmap.
Top 8 tight clusters and functional enrichment.
| Cluster | Proteins (traced back from phosphopeptides) | Enriched terms |
| 1 | YRO2, BUG1, VPS74, HXK1, PIL1, FBP26, PTK2, NPA3, BIR1, MYO3, UTP14, ARE2, DBP5, RUD3 | Nucleotide phosphate-binding region:ATP (P-value = 6.54E-04, Benjamini = 3.4E-2) |
| Nucleotide-binding (P-value = 1.8E-3, Benjamini = 4.2E-2) | ||
| ATP-binding (P-value = 6.0E-3, Benjamini = 9.3E-2) | ||
| 2 | VMA2, SEC31, GLY1, PEA2, VTC2, KEM1, UFD1, TIF4631, BCY1, SPA2, MFT1, NEW1, KRE6 | - |
| 3 | NUP60, SLA1, STU1, YCL020W, VBA4, HOM2, YDR365W-B, VPS74, PSP1, CHD1, NUP145, SPT6, HSE1, ABF1, MEH1, CKI1, YLR413W, SPT5, HRB1, LCB4, CAF20, MRL1 | Endosome (P-value = 1.6E-3, Benjamini = 6.6E-2) |
| RNA polymerase II transcription elongation factor activity (P-value = 1.4E-3, Benjamini = 9.6E-2) | ||
| Transcription elongation regulator activity (P-value = 2.8E-3, Benjamini = 9.9E-2) | ||
| 4 | FAP7, ITR1, LSB3, LEU1, FLC3, SPT6, YGR125W, CRP1, KEL1, LCB3, YBT1, BDF1, YMR031C, DDR48, YMR295C, GPD2, ZEO1, CAF20, SNF2 | - |
| 5 | PIN4, CYC8, BUD3, LYS20, CDC34, MAK21, BFR2, SUM1, GLY1, NUP145, PRP43, SPT6, ENP2, YOR1, SSZ1, NUP2, YLR345W, SUB1, ESC1, BDP1, DCP2, RPC31, SLA2, NOP8, ALE1, MSB1, SNU66 | Nucleus (P-value = 1.0E-4, Benjamini = 3.4E-3) |
| Nuclear lumen (P-value = 3.4E-4, Benjamini = 2.7E-2) | ||
| 6 | SIF2, PPH22, VAC8, HSP12, RTF1, RSC30, TRA1, LCB3, NAP1, SIC1, RPN13, YMR196W, MRE11, MCK1, LEM3, FPK1, LSP1 | - |
| 7 | IST2, AIM3, RPC53, YDR186C, ECM32, MIG1, HXK2, VHS2, RNR2, UTR1, FBA1, EAP1, YLR257W, PFK2, PFK2, ACC1, YOR052C | Fructose and mannose metabolism (P-value = 3.0E-3, Benjamini = 3.9E-2) |
| Glycolysis (P-value = 1.6E-3, Benjamini = 4.3E-2) | ||
| Glycolysis/gluconeogenesis (P-value = 9.8E-3, Benjamini = 6.2E-2) | ||
| 8 | AKL1, IST2, MAK5, FEN1, LHP1, RPC53, SAS10, SHS1, MAK21, DOP1, GCD6, GUK1, CHO1, PDA1, LEU1, NOP7, SPT6, TFG1, HXT1, AIM21, URA2, CDC11, MAK11, VPS13, CBF5, VTA1, CRN1, YMR031C, EFR3, ADE4, NOP12, MAM3, CAF20, PEX25, TIF5 | Ribosome biogenesis (P-value = 1.0E-4, Benjamini = 5.0E-3) |
Functional enrichment P-value and Benjamini-Hochberg corrected p-value (Benjamini) were calculated with DAVID Functional Annotation Tool [33], [34]. They are given in the brackets following corresponding terms.
Benjamini <0.1,
Benjamini <0.05,
Benjamini <0.01.
All the clusters are highly enriched in the term “phosphoprotein” (not listed above).
Figure 4Extended filamentous pathway map.
The extended filamentous growth pathway map integrating the known knowledge (Figure 1) and the regulation inferred from significant differential phosphorylation in individual KDs. The inferred regulation might be direct or indirect. The ellipses are proteins; the rectangles are DNAs; and the triangles are metabolites. The linkage between shapes: sharp-end arrows indicate stimulation, T-end arrows indicate inhibition, and wavy lines indicate association. Solid lines indicate physical interactions, while dashed lines indicate changes in phosphorylation.
Globally significant phosphopeptides selected from the complete measurements (high-confidence).
| ENSEMBL ID | Standard name | Name description | Modified sequence | Stress response |
| YDR001C | NTH1 | Neutral trehalase;Alpha,alpha-trehalase;Alpha,alpha-trehalose glucohydrolase | _RGS(ph)EDDTYSSSQGNR_ | Nth1p is a multiple stress responsive protein |
| YNL015W | PBI2 | Protease B inhibitors 2 and 1;Proteinase inhibitor I(B)2 | _HNDVIENVEEDKEVHT(ph)N_ | Pbi2 gene deletion leads to decreased resistance to hyperosmotic stress |
| YOR220W | RCN2 | Regulator of calcineurin 2;Weak suppressor of PAT1 ts protein 1 | _NKPLLSINT(ph)DPGVTGVDSSSLNK_ | Rcn2p is Induced in response to DNA-damaging agent methyl methanesulphonate |
| YPL058C | PDR12 | ATP-dependent permease PDR12 | _HLSNILS(ph)NEEGIER_ | Pdr12 is strongly induced by weak acid stress |
| YDR171W | HSP42 | Heat shock protein 42 | _KS(ph)S(ph)SFAHLQAPSPIPDPLQVSKPETR_ | Protein expression is induced by stresses such as heat shock, salt shock and starvation |
Annotated with MaxQuant.
Figure 5Stringent correlation network of phosphoprotein pairs.
Red lines indicate positive correlations, while black lines indicate negative correlations. The larger the node size, the greater the degree of connectivity.
Focus proteins used for causal relationship discovery.
| Mutated kinases | Globally significant (high-confidence) | Hub proteins (high-confidence) | From literature mining and detected in our dataset |
| (also see | (also see Table S2 in | (also see Table S1 in | |
| KSP1 | NTH1 | SEC21 | BCY1 |
| KSS1 | PBI2 | ABF1 | BMH1 |
| SKS1 | RCN2 | ARE2 | BUD2 |
| STE20 | PDR12 | DCP2 | CDC28 |
| SNF1 | HSP42 | KEM1 | CYR1 |
| TPK2 | NUP145 | DIG1 | |
| ELM1 | SPA2 | DIG2 | |
| FUS3 | CHO1 | FLO8 | |
| GLY1 | GPR1 | ||
| HSP42 | KEM1 | ||
| PWP1 | NRG1 | ||
| PUF6 | PEA2 | ||
| SPT6 | RAS2 | ||
| SSD1 | SFL1 | ||
| SUM1 | SNF1 | ||
| NUP2 | SPA2 | ||
| PBI2 | STE20 | ||
| PBP1 | STE50 | ||
| UFD1 | TPK3 | ||
| TPM1 |
Figure 6Causal Bayesian network.
Each edge indicates a potential causal influence between proteins, which might be a direct or indirect influence. It does not distinguish activation and inhibition. The thicker the edge, the higher the posterior probability.
Figure 7Phenotypic analysis of genes predicted to contribute to the yeast filamentous response.
The genes PBI2, PDR12, RCN2, and NTH1 were deleted in a haploid strain of the filamentous ∑1278b genetic background by targeted replacement with the G418-encoding kanMX6 cassette. The resulting strains were grown 3–4 days under normal growth conditions, and invasive growth was assayed in these strains and in a wild-type strain according to standard protocols using a plate-washing assay. Deletion of PBI2 resulted in decreased invasive growth upon plate washing, and strains deleted for PDR12, RCN2, and NTH1 yielded hyperactive invasive growth. The increased invasive growth in these strains was most clearly evident in patched cultures as shown.
Ratio lists for two representative phosphopeptides from the ratio matrix.
| Phosphopeptide | Phosphorylation fold-changes in following KD-vs-WT conditions | |||||||
| Sks1-KD vs. WT | Ste20-KD vs. WT | Snf1-KD vs. WT | Tpk2-KD vs. WT | Elm1-KD vs. WT | Fus3-KD vs. WT | Kss1-KD vs. WT | Ksp1-KD vs. WT | |
| ADDEEDLS(ph)DENIQPELR | 0.72 | 0.71 | 0.70 | 0.52 | 1.0 | 0.88 | 0.83 | 0.86 |
| ADGTGEAQVDNS(ph)PTTESNSR | 2.3 | 3.7 | 2.1 | 2.2 | 0.33 | 0.58 | 0.75 | 0.69 |
Phosphorylation level of each phosphopeptide is represented in a list of ratios. We used the peptide ratios provided by the MaxQuant output, which have been normalized for each LC-MS/MS run [30]. The significance B values provided by MaxQuant are not shown here. For the cluster analysis, if a phosphopeptide is detected multiple times under the same KD-versus-WT condition, the median of all its ratios are taken. S(ph) or T(ph) indicates that the specific amino acid, serine or threonine, is phosphorylated, respectively.