Nagendra K Chaturvedi, Riyaz A Mir, Vimla Band, Shantaram S Joshi, Chittibabu Guda.
Abstract
BACKGROUND: Computational methods have been widely used for the prediction of protein subcellular localization. However, these predictions are rarely validated experimentally and, as a result, remain questionable. Experimental validation of the predicted localizations is therefore needed to assess the accuracy of predictions, so that such methods can be confidently used to annotate proteins of unknown localization. Previously, we published a method called ngLOC that predicts the localization of proteins targeted to ten different subcellular organelles. In this short report, we describe the accuracy of these predictions using experimental validations.
Year: 2014 PMID: 25510246 PMCID: PMC4301851 DOI: 10.1186/1756-0500-7-912
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Figure 1. The cartoon shows the research strategy used to experimentally validate the predicted subcellular localization of proteins. Genes of interest were cloned into the pEGP-N1 vector as GFP fusions. These GFP-tagged genes, along with the corresponding location-specific RFP-tagged marker genes, were transiently co-transfected by a liposome-mediated method into two normal breast and two breast cancer cell lines. After 24 hours of transfection and protein expression, subcellular localization was determined by live-cell imaging/confocal microscopy. Localization to each compartment was confirmed by observing colocalization of the GFP- and RFP-tagged proteins, which appears yellow upon merging the green and red images.
Figure 2. Experimental validation of the predicted localization of human proteins. GFP-tagged full-length genes of target proteins and RFP-tagged compartment-specific genes were transiently co-expressed in two normal and two breast cancer cell lines. Subcellular localization of the transiently expressed proteins was determined under a confocal microscope (40X). To facilitate visualization of the predicted subcellular localization, the specific RFP-tagged protein marker or dye for each localization was used in colocalization studies. Hoechst (nuclear dye) was used in all experiments. This figure shows a representative observation of colocalization in MCF-7 cells for each of the nine subcellular compartments used for validation in this study.
Table 1. Experimental validation of ngLOC-predicted protein subcellular localization
| Protein | Prediction | Validation | Protein | Prediction | Validation |
|---|---|---|---|---|---|
| NR2F1 | NUC | Yes | FKBP7 | END | Yes |
| LMO2 | NUC | Yes | ZFAN2B | END | Yes |
| LEF1 | NUC | Yes | USMG5 | MIT | Yes |
| U2AF1L4 | NUC | No | UQCR10 | MIT | Yes |
| KLF7 | NUC | Yes | COX6B1 | MIT | Yes |
| PHF5A | NUC | No | BRP44L | MIT | Yes |
| LMO1 | NUC | Yes | UCP3 | MIT | Yes |
| HMNG4 | NUC | Yes | SFXN1 | MIT | Yes |
| SCNM1 | NUC | Yes | NDUFS8 | MIT | Yes |
| SNRNP27 | NUC | Yes | ATP5S | MIT | Yes |
| SSX3 | NUC | Yes | COX7C | MIT | Yes |
| LMO1 | NUC | Yes | MRPL30 | MIT | Yes |
| PRKRIP1 | NUC | Yes | MRPL15 | MIT | Yes |
| HNRNPCL1 | NUC | Yes | PHB | MIT | Yes |
| MAB21L1 | NUC | Yes | MRPL53 | MIT | No |
| VGLL2 | NUC | Yes | MRPS24 | MIT | Yes |
| AES | NUC | Yes | MRPL10 | MIT | Yes |
| ST13 | CYT | Yes | MRPL2 | MIT | Yes |
| MLST8 | CYT | Yes | COX7B | MIT | Yes |
| GNPDA2 | CYT | Yes | MRPL51 | MIT | No |
| CARD17 | CYT | No | MRP63 | MIT | Yes |
| RAC1 | CYT | No | COQ9 | MIT | Yes |
| FKBP1B | CYT | Yes | LGMN | LYS | Yes |
| SPRR2F | CYT | Yes | CTSL2 | LYS | Yes |
| SPRR2G | CYT | Yes | MMD | LYS | Yes |
| NUDT10 | CYT | Yes | RAB7A | LYS | Yes |
| PCTP | CYT | No | ITM2C | LYS | Yes |
| PEBP1 | CYT | No | DECR2 | POX | Yes |
| RPL36AL | CYT | No | ZADH2 | POX | Yes |
| GST5A | CYT | No | PXMP4 | POX | Yes |
| OTUB1 | CYT | Yes | TUBAL3 | CSK | Yes |
| PCMT1 | CYT | No | TMSB15A | CSK | No |
| PGPEP1 | CYT | No | DYNLL2 | CSK | No |
| PGPEP1-2 | CYT | No | ACTBL2 | CSK | Yes |
| PMP2 | CYT | No | CAPZB | CSK | Yes |
| PPIAL4A | CYT | No | TMSB4Y | CSK | No |
| UBE2K | CYT | Yes | CAPZA1 | CSK | Yes |
| UXS1 | GOL | Yes | ANKRA2 | CSK | Yes |
| UGCG | GOL | Yes | PDLIM1 | CSK | Yes |
| GKAP1 | GOL | Yes | TRIM54 | CSK | Yes |
| HS2ST1 | GOL | Yes | PNP | CSK | Yes |
| GKAP1-2 | GOL | Yes | CDC42EP5 | CSK | Yes |
| GCNT2 | GOL | Yes | SEP3 | CSK | Yes |
| GABRAPL2 | GOL | No | ACTRT3 | CSK | Yes |
| ZDHHC3 | GOL | Yes | NABP | PLA | Yes |
| ST6SIA1 | GOL | Yes | GNAS | PLA | Yes |
| SACM1L | END | Yes | KCNIP2 | PLA | Yes |
| SEC11A | END | Yes | MOG | PLA | Yes |
| SEC11C | END | Yes | CD8B | PLA | Yes |
| CNPY3 | END | Yes | CACNG4 | PLA | Yes |
| CNPY3-2 | END | Yes | IFITM2 | PLA | No |
| SEC61G | END | Yes | STOML3 | PLA | Yes |
| DGAT2 | END | Yes | RTP1 | PLA | Yes |
| ASPH | END | Yes | ABHD6 | PLA | Yes |
| MEST | END | Yes | RASD2 | PLA | Yes |
| DOLPP1 | END | Yes | TMEM68 | PLA | Yes |
| POFUT1 | END | Yes | RHOV | PLA | Yes |
Figure 3. Graph showing the total number of proteins tested and the number in agreement with the predicted localization for each subcellular location. This graph was generated from the data provided in Table 1.
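As an illustration of how the per-location counts behind such a graph could be tallied, the following is a minimal sketch in Python. It is not the authors' procedure. The function name `tally` and the four-row excerpt are assumptions made for this example; the rows themselves are copied from the validation table above, where each row carries two (protein, prediction, validation) triples side by side.

```python
from collections import Counter

# Four rows copied from the validation table (an excerpt, not the full data).
# Each row holds two (protein, prediction, validation) triples side by side.
ROWS = """\
| NR2F1 | NUC | Yes | FKBP7 | END | Yes |
| U2AF1L4 | NUC | No | UQCR10 | MIT | Yes |
| ST13 | CYT | Yes | MRPL2 | MIT | Yes |
| CARD17 | CYT | No | MRP63 | MIT | Yes |
"""


def tally(table_text):
    """Hypothetical helper: count tested and validated proteins per location."""
    tested, agreed = Counter(), Counter()
    for line in table_text.strip().splitlines():
        # Drop the outer pipes, then split the row into its cells.
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Walk the cells three at a time: protein, prediction, validation.
        for i in range(0, len(cells) - 2, 3):
            protein, location, validation = cells[i:i + 3]
            tested[location] += 1
            if validation == "Yes":
                agreed[location] += 1
    return tested, agreed


tested, agreed = tally(ROWS)
for loc in tested:
    print(f"{loc}: {agreed[loc]}/{tested[loc]} in agreement")
```

For the four rows in this excerpt, the sketch reports 1 of 2 for NUC, 1 of 1 for END, 3 of 3 for MIT, and 1 of 2 for CYT. Passing all rows of the table to the same function would give the per-location totals.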
Table 2. Experimental validation of the second-ranked ngLOC predictions of protein subcellular localization
| Protein | First Prediction | Second Prediction | Validation for First prediction | Validation for Second prediction |
|---|---|---|---|---|
| U2AF1L4 | NUC | CYT | | |
| CARD17 | CYT | NUC | | |
| RAC1 | CYT | PLA | No | No |
| PCTP | CYT | NUC | | |
| PEBP1 | CYT | PLA | No | No |
| RPL36AL | CYT | NUC | | |
| GST5A | CYT | NUC | | |
| PCMT1 | CYT | NUC | | |
| PGPEP1 | CYT | NUC | | |
| PMP2 | CYT | NUC | | |
| PPIAL4A | CYT | MIT | No | Yes |
| GABRAPL2 | GOL | CSK | No | Yes |
| MRPL51 | MIT | CYT | No | No |
| MRPL53 | MIT | CYT | No | No |
| DYNLL2 | CSK | NUC | | |
| TMSB4Y | CSK | CYT | | |
| TMSB15A | CSK | NUC | | |
| PXMP4 | POX | PLA | Yes | No |
| MMD | LYS | PLA | Yes | No |
| RAB7A | LYS | PLA | Yes | No |
| SEC61G | END | MIT | Yes | Yes |
| DGAT2 | END | PLA | Yes | No |
| DOLPP1 | END | PLA | Yes | Yes |
| SEC11A | END | PLA | Yes | No |
| CNPY3 | END | PLA | Yes | No |
| LMO1 | NUC | MIT | Yes | Yes |
| LMO2 | NUC | PLA | Yes | Yes |
| NUDT10 | CYT | PLA | Yes | No |
| CDC42EP5 | CSK | PLA | Yes | Yes |
| ZDHHC3 | GOL | PLA | Yes | Yes |
The asterisk (*) denotes a homogeneous distribution, in which the protein was observed in both the cytoplasm and the nucleus.