| Literature DB >> 26912414 |
Li Zhao1, Yiyun Chen2, Amol Onkar Bajaj3, Aiden Eblimit2, Mingchu Xu2, Zachry T Soens2, Feng Wang2, Zhongqi Ge2, Sung Yun Jung3, Feng He3, Yumei Li2, Theodore G Wensel4, Jun Qin4, Rui Chen5.
Abstract
Proteomic profiling on subcellular fractions provides invaluable information regarding both protein abundance and subcellular localization. When integrated with other data sets, it can greatly enhance our ability to predict gene function genome-wide. In this study, we performed a comprehensive proteomic analysis on the light-sensing compartment of photoreceptors called the outer segment (OS). By comparing with the protein profile obtained from the retina tissue depleted of OS, an enrichment score for each protein is calculated to quantify protein subcellular localization, and 84% accuracy is achieved compared with experimental data. By integrating the protein OS enrichment score, the protein abundance, and the retina transcriptome, the probability of a gene playing an essential function in photoreceptor cells is derived with high specificity and sensitivity. As a result, a list of genes that will likely result in human retinal disease when mutated was identified and validated by previous literature and/or animal model studies. Therefore, this new methodology demonstrates the synergy of combining subcellular fractionation proteomics with other omics data sets and is generally applicable to other tissues and diseases.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26912414 PMCID: PMC4864458 DOI: 10.1101/gr.198911.115
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.High-quality proteomic data of the OS and the RR were obtained. (A) Schematic diagram of the structure of a rod photoreceptor cell and a cone photoreceptor cell in mouse retina. (B, left) Immunofluorescence of isolated OS preparation stained with antibodies of RHO (green) and IFT20 (red); (right) microscopic analysis of an isolated rod OS. (C) OS protein complex (left) and RR protein complex (right) were electrophoresed. The sizes of the molecular weight markers are indicated in M. (D) Scatter plot of normalized protein abundance for OS proteins between different replicates. The average Pearson correlation between all replicates is 0.85. (E) Scatter plot of normalized protein abundance for RR proteins between different replicates. The Pearson correlation between two replicates is 0.75.
Figure 2.Novel OS proteins were identified. (A) Venn diagram of OS proteome identified in previous studies and this study. (B) Venn diagram of known retinal disease genes identified in different studies. All proteins were mapped to human homologs. (C) Distribution of log2 (Spectral Counts) for OS proteins that overlapped with previous OS studies and new proteins identified in this study.
Figure 3.Protein OS enrichment score is predictive for protein localization. (A) Venn diagram of proteins identified in the OS and RR. (B) OS enrichment score distribution of all proteins identified in the OS and RR. (C) Functional gene ontology analysis of OS-enriched and RR-enriched proteins. (D) Pie chart showing 84% of the protein localizations predicted by the enrichment score are consistent with literature, and 16% are inconsistent.
Figure 4.Integrative method was applied to predict novel retinal disease genes. (A) The enrichment score distribution of all retinal disease and background proteins identified in the OS and RR proteomes. (B) The attribute importance, measured by ReliefF score, of all attributes used during machine learning. (C) The prediction score distributions for background genes, known retinal disease genes, and candidate disease genes. (D) The average prediction scores of background genes, known retinal disease genes, and candidate disease genes. All other genes identified in the OS and RR proteome are used as background.
The sensitivity, specificity, and AUC (area under curve) of logistic regression and naïve Bayes methods for retinal disease gene prediction
Top 50 genes with the highest scores of retinal disease gene prediction