| Literature DB >> 26416496 |
Abstract
Protein sorting is an important mechanism for transporting proteins to their target subcellular locations after their synthesis. Mutations on genes may disrupt the well regulated protein sorting process, leading to a variety of mislocation related diseases. This paper proposes a methodology to discover such disease genes based on gene expression data and computational protein localization prediction. A kernel logistic regression based algorithm is used to successfully identify several candidate cancer genes which may cause cancers due to their mislocation within the cell. Our results also showed that compared to the gene co-expression network defined on Pearson correlation coefficients, the nonlinear Maximum Correlation Coefficients (MIC) based co-expression network give better results for subcellular localization prediction.Entities:
Keywords: Cancer gene; Disease gene identification; Gene expression; Protein localization
Mesh:
Year: 2015 PMID: 26416496 DOI: 10.1016/j.ymeth.2015.09.022
Source DB: PubMed Journal: Methods ISSN: 1046-2023 Impact factor: 3.608