Deep Collaborative Filtering for Prediction of Disease Genes.

Xiangxiang Zeng, Yinglai Lin, Yuying He, Linyuan Lu, Xiaoping Min, Alfonso Rodriguez-Paton.   

Abstract

Accurate prioritization of potential disease genes is a fundamental challenge in biomedical research, and various algorithms have been developed to address it. Inductive Matrix Completion (IMC) is one of the most reliable models, owing to its well-established framework and its superior performance in predicting gene-disease associations. However, the IMC method does not hierarchically extract deep features, which might limit the quality of recovery. To address this, our Deep Collaborative Filtering (DCF) model applies a deep learning architecture to the side information of genes; such an architecture obtains high-level representations and handles the noise and outliers present in large-scale biological datasets. Further, because negative examples are lacking, we also apply a Positive-Unlabeled (PU) learning formulation to low-rank matrix completion. Our approach achieves substantially improved performance over other state-of-the-art methods on diseases from the Online Mendelian Inheritance in Man (OMIM) database. Our approach is 10 percent more efficient than standard IMC in detecting a true association, and significantly outperforms other alternatives in terms of the precision-recall metric at the top-k predictions. Moreover, we also validate the approach on diseases with no previously known gene associations and on newly reported OMIM associations. The experimental results show that DCF remains satisfactory for ranking novel disease phenotypes as well as for mining unexplored relationships. The source code and the data are available at https://github.com/xzenglab/DCF.
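As a rough illustration of the two ideas the abstract combines (inductive matrix completion over side information, and a PU weighting that trusts known associations more than unlabeled entries), the following is a minimal NumPy sketch. It is not the DCF implementation from the repository above: the deep feature extraction step is omitted, and the function names, the weight `alpha`, the regularization `lam`, and the plain gradient-descent optimizer are all assumptions made for illustration.

```python
import numpy as np

def pu_loss(P, X, Y, W, H, alpha=0.2, lam=0.1):
    """Weighted squared loss: known associations weigh 1, unlabeled entries weigh alpha."""
    C = np.where(P > 0, 1.0, alpha)          # PU weights (alpha is a hypothetical choice)
    S = X @ W @ H.T @ Y.T                    # predicted gene-disease scores
    reg = 0.5 * lam * (np.sum(W ** 2) + np.sum(H ** 2))
    return 0.5 * np.sum(C * (S - P) ** 2) + reg

def pu_imc_fit(P, X, Y, rank=2, alpha=0.2, lam=0.1, lr=0.05, steps=500, seed=0):
    """Fit low-rank factors W, H so that X @ W @ H.T @ Y.T approximates P.

    P: (n_genes, n_diseases), 1 = known association, 0 = unlabeled.
    X: (n_genes, f_g) gene features; Y: (n_diseases, f_d) disease features.
    """
    rng = np.random.default_rng(seed)
    W = 0.1 * rng.standard_normal((X.shape[1], rank))
    H = 0.1 * rng.standard_normal((Y.shape[1], rank))
    C = np.where(P > 0, 1.0, alpha)
    for _ in range(steps):
        R = C * (X @ W @ H.T @ Y.T - P)      # weighted residual
        grad_W = X.T @ R @ Y @ H + lam * W   # gradient of pu_loss w.r.t. W
        grad_H = Y.T @ R.T @ X @ W + lam * H # gradient of pu_loss w.r.t. H
        W -= lr * grad_W
        H -= lr * grad_H
    return W, H

if __name__ == "__main__":
    # Toy data: 4 genes, 3 diseases, identity features (reduces to plain matrix completion).
    P = np.array([[1, 0, 0], [1, 0, 0], [0, 1, 0], [0, 1, 1]], dtype=float)
    X, Y = np.eye(4), np.eye(3)
    W, H = pu_imc_fit(P, X, Y)
    scores = X @ W @ H.T @ Y.T
    # Rank candidate genes for disease 0 by predicted score, highest first.
    print(np.argsort(-scores[:, 0]))
```

Setting `alpha` below 1 encodes the PU assumption that a zero in the association matrix means "not yet observed" rather than "confirmed negative"; in a model like the one described, `X` would be replaced by learned high-level gene representations.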

Year:  2019        PMID: 30932845     DOI: 10.1109/TCBB.2019.2907536

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  8 in total

1.  HSM6AP: a high-precision predictor for the Homo sapiens N6-methyladenosine (m6A) based on multiple weights and feature stitching.

Authors:  Jing Li; Shida He; Fei Guo; Quan Zou
Journal:  RNA Biol       Date:  2021-02-12       Impact factor: 4.652

2.  A Method for Prediction of Thermophilic Protein Based on Reduced Amino Acids and Mixed Features.

Authors:  Changli Feng; Zhaogui Ma; Deyun Yang; Xin Li; Jun Zhang; Yanjuan Li
Journal:  Front Bioeng Biotechnol       Date:  2020-05-05

3.  RNA-Associated Co-expression Network Identifies Novel Biomarkers for Digestive System Cancer.

Authors:  Zheng Chen; Zijie Shen; Zilong Zhang; Da Zhao; Lei Xu; Lijun Zhang
Journal:  Front Genet       Date:  2021-03-26       Impact factor: 4.599

4.  iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest.

Authors:  Dongxu Zhao; Zhixia Teng; Yanjuan Li; Dong Chen
Journal:  Front Genet       Date:  2021-11-30       Impact factor: 4.599

5.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

6.  Stable DNA Sequence Over Close-Ending and Pairing Sequences Constraint.

Authors:  Xue Li; Ziqi Wei; Bin Wang; Tao Song
Journal:  Front Genet       Date:  2021-05-17       Impact factor: 4.599

7.  sgRNA-PSM: Predict sgRNAs On-Target Activity Based on Position-Specific Mismatch.

Authors:  Bin Liu; Zhihua Luo; Juan He
Journal:  Mol Ther Nucleic Acids       Date:  2020-01-31       Impact factor: 8.886

8.  Predicting ATP-Binding Cassette Transporters Using the Random Forest Method.

Authors:  Ruiyan Hou; Lida Wang; Yi-Jun Wu
Journal:  Front Genet       Date:  2020-03-25       Impact factor: 4.599
