
Multilevel heterogeneous omics data integration with kernel fusion.

Haitao Yang, Hongyan Cao, Tao He, Tong Wang, Yuehua Cui.

Abstract

High-throughput omics data are now generated at an almost unlimited pace. It is increasingly important to integrate different omics data types to disentangle the molecular machinery of complex diseases, with the hope of improving disease prevention and treatment. Because the relationships among different omics data features are typically unknown, a supervised learning model that assumes a particular distribution with a specific structure is poorly suited to capturing the underlying complex relationship between multiple features and a disease phenotype. In this work, we briefly reviewed methods for kernel fusion (KF) based on support vector machine and kernel partial least squares (KPLS) algorithms. We then proposed a fused KPLS (fKPLS) model for disease classification and prediction with multilevel omics data. The fused kernel can handle effect heterogeneity, in which different omics data types contribute differently to the trait of interest, with the aim of improving prediction performance. We proposed optimizing the kernel parameters and kernel weights with a genetic algorithm (GA). The proposed GA-fKPLS model can substantially improve disease classification performance by integrating multiple omics data types, as demonstrated via extensive simulations and real data analysis. With properly defined fitness functions during GA optimization, the proposed KF method can be extended to other kernel-based analyses, such as kernel association analysis with common or rare variants.
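
To make the kernel fusion idea concrete, the following is a minimal sketch, not the authors' implementation. It assumes the fused kernel is a weighted sum of one Gaussian (RBF) kernel per omics data type; the abstract does not state the kernel family or how the weights are constrained, so both are assumptions here. The function names, data blocks, and parameter values are hypothetical, and the weights and kernel parameters are fixed by hand rather than tuned by a genetic algorithm.

```python
import numpy as np

def rbf_kernel(X, gamma):
    """Gaussian (RBF) kernel matrix for the rows (samples) of X."""
    sq_norms = np.sum(X ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * X @ X.T
    np.maximum(sq_dists, 0.0, out=sq_dists)  # clip tiny negatives from rounding
    return np.exp(-gamma * sq_dists)

def fused_kernel(blocks, gammas, weights):
    """Weighted sum of per-omics kernels; weights are normalised to sum to 1.

    Assumption: one RBF kernel per data block, combined linearly.
    """
    w = np.asarray(weights, dtype=float)
    if np.any(w < 0) or w.sum() == 0:
        raise ValueError("weights must be non-negative and not all zero")
    w = w / w.sum()
    return sum(wi * rbf_kernel(X, g) for wi, X, g in zip(w, blocks, gammas))

# Hypothetical data: 6 samples measured on two omics levels.
rng = np.random.default_rng(0)
expr = rng.normal(size=(6, 20))   # e.g. an expression block (made up)
meth = rng.normal(size=(6, 15))   # e.g. a methylation block (made up)

K = fused_kernel([expr, meth], gammas=[0.05, 0.1], weights=[2.0, 1.0])
```

Under these assumptions, a larger weight lets one data type contribute more to the sample similarity, which is one way to express the effect heterogeneity the abstract mentions. A non-negative weighted sum of valid kernels is itself a valid kernel, so `K` can be passed to any kernel method; a search procedure such as a GA would then score candidate `(gammas, weights)` pairs with a fitness function, for example cross-validated classification accuracy.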

Year:  2018        PMID: 30496340     DOI: 10.1093/bib/bby115

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  3 in total

1.  How Can Gene-Expression Information Improve Prognostic Prediction in TCGA Cancers: An Empirical Comparison Study on Regularization and Mixed Cox Models.

Authors:  Xinghao Yu; Ting Wang; Shuiping Huang; Ping Zeng
Journal:  Front Genet       Date:  2020-08-21       Impact factor: 4.599

2.  Risk Prediction in Patients With Heart Failure With Preserved Ejection Fraction Using Gene Expression Data and Machine Learning.

Authors:  Liye Zhou; Zhifei Guo; Bijue Wang; Yongqing Wu; Zhi Li; Hongmei Yao; Ruiling Fang; Haitao Yang; Hongyan Cao; Yuehua Cui
Journal:  Front Genet       Date:  2021-03-22       Impact factor: 4.599

3.  Editorial: Cross-Domain Analysis for "All of Us" Precision Medicine.

Authors:  Tao Zeng; Tao Huang; Chuan Lu
Journal:  Front Genet       Date:  2021-07-01       Impact factor: 4.599

