| Literature DB >> 31416413 |
Christopher M Wilson1, Kaiqiao Li2, Xiaoqing Yu1, Pei-Fen Kuan2, Xuefeng Wang3.
Abstract
BACKGROUND: Advances in medical technology have allowed for customized prognosis, diagnosis, and treatment regimens that utilize multiple heterogeneous data sources. Multiple kernel learning (MKL) is well suited for the integration of multiple high throughput data sources. MKL remains to be under-utilized by genomic researchers partly due to the lack of unified guidelines for its use, and benchmark genomic datasets.Entities:
Keywords: Classification; Data integration; Genomics; Kernel methods; Machine learning; Multiple kernel learning
Mesh:
Substances:
Year: 2019 PMID: 31416413 PMCID: PMC6694479 DOI: 10.1186/s12859-019-2992-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Recommended workflow for an MKL experiment
Fig. 2Results from SEMKL, SimpleMKL, and DAMKL on 9 benchmark datasets, where two radial kernels K1 and K2 with σ1=2 and σ2=0.05 were used. a Displays the learned kernel weight of K1 as the mean of each group changes. b Displays the predictive accuracy of each algorithm as the distance between each group changes. DAL Hinge and DAL Logistic refer to conducting DALMKL under different loss functions
Fig. 3Prediction accuracy of MKL implementations using clinical and miRNA data individually and together in a single analysis using 198 patients to train each model and 85 patients to test the corresponding model. DAL Hinge and DAL Logistic refer to conducting DALMKL under different loss functions
Fig. 4Heatmap of gene set importance for each of the 15 cancer types considered. (DALMKL Logistic)