Literature DB >> 25980368

A novel mixed integer programming for multi-biomarker panel identification by distinguishing malignant from benign colorectal tumors.

Meng Zou1, Peng-Jun Zhang2, Xin-Yu Wen3, Luonan Chen4, Ya-Ping Tian5, Yong Wang6.   

Abstract

Multi-biomarker panels can capture the nonlinear synergy among biomarkers and they are important to aid in the early diagnosis and ultimately battle complex diseases. However, identification of these multi-biomarker panels from case and control data is challenging. For example, the exhaustive search method is computationally infeasible when the data dimension is high. Here, we propose a novel method, MILP_k, to identify serum-based multi-biomarker panel to distinguish colorectal cancers (CRC) from benign colorectal tumors. Specifically, the multi-biomarker panel detection problem is modeled by a mixed integer programming to maximize the classification accuracy. Then we measured the serum profiling data for 101 CRC patients and 95 benign patients. The 61 biomarkers were analyzed individually and further their combinations by our method. We discovered 4 biomarkers as the optimal small multi-biomarker panel, including known CRC biomarkers CEA and IL-10 as well as novel biomarkers IMA and NSE. This multi-biomarker panel obtains leave-one-out cross-validation (LOOCV) accuracy to 0.7857 by nearest centroid classifier. An independent test of this panel by support vector machine (SVM) with threefold cross validation gets an AUC 0.8438. This greatly improves the predictive accuracy by 20% over the single best biomarker. Further extension of this 4-biomarker panel to a larger 13-biomarker panel improves the LOOCV to 0.8673 with independent AUC 0.8437. Comparison with the exhaustive search method shows that our method dramatically reduces the searching time by 1000-fold. Experiments on the early cancer stage samples reveal two panel of biomarkers and show promising accuracy. The proposed method allows us to select the subset of biomarkers with best accuracy to distinguish case and control samples given the number of selected biomarkers. Both receiver operating characteristic curve and precision-recall curve show our method's consistent performance gain in accuracy. Our method also shows its advantage in capturing synergy among selected biomarkers. The multi-biomarker panel far outperforms the simple combination of best single features. Close investigation of the multi-biomarker panel illustrates that our method possesses the ability to remove redundancy and reveals complementary biomarker combinations. In addition, our method is efficient and can select multi-biomarker panel with more than 5 biomarkers, for which the exhaustive methods fail. In conclusion, we propose a promising model to improve the clinical data interpretability and to serve as a useful tool for other complex disease studies. Our small multi-biomarker panel, CEA, IL-10, IMA, and NSE, may provide insights on the disease status of colorectal diseases. The implementation of our method in MATLAB is available via the website: http://doc.aporc.org/wiki/MILP_k.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Colorectal cancer; Mixed integer programming; Multi-biomarker; Translational bioinformatics

Mesh:

Substances:

Year:  2015        PMID: 25980368     DOI: 10.1016/j.ymeth.2015.05.011

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  7 in total

1.  ColoFinder: a prognostic 9-gene signature improves prognosis for 871 stage II and III colorectal cancer patients.

Authors:  Mingguang Shi; Jianmin He
Journal:  PeerJ       Date:  2016-03-14       Impact factor: 2.984

2.  Metabolomics and metabolic pathway networks from human colorectal cancers, adjacent mucosa, and stool.

Authors:  Dustin G Brown; Sangeeta Rao; Tiffany L Weir; Joanne O'Malia; Marlon Bazan; Regina J Brown; Elizabeth P Ryan
Journal:  Cancer Metab       Date:  2016-06-06

3.  An approach to forecast human cancer by profiling microRNA expressions from NGS data.

Authors:  A Salim; R Amjesh; S S Vinod Chandra
Journal:  BMC Cancer       Date:  2017-01-25       Impact factor: 4.430

4.  Prognostic Value of Enolase Gene Family in Colon Cancer.

Authors:  Xiaohang Pan; Huawen Wu; Guofu Chen; Wenhuan Li
Journal:  Med Sci Monit       Date:  2020-07-24

5.  Omics Integration Analysis Unravel the Landscape of Driving Mechanisms of Colorectal Cancer.

Authors:  Fatemeh Nikmanesh; Shamim Sarhadi; Mehdi Dadashpour; Yazdan Asghari; Nosratollah Zarghami
Journal:  Asian Pac J Cancer Prev       Date:  2020-12-01

6.  Analysis of Half a Billion Datapoints Across Ten Machine-Learning Algorithms Identifies Key Elements Associated With Insulin Transcription in Human Pancreatic Islet Cells.

Authors:  Wilson K M Wong; Vinod Thorat; Mugdha V Joglekar; Charlotte X Dong; Hugo Lee; Yi Vee Chew; Adwait Bhave; Wayne J Hawthorne; Feyza Engin; Aniruddha Pant; Louise T Dalgaard; Sharda Bapat; Anandwardhan A Hardikar
Journal:  Front Endocrinol (Lausanne)       Date:  2022-03-23       Impact factor: 6.055

7.  Identifying Critical States of Complex Diseases by Single-Sample Jensen-Shannon Divergence.

Authors:  Jinling Yan; Peiluan Li; Rong Gao; Ying Li; Luonan Chen
Journal:  Front Oncol       Date:  2021-06-04       Impact factor: 6.244

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.