Literature DB >> 27153719

MetaKTSP: a meta-analytic top scoring pair method for robust cross-study validation of omics prediction analysis.

SungHwan Kim1, Chien-Wei Lin2, George C Tseng3.   

Abstract

MOTIVATION: Supervised machine learning is widely applied to transcriptomic data to predict disease diagnosis, prognosis or survival. Robust and interpretable classifiers with high accuracy are usually favored for their clinical and translational potential. The top scoring pair (TSP) algorithm is an example that applies a simple rank-based algorithm to identify rank-altered gene pairs for classifier construction. Although many classification methods perform well in cross-validation of single expression profile, the performance usually greatly reduces in cross-study validation (i.e. the prediction model is established in the training study and applied to an independent test study) for all machine learning methods, including TSP. The failure of cross-study validation has largely diminished the potential translational and clinical values of the models. The purpose of this article is to develop a meta-analytic top scoring pair (MetaKTSP) framework that combines multiple transcriptomic studies and generates a robust prediction model applicable to independent test studies.
RESULTS: We proposed two frameworks, by averaging TSP scores or by combining P-values from individual studies, to select the top gene pairs for model construction. We applied the proposed methods in simulated data sets and three large-scale real applications in breast cancer, idiopathic pulmonary fibrosis and pan-cancer methylation. The result showed superior performance of cross-study validation accuracy and biomarker selection for the new meta-analytic framework. In conclusion, combining multiple omics data sets in the public domain increases robustness and accuracy of the classification model that will ultimately improve disease understanding and clinical treatment decisions to benefit patients.
AVAILABILITY AND IMPLEMENTATION: An R package MetaKTSP is available online. (http://tsenglab.biostat.pitt.edu/software.htm). CONTACT: ctseng@pitt.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2016        PMID: 27153719      PMCID: PMC6280887          DOI: 10.1093/bioinformatics/btw115

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  46 in total

1.  Limits of predictive models using microarray data for breast cancer clinical treatment outcome.

Authors:  James F Reid; Lara Lusa; Loris De Cecco; Danila Coradini; Silvia Veneroni; Maria Grazia Daidone; Manuela Gariboldi; Marco A Pierotti
Journal:  J Natl Cancer Inst       Date:  2005-06-15       Impact factor: 13.506

2.  Cross-study validation and combined analysis of gene expression microarray data.

Authors:  Elizabeth Garrett-Mayer; Giovanni Parmigiani; Xiaogang Zhong; Leslie Cope; Edward Gabrielson
Journal:  Biostatistics       Date:  2007-09-14       Impact factor: 5.899

3.  Multiclass cancer diagnosis using tumor gene expression signatures.

Authors:  S Ramaswamy; P Tamayo; R Rifkin; S Mukherjee; C H Yeang; M Angelo; C Ladd; M Reich; E Latulippe; J P Mesirov; T Poggio; W Gerald; M Loda; E S Lander; T R Golub
Journal:  Proc Natl Acad Sci U S A       Date:  2001-12-11       Impact factor: 11.205

4.  Development of omics-based clinical tests for prognosis and therapy selection: the challenge of achieving statistical robustness and clinical utility.

Authors:  Lisa M McShane; Mei-Yin C Polley
Journal:  Clin Trials       Date:  2013-09-02       Impact factor: 2.486

5.  Genome-wide methylation analysis identifies epigenetically inactivated candidate tumour suppressor genes in renal cell carcinoma.

Authors:  M R Morris; C J Ricketts; D Gentle; F McRonald; N Carli; H Khalili; M Brown; T Kishida; M Yao; R E Banks; N Clarke; F Latif; E R Maher
Journal:  Oncogene       Date:  2010-12-06       Impact factor: 9.867

6.  Simple decision rules for classifying human cancers from gene expression profiles.

Authors:  Aik Choon Tan; Daniel Q Naiman; Lei Xu; Raimond L Winslow; Donald Geman
Journal:  Bioinformatics       Date:  2005-08-16       Impact factor: 6.937

7.  Supervised risk predictor of breast cancer based on intrinsic subtypes.

Authors:  Joel S Parker; Michael Mullins; Maggie C U Cheang; Samuel Leung; David Voduc; Tammi Vickery; Sherri Davies; Christiane Fauron; Xiaping He; Zhiyuan Hu; John F Quackenbush; Inge J Stijleman; Juan Palazzo; J S Marron; Andrew B Nobel; Elaine Mardis; Torsten O Nielsen; Matthew J Ellis; Charles M Perou; Philip S Bernard
Journal:  J Clin Oncol       Date:  2009-02-09       Impact factor: 44.544

8.  Highly accurate two-gene classifier for differentiating gastrointestinal stromal tumors and leiomyosarcomas.

Authors:  Nathan D Price; Jonathan Trent; Adel K El-Naggar; David Cogdell; Ellen Taylor; Kelly K Hunt; Raphael E Pollock; Leroy Hood; Ilya Shmulevich; Wei Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  2007-02-21       Impact factor: 11.205

9.  Breast cancer index identifies early-stage estrogen receptor-positive breast cancer patients at risk for early- and late-distant recurrence.

Authors:  Yi Zhang; Catherine A Schnabel; Brock E Schroeder; Piiha-Lotta Jerevall; Rachel C Jankowitz; Tommy Fornander; Olle Stål; Adam M Brufsky; Dennis Sgroi; Mark G Erlander
Journal:  Clin Cancer Res       Date:  2013-06-11       Impact factor: 12.531

10.  CMA: a comprehensive Bioconductor package for supervised classification with high dimensional data.

Authors:  M Slawski; M Daumer; A-L Boulesteix
Journal:  BMC Bioinformatics       Date:  2008-10-16       Impact factor: 3.169

View more
  16 in total

1.  Meta-analytic principal component analysis in integrative omics application.

Authors:  SungHwan Kim; Dongwan Kang; Zhiguang Huo; Yongseok Park; George C Tseng
Journal:  Bioinformatics       Date:  2018-04-15       Impact factor: 6.937

2.  Prediction of prognosis and immunotherapy response with a robust immune-related lncRNA pair signature in lung adenocarcinoma.

Authors:  Kui Cao; Mingdong Liu; Keru Ma; Xiangyu Jiang; Jianqun Ma; Jinhong Zhu
Journal:  Cancer Immunol Immunother       Date:  2021-10-15       Impact factor: 6.968

3.  Development and Validation of an Individualized Immune Prognostic Signature in Early-Stage Nonsquamous Non-Small Cell Lung Cancer.

Authors:  Bailiang Li; Yi Cui; Maximilian Diehn; Ruijiang Li
Journal:  JAMA Oncol       Date:  2017-11-01       Impact factor: 31.777

4.  High-dimensional integrative copula discriminant analysis for multiomics data.

Authors:  Yong He; Hao Chen; Hao Sun; Jiadong Ji; Yufeng Shi; Xinsheng Zhang; Lei Liu
Journal:  Stat Med       Date:  2020-10-15       Impact factor: 2.373

5.  Systematic assessment of cervical cancer initiation and progression uncovers genetic panels for deep learning-based early diagnosis and proposes novel diagnostic and prognostic biomarkers.

Authors:  Nguyen Phuoc Long; Kyung Hee Jung; Sang Jun Yoon; Nguyen Hoang Anh; Tran Diem Nghi; Yun Pyo Kang; Hong Hua Yan; Jung Eun Min; Soon-Sun Hong; Sung Won Kwon
Journal:  Oncotarget       Date:  2017-11-25

6.  MINT: a multivariate integrative method to identify reproducible molecular signatures across independent experiments and platforms.

Authors:  Florian Rohart; Aida Eslami; Nicholas Matigian; Stéphanie Bougeard; Kim-Anh Lê Cao
Journal:  BMC Bioinformatics       Date:  2017-02-27       Impact factor: 3.169

7.  Meta-analytic support vector machine for integrating multiple omics data.

Authors:  SungHwan Kim; Jae-Hwan Jhong; JungJun Lee; Ja-Yong Koo
Journal:  BioData Min       Date:  2017-01-26       Impact factor: 2.522

8.  Biomarker Categorization in Transcriptomic Meta-Analysis by Concordant Patterns With Application to Pan-Cancer Studies.

Authors:  Zhenyao Ye; Hongjie Ke; Shuo Chen; Raul Cruz-Cano; Xin He; Jing Zhang; Joanne Dorgan; Donald K Milton; Tianzhou Ma
Journal:  Front Genet       Date:  2021-07-02       Impact factor: 4.599

9.  Biomarkers for classification and class prediction of stress in a murine model of chronic subordination stress.

Authors:  Dominik Langgartner; Andrea M Füchsl; Lisa M Kaiser; Tatjana Meier; Sandra Foertsch; Christian Buske; Stefan O Reber; Medhanie A Mulaw
Journal:  PLoS One       Date:  2018-09-05       Impact factor: 3.240

10.  Development and Validation of an RNA-Seq-Based Prognostic Signature in Neuroblastoma.

Authors:  Jian-Guo Zhou; Bo Liang; Su-Han Jin; Hui-Ling Liao; Guo-Bo Du; Long Cheng; Hu Ma; Udo S Gaipl
Journal:  Front Oncol       Date:  2019-12-04       Impact factor: 6.244

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.