Literature DB >> 35706767

A statistical methodology to select covariates in high-dimensional data under dependence. Application to the classification of genetic profiles in oncology.

B Bastien1, T Boukhobza2, H Dumond2, A Gégout-Petit3, A Muller-Gueudin3, C Thiébaut2.   

Abstract

We propose a new methodology for selecting and ranking covariates associated with a variable of interest in a context of high-dimensional data under dependence but few observations. The methodology successively intertwines the clustering of covariates, decorrelation of covariates using Factor Latent Analysis, selection using aggregation of adapted methods and finally ranking. A simulation study shows the interest of the decorrelation inside the different clusters of covariates. We first apply our method to transcriptomic data of 37 patients with advanced non-small-cell lung cancer who have received chemotherapy, to select the transcriptomic covariates that explain the survival outcome of the treatment. Secondly, we apply our method to 79 breast tumor samples to define patient profiles for a new metastatic biomarker and associated gene network in order to personalize the treatments.
© 2020 Informa UK Limited, trading as Taylor & Francis Group.

Entities:  

Keywords:  Aggregated methods; correlated covariates selection; genetic profiles; high dimension; multiple testing procedures; personalized medicine; ranking; variable selection

Year:  2020        PMID: 35706767      PMCID: PMC9041748          DOI: 10.1080/02664763.2020.1837083

Source DB:  PubMed          Journal:  J Appl Stat        ISSN: 0266-4763            Impact factor:   1.416


  10 in total

Review 1.  Micro-RNAs and breast cancer.

Authors:  John Le Quesne; Carlos Caldas
Journal:  Mol Oncol       Date:  2010-04-28       Impact factor: 6.603

2.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

3.  RankGene: identification of diagnostic genes based on expression data.

Authors:  Yang Su; T M Murali; Vladimir Pavlovic; Michael Schaffer; Simon Kasif
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

4.  Identification, cloning, and expression of human estrogen receptor-alpha36, a novel variant of human estrogen receptor-alpha66.

Authors:  Zhaoyi Wang; Xintian Zhang; Peng Shen; Brian W Loggie; Yunchao Chang; Thomas F Deuel
Journal:  Biochem Biophys Res Commun       Date:  2005-11-04       Impact factor: 3.575

5.  Multiple testing. Part I. Single-step procedures for control of general type I error rates.

Authors:  Sandrine Dudoit; Mark J van der Laan; Katherine S Pollard
Journal:  Stat Appl Genet Mol Biol       Date:  2004-06-09

6.  Novel multivariate methods for integration of genomics and proteomics data: applications in a kidney transplant rejection study.

Authors:  Oliver P Günther; Heesun Shin; Raymond T Ng; W Robert McMaster; Bruce M McManus; Paul A Keown; Scott J Tebbutt; Kim-Anh Lê Cao
Journal:  OMICS       Date:  2014-11

7.  Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems.

Authors:  Kim-Anh Lê Cao; Simon Boitard; Philippe Besse
Journal:  BMC Bioinformatics       Date:  2011-06-22       Impact factor: 3.169

8.  From ERα66 to ERα36: a generic method for validating a prognosis marker of breast tumor progression.

Authors:  Clémence Chamard-Jovenin; Alain C Jung; Amand Chesnel; Joseph Abecassis; Stéphane Flament; Sonia Ledrappier; Christine Macabre; Taha Boukhobza; Hélène Dumond
Journal:  BMC Syst Biol       Date:  2015-06-17

9.  Transcriptional hallmarks of cancer cell lines reveal an emerging role of branched chain amino acid catabolism.

Authors:  Ieva Antanavičiūtė; Valeryia Mikalayeva; Ieva Ceslevičienė; Gintarė Milašiūtė; Vytenis Arvydas Skeberdis; Sergio Bordel
Journal:  Sci Rep       Date:  2017-08-10       Impact factor: 4.379

10.  Determination of the differentially expressed genes in microarray experiments using local FDR.

Authors:  J Aubert; A Bar-Hen; J J Daudin; S Robin
Journal:  BMC Bioinformatics       Date:  2004-09-06       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.