Literature DB >> 25640961

Using distance covariance for improved variable selection with application to learning genetic risk models.

Jing Kong1, Sijian Wang, Grace Wahba.   

Abstract

Variable selection is of increasing importance to address the difficulties of high dimensionality in many scientific areas. In this paper, we demonstrate a property for distance covariance, which is incorporated in a novel feature screening procedure together with the use of distance correlation. The approach makes no distributional assumptions for the variables and does not require the specification of a regression model and hence is especially attractive in variable selection given an enormous number of candidate attributes without much information about the true model with the response. The method is applied to two genetic risk problems, where issues including uncertainty of variable selection via cross validation, subgroup of hard-to-classify cases, and the application of a reject option are discussed.
Copyright © 2015 John Wiley & Sons, Ltd.

Entities:  

Keywords:  SVM with reject option; TCGA ovarian cancer data; distance correlation; penalized Bernoulli likelihood; variable selection

Mesh:

Substances:

Year:  2015        PMID: 25640961      PMCID: PMC4441212          DOI: 10.1002/sim.6441

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  16 in total

1.  The Gs-linked receptor GPR3 maintains meiotic arrest in mammalian oocytes.

Authors:  Lisa M Mehlmann; Yoshinaga Saeki; Shigeru Tanaka; Thomas J Brennan; Alexei V Evsikov; Frank L Pendola; Barbara B Knowles; John J Eppig; Laurinda A Jaffe
Journal:  Science       Date:  2004-12-10       Impact factor: 47.728

2.  Using distance correlation and SS-ANOVA to assess associations of familial relationships, lifestyle factors, diseases, and mortality.

Authors:  Jing Kong; Barbara E K Klein; Ronald Klein; Kristine E Lee; Grace Wahba
Journal:  Proc Natl Acad Sci U S A       Date:  2012-11-21       Impact factor: 11.205

3.  Inhibition of FGFR2 and FGFR1 increases cisplatin sensitivity in ovarian cancer.

Authors:  Claire Cole; Sin Lau; Alison Backen; Andrew Clamp; Graham Rushton; Caroline Dive; Cassandra Hodgkinson; Rhona McVey; Henry Kitchener; Gordon C Jayson
Journal:  Cancer Biol Ther       Date:  2010-09-04       Impact factor: 4.742

4.  On Brownian Distance Covariance and High Dimensional Data.

Authors:  Michael R Kosorok
Journal:  Ann Appl Stat       Date:  2009-01-01       Impact factor: 2.083

5.  Premature ovarian aging in mice deficient for Gpr3.

Authors:  Catherine Ledent; Isabelle Demeestere; David Blum; Julien Petermans; Tuula Hämäläinen; Guillaume Smits; Gilbert Vassart
Journal:  Proc Natl Acad Sci U S A       Date:  2005-06-13       Impact factor: 11.205

6.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks.

Authors:  J Khan; J S Wei; M Ringnér; L H Saal; M Ladanyi; F Westermann; F Berthold; M Schwab; C R Antonescu; C Peterson; P S Meltzer
Journal:  Nat Med       Date:  2001-06       Impact factor: 53.440

7.  Prediction of chemotherapeutic response in ovarian cancer with DNA microarray expression profiling.

Authors:  Zachariah E Selvanayagam; Tak Hong Cheung; Nien Wei; Ragini Vittal; Keith Wing Kit Lo; Winnie Yeo; Tsunekazu Kita; Roald Ravatn; Tony Kwok Hung Chung; Yick Fu Wong; Khew-Voon Chin
Journal:  Cancer Genet Cytogenet       Date:  2004-10-01

8.  Feature Screening via Distance Correlation Learning.

Authors:  Runze Li; Wei Zhong; Liping Zhu
Journal:  J Am Stat Assoc       Date:  2012-07-01       Impact factor: 5.033

9.  Genetic variants in TGF-β pathway are associated with ovarian cancer risk.

Authors:  Jikai Yin; Karen Lu; Jie Lin; Lei Wu; Michelle A T Hildebrandt; David W Chang; Larissa Meyer; Xifeng Wu; Dong Liang
Journal:  PLoS One       Date:  2011-09-30       Impact factor: 3.240

10.  Integrated genomic analyses of ovarian carcinoma.

Authors: 
Journal:  Nature       Date:  2011-06-29       Impact factor: 49.962

View more
  4 in total

1.  ASYMPTOTIC DISTRIBUTIONS OF HIGH-DIMENSIONAL DISTANCE CORRELATION INFERENCE.

Authors:  Lan Gao; Yingying Fan; Jinchi Lv; Qi-Man Shao
Journal:  Ann Stat       Date:  2021-09-29       Impact factor: 4.904

2.  An adaptive threshold determination method of feature screening for genomic selection.

Authors:  Guifang Fu; Gang Wang; Xiaotian Dai
Journal:  BMC Bioinformatics       Date:  2017-04-12       Impact factor: 3.169

3.  Detecting PCOS susceptibility loci from genome-wide association studies via iterative trend correlation based feature screening.

Authors:  Xiaotian Dai; Guifang Fu; Randall Reese
Journal:  BMC Bioinformatics       Date:  2020-05-04       Impact factor: 3.169

4.  Association Factor for Identifying Linear and Nonlinear Correlations in Noisy Conditions.

Authors:  Nezamoddin N Kachouie; Wejdan Deebani
Journal:  Entropy (Basel)       Date:  2020-04-13       Impact factor: 2.524

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.