Literature DB >> 22945786

Interaction-based feature selection and classification for high-dimensional biological data.

Haitian Wang1, Shaw-Hwa Lo, Tian Zheng, Inchi Hu.   

Abstract

MOTIVATION: Epistasis or gene-gene interaction has gained increasing attention in studies of complex diseases. Its presence as an ubiquitous component of genetic architecture of common human diseases has been contemplated. However, the detection of gene-gene interaction is difficult due to combinatorial explosion.
RESULTS: We present a novel feature selection method incorporating variable interaction. Three gene expression datasets are analyzed to illustrate our method, although it can also be applied to other types of high-dimensional data. The quality of variables selected is evaluated in two ways: first by classification error rates, then by functional relevance assessed using biological knowledge. We show that the classification error rates can be significantly reduced by considering interactions. Secondly, a sizable portion of genes identified by our method for breast cancer metastasis overlaps with those reported in gene-to-system breast cancer (G2SBC) database as disease associated and some of them have interesting biological implication. In summary, interaction-based methods may lead to substantial gain in biological insights as well as more accurate prediction.

Entities:  

Mesh:

Year:  2012        PMID: 22945786      PMCID: PMC3577111          DOI: 10.1093/bioinformatics/bts531

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  20 in total

1.  Interaction between krit1 and icap1alpha infers perturbation of integrin beta1-mediated angiogenesis in the pathogenesis of cerebral cavernous malformation.

Authors:  J Zhang; R E Clatterbuck; D Rigamonti; D D Chang; H C Dietz
Journal:  Hum Mol Genet       Date:  2001-12-01       Impact factor: 6.150

2.  The mystery of missing heritability: Genetic interactions create phantom heritability.

Authors:  Or Zuk; Eliana Hechter; Shamil R Sunyaev; Eric S Lander
Journal:  Proc Natl Acad Sci U S A       Date:  2012-01-05       Impact factor: 11.205

3.  Structures and Assumptions: Strategies to Harness Gene × Gene and Gene × Environment Interactions in GWAS.

Authors:  Charles Kooperberg; Michael Leblanc; James Y Dai; Indika Rajapakse
Journal:  Stat Sci       Date:  2009       Impact factor: 2.901

4.  Pre-validation and inference in microarrays.

Authors:  Robert J Tibshirani; Brad Efron
Journal:  Stat Appl Genet Mol Biol       Date:  2002-08-22

Review 5.  A review of feature selection techniques in bioinformatics.

Authors:  Yvan Saeys; Iñaki Inza; Pedro Larrañaga
Journal:  Bioinformatics       Date:  2007-08-24       Impact factor: 6.937

6.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

7.  Molecular portraits of human breast tumours.

Authors:  C M Perou; T Sørlie; M B Eisen; M van de Rijn; S S Jeffrey; C A Rees; J R Pollack; D T Ross; H Johnsen; L A Akslen; O Fluge; A Pergamenschikov; C Williams; S X Zhu; P E Lønning; A L Børresen-Dale; P O Brown; D Botstein
Journal:  Nature       Date:  2000-08-17       Impact factor: 49.962

8.  Prognostic significance of carbonic anhydrase IX (CA-IX), endoglin (CD105) and 8-hydroxy-2'-deoxyguanosine (8-OHdG) in breast cancer patients.

Authors:  Lidija Beketic-Oreskovic; Petar Ozretic; Zahid N Rabbani; Isabel L Jackson; Bozena Sarcevic; Sonja Levanat; Petra Maric; Ivan Babic; Zeljko Vujaskovic
Journal:  Pathol Oncol Res       Date:  2011-01-20       Impact factor: 3.201

Review 9.  Epistasis and its implications for personal genetics.

Authors:  Jason H Moore; Scott M Williams
Journal:  Am J Hum Genet       Date:  2009-09       Impact factor: 11.025

Review 10.  Detecting gene-gene interactions that underlie human diseases.

Authors:  Heather J Cordell
Journal:  Nat Rev Genet       Date:  2009-06       Impact factor: 53.242

View more
  13 in total

1.  Why significant variables aren't automatically good predictors.

Authors:  Adeline Lo; Herman Chernoff; Tian Zheng; Shaw-Hwa Lo
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-26       Impact factor: 11.205

2.  Approaches to treatment effect heterogeneity in the presence of confounding.

Authors:  Sarah C Anoke; Sharon-Lise Normand; Corwin M Zigler
Journal:  Stat Med       Date:  2019-03-31       Impact factor: 2.373

3.  Framework for making better predictions by directly estimating variables' predictivity.

Authors:  Adeline Lo; Herman Chernoff; Tian Zheng; Shaw-Hwa Lo
Journal:  Proc Natl Acad Sci U S A       Date:  2016-11-29       Impact factor: 11.205

4.  Stratified polygenic risk prediction model with application to CAGI bipolar disorder sequencing data.

Authors:  Maggie Haitian Wang; Billy Chang; Rui Sun; Inchi Hu; Xiaoxuan Xia; William Ka Kei Wu; Ka Chun Chong; Benny Chung-Ying Zee
Journal:  Hum Mutat       Date:  2017-06-13       Impact factor: 4.878

5.  Biomarker interaction selection and disease detection based on multivariate gain ratio.

Authors:  Xiao Chu; Mao Jiang; Zhuo-Jun Liu
Journal:  BMC Bioinformatics       Date:  2022-05-12       Impact factor: 3.307

6.  A robust model-free approach for rare variants association studies incorporating gene-gene and gene-environmental interactions.

Authors:  Ruixue Fan; Shaw-Hwa Lo
Journal:  PLoS One       Date:  2013-12-17       Impact factor: 3.240

7.  Considering interactive effects in the identification of influential regions with extremely rare variants via fixed bin approach.

Authors:  Michael Agne; Chien-Hsun Huang; Inchi Hu; Haitian Wang; Tian Zheng; Shaw-Hwa Lo
Journal:  BMC Proc       Date:  2014-06-17

8.  Informative gene selection and direct classification of tumor based on Chi-square test of pairwise gene interactions.

Authors:  Hongyan Zhang; Lanzhi Li; Chao Luo; Congwei Sun; Yuan Chen; Zhijun Dai; Zheming Yuan
Journal:  Biomed Res Int       Date:  2014-07-23       Impact factor: 3.411

9.  DWFS: a wrapper feature selection tool based on a parallel genetic algorithm.

Authors:  Othman Soufan; Dimitrios Kleftogiannis; Panos Kalnis; Vladimir B Bajic
Journal:  PLoS One       Date:  2015-02-26       Impact factor: 3.240

10.  A fast and powerful W-test for pairwise epistasis testing.

Authors:  Maggie Haitian Wang; Rui Sun; Junfeng Guo; Haoyi Weng; Jack Lee; Inchi Hu; Pak Chung Sham; Benny Chung-Ying Zee
Journal:  Nucleic Acids Res       Date:  2016-04-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.