
Classification with high-dimensional genetic data: assigning patients and genetic features to known classes.

Holger Schwender, Katja Ickstadt, Jörg Rahnenführer.

Abstract

A major task in the statistical analysis of genetic data such as gene expressions and single nucleotide polymorphisms (SNPs) is to predict whether a patient has a certain disease, or which of several known subtypes of a disease a patient suffers from. A large number of discrimination methods have been proposed in the literature and applied to genetic data to tackle this task. In this paper, we give an overview of the most popular of these procedures in the analysis of genetic data. Moreover, we describe how these methods for supervised classification can be combined with variable selection approaches to reduce the number of genetic features from several thousand to as few as possible, so as to form a concise classification rule. Finally, we show how the resulting statistical models can be validated. (© 2008 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim).

Year:  2008        PMID: 19067340     DOI: 10.1002/bimj.200810475

Source DB:  PubMed          Journal:  Biom J        ISSN: 0323-3847            Impact factor:   2.207


  8 in total

1.  Single nucleotide polymorphisms predict symptom severity of autism spectrum disorder.

Authors:  Yun Jiao; Rong Chen; Xiaoyan Ke; Lu Cheng; Kangkang Chu; Zuhong Lu; Edward H Herskovits
Journal:  J Autism Dev Disord       Date:  2012-06

2.  Shrunken methodology to genome-wide SNPs selection and construction of SNPs networks.

Authors:  Yang Liu; Michael Ng
Journal:  BMC Syst Biol       Date:  2010-09-13

3.  Comparative genome analysis of a large Dutch Legionella pneumophila strain collection identifies five markers highly correlated with clinical strains.

Authors:  Ed Yzerman; Jeroen W den Boer; Martien Caspers; Arpit Almal; Bill Worzel; Walter van der Meer; Roy Montijn; Frank Schuren
Journal:  BMC Genomics       Date:  2010-07-15       Impact factor: 3.969

4.  Latent variable modeling paradigms for genotype-trait association studies.

Authors:  Yan Liu; Andrea S Foulkes
Journal:  Biom J       Date:  2011-09       Impact factor: 2.207

5.  A Robust Supervised Variable Selection for Noisy High-Dimensional Data.

Authors:  Jan Kalina; Anna Schlenker
Journal:  Biomed Res Int       Date:  2015-06-02       Impact factor: 3.411

6.  Refining developmental coordination disorder subtyping with multivariate statistical methods.

Authors:  Christophe Lalanne; Bruno Falissard; Bernard Golse; Laurence Vaivre-Douret
Journal:  BMC Med Res Methodol       Date:  2012-07-26       Impact factor: 4.615

7.  Genome analysis of Legionella pneumophila strains using a mixed-genome microarray.

Authors:  Sjoerd M Euser; Nico J Nagelkerke; Frank Schuren; Ruud Jansen; Jeroen W Den Boer
Journal:  PLoS One       Date:  2012-10-18       Impact factor: 3.240

8.  Genomic analyses of African Trypanozoon strains to assess evolutionary relationships and identify markers for strain identification.

Authors:  Joshua Brian Richardson; Kuang-Yao Lee; Paul Mireji; John Enyaru; Mark Sistrom; Serap Aksoy; Hongyu Zhao; Adalgisa Caccone
Journal:  PLoS Negl Trop Dis       Date:  2017-09-29
