| Literature DB >> 19067340 |
Holger Schwender1, Katja Ickstadt, Jörg Rahnenführer.
Abstract
A major task in the statistical analysis of genetic data such as gene expressions and single nucleotide polymorphisms (SNPs) is to predict whether a patient has a certain disease, or from which of several known subtypes of a disease a patient suffers. A large number of discrimination methods have been proposed in the literature and have been applied to genetic data to tackle this task. In this paper, we give an overview on the most popular of these procedures in the analysis of genetic data. Moreover, we describe how these methods for supervised classification can be combined with variable selection approaches to reduce the number of genetic features from several thousands to as few as possible to form a concise classification rule. Finally, we show how the resulting statistical models can be validated. ((c) 2008 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim).Entities:
Mesh:
Substances:
Year: 2008 PMID: 19067340 DOI: 10.1002/bimj.200810475
Source DB: PubMed Journal: Biom J ISSN: 0323-3847 Impact factor: 2.207