Literature DB >> 15177665

A pilot study on the application of statistical classification procedures to molecular epidemiological data.

Holger Schwender1, Manuela Zucknick, Katja Ickstadt, Hermann M Bolt.   

Abstract

The development of new statistical methods for use in molecular epidemiology comprises the building and application of appropriate classification rules. The aim of this study was to assess various classification methods that can potentially handle genetic interactions. A data set comprising genotypes at 25 single nucleotide polymorphic (SNP) loci from 518 breast cancer cases and 586 age-matched population-based controls from the GENICA study was used to built a classification rule with the discrimination methods SVM (support vector machine), CART (classification and regression tree), Bagging, Random Forest, LogitBoost and k nearest neighbours (kNN). A blind pilot analysis of the genotypic data set was a first approach to obtain an impression of the statistical structure of the data. Furthermore, this analysis was performed to explore classification methods that may be applied to molecular-epidemiological evaluation. The results showed that all blindly applied classification methods had a slightly smaller misclassification rate than a random classification. The findings, nevertheless, suggest that SNP data might be useful for the classification of individuals into categories of high or low risk of diseases.

Entities:  

Mesh:

Year:  2004        PMID: 15177665     DOI: 10.1016/j.toxlet.2004.02.021

Source DB:  PubMed          Journal:  Toxicol Lett        ISSN: 0378-4274            Impact factor:   4.372


  17 in total

1.  Machine learning for detecting gene-gene interactions: a review.

Authors:  Brett A McKinney; David M Reif; Marylyn D Ritchie; Jason H Moore
Journal:  Appl Bioinformatics       Date:  2006

2.  Patient-centered yes/no prognosis using learning machines.

Authors:  I R König; J D Malley; S Pajevic; C Weimar; H-C Diener; A Ziegler
Journal:  Int J Data Min Bioinform       Date:  2008       Impact factor: 0.667

3.  A support vector machine approach to assess drug efficacy of interferon-alpha and ribavirin combination therapy.

Authors:  Eugene Lin; Yuchi Hwang
Journal:  Mol Diagn Ther       Date:  2008       Impact factor: 4.074

4.  Exposure assessment models for elemental components of particulate matter in an urban environment: A comparison of regression and random forest approaches.

Authors:  Cole Brokamp; Roman Jandarov; M B Rao; Grace LeMasters; Patrick Ryan
Journal:  Atmos Environ (1994)       Date:  2016-12-01       Impact factor: 4.798

Review 5.  Applications of Support Vector Machine (SVM) Learning in Cancer Genomics.

Authors:  Shujun Huang; Nianguang Cai; Pedro Penzuti Pacheco; Shavira Narrandes; Yang Wang; Wayne Xu
Journal:  Cancer Genomics Proteomics       Date:  2018 Jan-Feb       Impact factor: 4.069

6.  A multicenter study of diet quality on birth weight and gestational age in infants of HIV-infected women.

Authors:  Tracie L Miller; Denise L Jacobson; Gabriel Somarriba; Daniela Neri; Joy Kurtz-Vraney; Patricia Graham; Matthew W Gillman; David C Landy; Suzanne Siminski; Laurie Butler; Kenneth C Rich; Kristy Hendricks; David A Ludwig
Journal:  Matern Child Nutr       Date:  2016-11-08       Impact factor: 3.092

7.  Identification of type 2 diabetes-associated combination of SNPs using support vector machine.

Authors:  Hyo-Jeong Ban; Jee Yeon Heo; Kyung-Soo Oh; Keun-Joon Park
Journal:  BMC Genet       Date:  2010-04-23       Impact factor: 2.797

8.  Gene selection and classification of microarray data using random forest.

Authors:  Ramón Díaz-Uriarte; Sara Alvarez de Andrés
Journal:  BMC Bioinformatics       Date:  2006-01-06       Impact factor: 3.169

9.  Screening large-scale association study data: exploiting interactions using random forests.

Authors:  Kathryn L Lunetta; L Brooke Hayward; Jonathan Segal; Paul Van Eerdewegh
Journal:  BMC Genet       Date:  2004-12-10       Impact factor: 2.797

10.  A novel similarity-measure for the analysis of genetic data in complex phenotypes.

Authors:  Vincenzo Lagani; Alberto Montesanto; Fausta Di Cianni; Victor Moreno; Stefano Landi; Domenico Conforti; Giuseppina Rose; Giuseppe Passarino
Journal:  BMC Bioinformatics       Date:  2009-06-16       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.