Literature DB >> 22550652

Multiple imputation and random forests (MIRF) for unobservable, high-dimensional data.

Bareng A S Nonyane1, Andrea S Foulkes.   

Abstract

Understanding the genetic underpinnings to complex diseases requires consideration of sophisticated analytical methods designed to uncover intricate associations across multiple predictor variables. At the same time, knowledge of whether single nucleotide polymorphisms within a gene are on the same (in cis) or on different (in trans) chromosomal copies, may provide crucial information about measures of disease progression. In association studies of unrelated individuals, allelic phase is generally unobservable, generating an additional analytical challenge. In this manuscript, we describe a novel approach that combines multiple imputation and random forests for this high-dimensional, unobservable data setting. An application to a cohort of HIV-1 infected individuals receiving anti-retroviral therapies is presented. A simulation study is also presented to characterize method performance.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 22550652     DOI: 10.2202/1557-4679.1049

Source DB:  PubMed          Journal:  Int J Biostat        ISSN: 1557-4679            Impact factor:   0.968


  2 in total

1.  Identification of genes and haplotypes that predict rheumatoid arthritis using random forests.

Authors:  Rui Tang; Jason P Sinnwell; Jia Li; David N Rider; Mariza de Andrade; Joanna M Biernacka
Journal:  BMC Proc       Date:  2009-12-15

2.  Application of two machine learning algorithms to genetic association studies in the presence of covariates.

Authors:  Bareng A S Nonyane; Andrea S Foulkes
Journal:  BMC Genet       Date:  2008-11-14       Impact factor: 2.797

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.