| Literature DB >> 22546560 |
Abstract
Random forests (RF) is a popular tree-based ensemble machine learning tool that is highly data adaptive, applies to "large p, small n" problems, and is able to account for correlation as well as interactions among features. This makes RF particularly appealing for high-dimensional genomic data analysis. In this article, we systematically review the applications and recent progresses of RF for genomic data, including prediction and classification, variable selection, pathway analysis, genetic association and epistasis detection, and unsupervised learning.Entities:
Mesh:
Year: 2012 PMID: 22546560 PMCID: PMC3387489 DOI: 10.1016/j.ygeno.2012.04.003
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736