| Literature DB >> 29431517 |
Anna Okula Basile1, Marylyn DeRiggi Ritchie1,2.
Abstract
INTRODUCTION: For the past decade, the focus of complex disease research has been the genotype. From technological advancements to the development of analysis methods, great progress has been made. However, advances in our definition of the phenotype have remained stagnant. Phenotype characterization has recently emerged as an exciting area of informatics and machine learning. The copious amounts of diverse biomedical data that have been collected may be leveraged with data-driven approaches to elucidate trait-related features and patterns. Areas covered: In this review, the authors discuss the phenotype in traditional genetic associations and the challenges this has imposed.Approaches for phenotype refinement that can aid in more accurate characterization of traits are also discussed. Further, the authors highlight promising machine learning approaches for establishing a phenotype and the challenges of electronic health record (EHR)-derived data. Expert commentary: The authors hypothesize that through unsupervised machine learning, data-driven approaches can be used to define phenotypes rather than relying on expert clinician knowledge. Through the use of machine learning and an unbiased set of features extracted from clinical repositories, researchers will have the potential to further understand complex traits and identify patient subgroups. This knowledge may lead to more preventative and precise clinical care.Entities:
Keywords: Cluster analysis; complex traits; dimensionality reduction; electronic health records (EHRs); heterogeneity; machine learning; missing data; phenotype; topological analysis; unsupervised analysis
Mesh:
Year: 2018 PMID: 29431517 PMCID: PMC6080627 DOI: 10.1080/14737159.2018.1439380
Source DB: PubMed Journal: Expert Rev Mol Diagn ISSN: 1473-7159 Impact factor: 5.225