| Literature DB >> 31564439 |
Huaying Fang1, Qin Hui2, Julie Lynch3, Jacqueline Honerlaw4, Themistocles L Assimes5, Jie Huang6, Marijana Vujkovic7, Scott M Damrauer8, Saiju Pyarajan9, J Michael Gaziano9, Scott L DuVall10, Christopher J O'Donnell6, Kelly Cho9, Kyong-Mi Chang11, Peter W F Wilson12, Philip S Tsao5, Yan V Sun13, Hua Tang14.
Abstract
Large-scale multi-ethnic cohorts offer unprecedented opportunities to elucidate the genetic factors influencing complex traits related to health and disease among minority populations. At the same time, the genetic diversity in these cohorts presents new challenges for analysis and interpretation. We consider the utility of race and/or ethnicity categories in genome-wide association studies (GWASs) of multi-ethnic cohorts. We demonstrate that race/ethnicity information enhances the ability to understand population-specific genetic architecture. To address the practical issue that self-identified racial/ethnic information may be incomplete, we propose a machine learning algorithm that produces a surrogate variable, termed HARE. We use height as a model trait to demonstrate the utility of HARE and ethnicity-specific GWASs.Entities:
Keywords: biobank; ethnicity-specific trait loci; genetic ancestry; multi-ethnic cohort; self-reported race/ethnicity; stratified analysis; trans-ethnic GWAS
Year: 2019 PMID: 31564439 PMCID: PMC6817526 DOI: 10.1016/j.ajhg.2019.08.012
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.025