| Literature DB >> 30275897 |
Arthur Porto1,2, Juan M Peralta1, Nicholas B Blackburn1,3, John Blangero1.
Abstract
Genome-wide association studies have helped us identify a wealth of genetic variants associated with complex human phenotypes. Because most variants explain a small portion of the total phenotypic variation, however, marker-based studies remain limited in their ability to predict such phenotypes. Here, we show how modern statistical genetic techniques borrowed from animal breeding can be employed to increase the accuracy of genomic prediction of complex phenotypes and the power of genetic mapping studies. Specifically, using the triglyceride data of the GAW20 data set, we apply genomic-best linear unbiased prediction (G-BLUP) methods to obtain empirical genetic values (EGVs) for each triglyceride phenotype and each individual. We then study 2 different factors that influence the prediction accuracy of G-BLUP for the analysis of human data: (a) the choice of kinship matrix, and (b) the overall level of relatedness. The resulting genetic values represent the total genetic component for the phenotype of interest and can be used to represent a trait without its environmental component. Finally, using empirical data, we demonstrate how this method can be used to increase the power of genetic mapping studies. In sum, our results show that dense genome-wide data can be used in a wider scope than previously anticipated.Entities:
Year: 2018 PMID: 30275897 PMCID: PMC6157117 DOI: 10.1186/s12919-018-0138-5
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Fig. 1Reliability in EGV estimates using both (a) pre- and (b) posttreatment TG levels, when comparing across the different kinship matrices (Pedigree, LDAK, IBDLD). Distributions are illustrated using kernel density estimates (KDEs), as implemented in the ggplot2 R package
Fig. 2Local regressions of the accuracy in individual estimates of EGVs on the number of SDRs, when using both (a) pre- and (b) posttreatment TG levels, as well as different kinship matrices (Pedigree, LDAK, IBDLD)
Note the closer fit of empirically derived EGVs when compared to pedigree-based EGVs
Fig. 3Linear regressions of (a) pre- and (b) post-TG LOD scores obtained from EGV-based linkage scans on the traditional linkage scan LODs. Red line indicates the 1:1 line and the blue line indicates the best-fitting regression line. Regression equations are shown in the bottom right corner