| Literature DB >> 25519398 |
Ellen E Quillen1, V Saroja Voruganti1, Geetha Chittoor1, Rohina Rubicz1, Juan M Peralta2, Marcio Aa Almeida1, Jack W Kent1, Vincent P Diego1, Thomas D Dyer1, Anthony G Comuzzie1, Harald Hh Göring1, Ravindranath Duggirala1, Laura Almasy1, John Blangero1.
Abstract
The concept of breeding values, an individual's phenotypic deviation from the population mean as a result of the sum of the average effects of the genes they carry, is of great importance in livestock, aquaculture, and cash crop industries where emphasis is placed on an individual's potential to pass desirable phenotypes on to the next generation. As breeding or genetic values (as referred to here) cannot be measured directly, estimated genetic values (EGVs) are based on an individual's own phenotype, phenotype information from relatives, and, increasingly, genetic data. Because EGVs represent additive genetic variation, calculating EGVs in an extended human pedigree is expected to provide a more refined phenotype for genetic analyses. To test the utility of EGVs in genome-wide association, EGVs were calculated for 847 members of 20 extended Mexican American families based on 100 replicates of simulated systolic blood pressure. Calculations were performed in GAUSS to solve a variation on the standard Best Linear Unbiased Predictor (BLUP) mixed model equation with age, sex, and the first 3 principal components of sample-wide genetic variability as fixed effects and the EGV as a random effect distributed around the relationship matrix. Three methods of calculating kinship were considered: expected kinship from pedigree relationships, empirical kinship from common variants, and empirical kinship from both rare and common variants. Genome-wide association analysis was conducted on simulated phenotypes and EGVs using the additive measured genotype approach in the SOLAR software package. The EGV-based approach showed only minimal improvement in power to detect causative loci.Entities:
Year: 2014 PMID: 25519398 PMCID: PMC4143678 DOI: 10.1186/1753-6561-8-S1-S66
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Description of GWA results for EGVs and SBP across simulations.
| SBP | EGVped | EGVsnp | EGVseq | |
|---|---|---|---|---|
| Total number of significant (sig) SNPs in 100 simulations | 166 | 134 | 191 | 273 |
| Sig SNPs with minor allele frequency (MAF) <0.05 | 12.0% | 5.2% | 11.5% | 6.6% |
| Sig SNPs with MAF <0.01 | 1.2% | 0.7% | 1.0% | 0.7% |
| Mean number of sig SNPs | 11.9 | 7.8 | 13.9 | 14.1 |
| Median number of sig SNPs | 8 | 2 | 8 | 8 |
| stdev in number of sig SNPs | 15.8 | 15.0 | 18.4 | 19.3 |
| False-discovery rate (FDR) | 7.7% | 5.7% | 6.7% | 7.7% |
| FDR for SNPs not seen in SBP | - | 52.1% | 50.0% | 40.4% |
| Smaller average | - | 30.0% | 57.0% | 48.0% |
| Simulations identifying more SNPs than SBP | - | 14 | 39 | 36 |
| Simulations identifying fewer SNPs than SBP | - | 77 | 18 | 30 |
| Simulations identifying same number of SNPs as SBP | - | 9 | 43 | 34 |
For 100 replicates of simulated SBP, GWA was performed on the raw data, EGVped, EGVsnp, and EGVseq. The following table gives descriptive statistics for significant SNPs (p < 5 × 10-8) by method. The last five rows illustrate the performance of the EGV method relative to the raw SBP GWA.
Significant results for GWA of EGVs and SBP by gene.
| Gene | Chr | SBP | EGVped | EGVsnp | EGVseq |
|---|---|---|---|---|---|
| 7 | 4 | 6 | 4 | ||
| 6 | 0 | 8 | 5 | ||
| 1 | 1 | 2 | 1 | ||
| 16 | 16 | 22 | 20 | ||
| 100 | 95 | 100 | 99 | ||
| 100 | 95 | 100 | 99 | ||
| 33 | 14 | 34 | 39 | ||
| 3 | 0 | 3 | 4 | ||
| 2 | 0 | 1 | 1 | ||
| 2 | 1 | 1 | 1 | ||
| 0 | 0 | 1 | 0 | ||
| 0 | 1 | 0 | 1 | ||
| 0 | 1 | 0 | 1 | ||
| 0 | 1 | 0 | 1 | ||
| 0 | 0 | 1 | 1 | ||
| 0 | 1 | 1 | 0 |
For each gene contributing to simulated SBP, the table lists the number of replicates (out of 100) in which at least one significant association was found. Variants in 24 additional genes have small effects on SBP but were never detected and were omitted from the table. Due to extended linkage disequilibrium, more than one gene may be tagged by a single variant; in particular, the associations in DNASE1L3 are likely due to strong LD with major causative gene MAP4.