| Literature DB >> 28377602 |
Chao Ning1, Huimin Kang1, Lei Zhou1, Dan Wang1, Haifei Wang1, Aiguo Wang1, Jinluan Fu1, Shengli Zhang1, Jianfeng Liu2.
Abstract
Complex traits with multiple phenotypic values changing over time are called longitudinal traits. In traditional genome-wide association studies (GWAS) for longitudinal traits, a combined/averaged estimated breeding value (EBV) or deregressed proof (DRP) instead of multiple phenotypic measurements per se for each individual was frequently treated as response variable in statistical model. This can result in power losses or even inflate false positive rates (FPRs) in the detection due to failure of exploring time-dependent relationship among measurements. Aiming at overcoming such limitation, we developed two random regression-based models for functional GWAS on longitudinal traits, which could directly use original time-dependent records as response variable and fit the time-varied Quantitative Trait Nucleotide (QTN) effect. Simulation studies showed that our methods could control the FPRs and increase statistical powers in detecting QTN in comparison with traditional methods where EBVs, DRPs or estimated residuals were considered as response variables. Besides, our proposed models also achieved reliable powers in gene detection when implementing into two real datasets, a Chinese Holstein Cattle data and the Genetic Analysis Workshop 18 data. Our study herein offers an optimal way to enhance the power of gene detection and further understand genetic control of developmental processes for complex longitudinal traits.Entities:
Mesh:
Year: 2017 PMID: 28377602 PMCID: PMC5428860 DOI: 10.1038/s41598-017-00638-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The type I errors (false positive rates, FPRs) of different GWAS models for the simulated data at the tabulated thresholds of p = 0.05 and p = 0.01 for the simulation study.
Figure 2The powers of different GWAS models with alternative QTN heritabilities at tabulated thresholds of p = 0.01 and p = 0.05 for the simulation study.
Figure 3Comparison of p-values () using different GWAS models at QTN heritability of 1.0% for the simulation study. Scatterplots of for any two GWAS models were shown at the upper triangular, with Pearson correlation coefficients listed at the lower triangular. The read lines represented regression line y = x; the blue lines were the lines of best fit for of each two models.
Means, standard deviations (SD), and root-mean-square errors (RMSE) of estimated cumulative additive genetic effect of the QTN for different GWAS models with various QTN heritabilities in the simulation study.
| Models |
| |||||||
|---|---|---|---|---|---|---|---|---|
| 0.1% | 0.5% | 1% | 2% | |||||
| Mean ± SD | RMSE | Mean ± SD | RMSE | Mean ± SD | RMSE | Mean ± SD | RMSE | |
| fGWAS-C | 179.45 ± 135.33 | 135.33 | 175.9 ± 63.54 | 63.52 | 176.97 ± 48.09 | 48.10 | 174.76 ± 34.17 | 34.15 |
| fGWAS-F | 182.87 ± 142.99 | 143.12 | 177.5 ± 63.55 | 63.56 | 175.04 ± 47.53 | 47.50 | 176.1 ± 34.26 | 34.26 |
| GWAS-EBV-P | 29.17 ± 41.67 | 151.87 | 34.97 ± 16.59 | 141.22 | 35.84 ± 11.62 | 139.86 | 35.19 ± 7.28 | 140.21 |
| GWAS-EBV-NP | 71.52 ± 114.81 | 154.66 | 67.92 ± 52.39 | 119.39 | 69.81 ± 37.35 | 111.82 | 69.73 ± 24.71 | 108.34 |
| GWAS-DRP-P | 71.65 ± 109.83 | 150.91 | 97.95 ± 48.75 | 91.34 | 108.85 ± 37.93 | 76.43 | 119.37 ± 27.21 | 62.11 |
| GWAS-DRP-NP | 108.00 ± 147.73 | 162.24 | 121.61 ± 69.30 | 87.58 | 132.73 ± 52.31 | 67.37 | 143.59 ± 36.18 | 48.04 |
| GWAS-Residual | 1.46 ± 1.08 | 173.75 | 6.40 ± 2.30 | 168.82 | 11.74 ± 3.25 | 163.50 | 21.53 ± 4.33 | 153.74 |
Figure 4The plots of average additive genetic effect curves predicted by the fGWAS-C and fGWAS-F models against the simulated true curves with alternative QTN heritabilities for the simulation study.
Figure 5Manhattan plots of p-values for milk yield (MY), fat percentage (FP), and protein percentage (PP) by the fGWAS-C and fGWAS-F model for the Chinese Holstein cattle data. Chromosomes 1–29 were shown with black and grey intervals. The red horizontal lines indicated the genome-wise significance level of −log10(6.98 × 10−7) and SNPs above the lines were highlighted in green.
Figure 6Manhattan plots of p values for systolic blood pressure (SBP) and diastolic blood pressure (DBP) by the fGWAS-F model for the GAW18 data. Odd numbered autosomes were shown with black and grey intervals. The significant SNPs (q values < 0.05) were highlighted in green.
The characters of fGWAS-C, fGWAS-F, GWAS-EBV-P, GWAS-EBV-NP, GWAS-DRP-P, GWAS-DRP-NP, and GWAS-Residual models.
| Models | Response variable | SNP effect time dependence | SNP effect modeling | polygenetic effects (fit or not) |
|---|---|---|---|---|
| fGWAS-C | longitudinal records | time-dependent | covariate | YES |
| fGWAS-F | longitudinal records | time-dependent | factor | YES |
| GWAS-EBV-P | EBVs | time-independent | covariate | YES |
| GWAS-EBV-NP | EBVs | time-independent | covariate | NO |
| GWAS-DRP-P | DRPs | time-independent | covariate | YES |
| GWAS-DRP-NP | DRPs | time-independent | covariate | NO |
| GWAS-Residual | estimated residuals | time-independent | covariate | NO |