| Literature DB >> 33795796 |
Chao-Yu Guo1,2, Reng-Hong Wang3,4, Hsin-Chou Yang5.
Abstract
After the genome-wide association studies (GWAS) era, whole-genome sequencing is highly engaged in identifying the association of complex traits with rare variations. A score-based variance-component test has been proposed to identify common and rare genetic variants associated with complex traits while quickly adjusting for covariates. Such kernel score statistic allows for familial dependencies and adjusts for random confounding effects. However, the etiology of complex traits may involve the effects of genetic and environmental factors and the complex interactions between genes and the environment. Therefore, in this research, a novel method is proposed to detect gene and gene-environment interactions in a complex family-based association study with various correlated structures. We also developed an R function for the Fast Gene-Environment Sequence Kernel Association Test (FGE-SKAT), which is freely available as supplementary material for easy GWAS implementation to unveil such family-based joint effects. Simulation studies confirmed the validity of the new strategy and the superior statistical power. The FGE-SKAT was applied to the whole genome sequence data provided by Genetic Analysis Workshop 18 (GAW18) and discovered concordant and discordant regions compared to the methods without considering gene by environment interactions.Entities:
Year: 2021 PMID: 33795796 PMCID: PMC8016937 DOI: 10.1038/s41598-021-86871-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Descriptive Statistics of phenotype data.
| Variable | Exam 1 | Exam 2 | Exam 3 | Exam 6 |
|---|---|---|---|---|
| N | 809 | 578 | 594 | 231 |
| Year of exam | 1992–1996 | 1997–2000 | 1998–2006 | 2009–2011 |
| Mean age at exam (range) | 39.4 (16–94) | 42.6 (17–97) | 46.5 (18–95) | 50.9 (30–81) |
| Mean SBP (range) | 122 (80–216) | 125 (90–211) | 125 (76–220) | 128 (93–233) |
| Mean DBP (range) | 71 (40–123) | 72 (43–115) | 71 (32–108) | 78 (46–126) |
| Antihypertensive medication (%) | 10.05 | 19.37 | 28.76 | 43.67 |
| Hypertension (%) | 18.00 | 29.58 | 36.58 | 52.38 |
| Smoking status (%) | 22.79 | 15.92 | 18.86 | 11.26 |
The sample size of each family by sex.
| Pedigree number | Sex | Individual | Pedigree number | Sex | Individual | ||
|---|---|---|---|---|---|---|---|
| 2 | Female | 53 | 107 | 14 | Female | 30 | 60 |
| Male | 54 | Male | 30 | ||||
| 3 | Female | 46 | 98 | 15 | Female | 24 | 57 |
| Male | 52 | Male | 33 | ||||
| 4 | Female | 46 | 97 | 16 | Female | 32 | 59 |
| Male | 51 | Male | 27 | ||||
| 5 | Female | 48 | 91 | 17 | Female | 28 | 57 |
| Male | 43 | Male | 29 | ||||
| 6 | Female | 44 | 88 | 20 | Female | 26 | 51 |
| Male | 44 | Male | 25 | ||||
| 7 | Female | 37 | 89 | 21 | Female | 22 | 50 |
| Male | 52 | Male | 28 | ||||
| 8 | Female | 38 | 84 | 23 | Female | 18 | 46 |
| Male | 46 | Male | 28 | ||||
| 9 | Female | 45 | 81 | 25 | Female | 21 | 44 |
| Male | 36 | Male | 23 | ||||
| 10 | Female | 41 | 83 | 27 | Female | 24 | 44 |
| Male | 42 | Male | 20 | ||||
| 11 | Female | 39 | 76 | 47 | Female | 11 | 27 |
| Male | 37 | Male | 16 | ||||
Permutation studies for Type-I errors.
| Alpha | Chr | Window 1 | Window 2 | Window 3 | Window 4 | ||||
|---|---|---|---|---|---|---|---|---|---|
| FFBSKAT | FGE-SKAT | FFBSKAT | FGE-SKAT | FFBSKAT | FGE-SKAT | FFBSKAT | FGE-SKAT | ||
| 0.05 | 1 | 0.056 | 0.045 | 0.056 | 0.054 | 0.044 | 0.054 | 0.048 | 0.052 |
| 0.05 | 3 | 0.045 | 0.048 | 0.054 | 0.049 | 0.046 | 0.054 | 0.044 | 0.0510 |
| 0.05 | 5 | 0.032 | 0.044 | 0.034 | 0.045 | 0.04 | 0.053 | 0.041 | 0.044 |
| 0.01 | 1 | 0.012 | 0.01 | 0.009 | 0.01 | 0.006 | 0.009 | 0.003 | 0.006 |
| 0.01 | 3 | 0.007 | 0.009 | 0.009 | 0.01 | 0.011 | 0.014 | 0.011 | 0.013 |
| 0.01 | 5 | 0.009 | 0.01 | 0.006 | 0.009 | 0.01 | 0.012 | 0.007 | 0.011 |
The first column, "Alpha" represents the nominal significance level and the second column, "Chr." represents the chromosome number.
Simulations for statistical power.
| Model | Scenario | Window 1 | Window 2 | Window 3 | Window 4 | ||||
|---|---|---|---|---|---|---|---|---|---|
| FFBSKAT | FGE-SKAT | FFBSKAT | FGE-SKAT | FFBSKAT | FGE-SKAT | FFBSKAT | FGE-SKAT | ||
| Dom | 120,180,120,180 | 0.14 | 0.03 | 0.624 | 0.295 | 0.999 | 0.992 | 0.985 | 0.92 |
| Dom | 120,120,180,180 | 0.049 | 0.053 | 0.05 | 0.046 | 0.054 | 0.056 | 0.05 | 0.051 |
| Dom | 120,120,120,180 | 0.027 | 0.367 | 0.025 | 0.462 | 0.013 | 0.172 | 0.008 | 0.06 |
| Dom | 120,120,150,180 | 0.057 | 0.187 | 0.061 | 0.189 | 0.033 | 0.078 | 0.025 | 0.046 |
| Add | 120,180,120,180 | 0.151 | 0.036 | 0.617 | 0.287 | 0.996 | 0.992 | 0.98 | 0.913 |
| Add | 120,120,180,180 | 0.054 | 0.048 | 0.05 | 0.047 | 0.04 | 0.034 | 0.043 | 0.042 |
| Add | 120,120,120,180 | 0.018 | 0.341 | 0.02 | 0.461 | 0.006 | 0.178 | 0.006 | 0.049 |
| Add | 120,120,150,180 | 0.041 | 0.191 | 0.035 | 0.188 | 0.023 | 0.066 | 0.021 | 0.045 |
| Rec | 120,180,120,180 | 1 | 1 | 1 | 1 | 0.019 | 1 | 0.019 | 1 |
| Rec | 120,120,180,180 | 0.056 | 0.048 | 0.05 | 0.05 | 0.049 | 0.047 | 0.049 | 0.045 |
| Rec | 120,120,120,180 | 0.99 | 1 | 1 | 1 | 0.005 | 1 | 0.005 | 1 |
| Rec | 120,120,150,180 | 0.746 | 1 | 1 | 1 | 0.022 | 1 | 0.022 | 1 |
| Rec | 120,140,120,140 | 0.999 | 0.999 | 1 | 1 | 0.059 | 0.989 | 0.059 | 0.988 |
| Rec | 120,120,140,140 | 0.036 | 0.038 | 0.033 | 0.037 | 0.037 | 0.041 | 0.037 | 0.04 |
| Rec | 120,120,120,140 | 0.458 | 0.964 | 0.999 | 1 | 0.028 | 0.961 | 0.028 | 0.959 |
| Rec | 120,120,130,140 | 0.16 | 0.0457 | 0.721 | 0.999 | 0.04 | 0.0471 | 0.04 | 0.0469 |
The four numbers listed in the scenarios column are the four means of the normally distributed SBP with a standard deviation of 10 for four groups (non-smokers without the SNV, non-smokers with the SNV, smokers without the SNV, and smokers with the SNV).
The most significant genes identified by both methods for normalized DBP.
| Chromosome | FBSKAT | FGE-SKAT | ||
|---|---|---|---|---|
| Gene_Seq | UniGene | Gene_Seq | UniGene | |
| Chr3 | LOC105374165 | 0 | LOC105374165 | 0 |
| Chr5 | – | – | 0 | 0 |
| Chr7 | CACNA2D1 | CACNA2D1 | CACNA2D1 | CACNA2D1 |
| Chr9 | – | – | 0 | 0 |
| Chr13 | – | – | 0 | 0 |
Figure 1Manhattan plot for normalized DBP.
Top 10 smallest p-values for normalized DBP and SBP.
| CHR | Trait | Genomic region | FFBSKAT P-value | FGE-SKAT P-value |
|---|---|---|---|---|
| 5 | DBP | 50,274,970–50,279,007 | 3.79246E−06 | 3.79246E−06 |
| 5 | DBP | 50,318,550–50,318,624 | 1.15102E−06 | 1.15102E−06 |
| 5 | DBP | 50,319,835–50,323,204 | 8.42894E−06 | 8.42894E−06 |
| 7 | DBP | 18,473,528–18,478,318 | 8.49578E−06 | 8.49578E−06 |
| 7 | DBP | 18,475,056–18,479,387 | 5.29517E−06 | 5.29517E−06 |
| 7 | DBP | 132,160,189–132,163,619 | 9.36002E−06 | 9.36002E−06 |
| 9 | DBP | 133,316,470–133,319,343 | 9.65182E−06 | 9.65182E−06 |
| 11 | DBP | 77,592,371–77,595,131 | 5.82584E−06 | 5.82584E−06 |
| 11 | DBP | 77,593,756–77,596,971 | 4.43435E−06 | 4.43435E−06 |
| 11 | DBP | 82,432,685–82,435,840 | 6.19042E−06 | 6.19042E−06 |
| 7 | SBP | 139,953,680–139,955,405 | 1.82164E−07 | 1.66733E−07 |
| 7 | SBP | 139,954,850–139,956,291 | 9.09435E−08 | 9.09435E−08 |
| 7 | SBP | 139,959,269–139,961,904 | 1.83966E−07 | 1.82164E−07 |
| 7 | SBP | 142,258,881–142,262,340 | 7.67391E−07 | 7.67391E−07 |
| 7 | SBP | 143,609,266–143,611,641 | 6.65998E−07 | 6.65998E−07 |
| 7 | SBP | 145,963,864–145,967,584 | 1.87147E−07 | 1.83966E−07 |
| 7 | SBP | 146,913,363–146,918,656 | 5.85388E−07 | 5.85388E−07 |
| 7 | SBP | 146,916,497–146,920,752 | 2.1425E−07 | 2.1425E−07 |
| 7 | SBP | 146,918,731–146,922,053 | 2.5671E−07 | 2.5671E−07 |
| 7 | SBP | 148,901,779–148,904,624 | 4.85975E−07 | 4.85975E−07 |
Figure 2Manhattan plot for normalized SBP.