| Literature DB >> 27128317 |
Tom G Richardson1, Colin Campbell2, Nicholas J Timpson1, Tom R Gaunt1.
Abstract
BACKGROUND: The success of collapsing methods which investigate the combined effect of rare variants on complex traits has so far been limited. The manner in which variants within a gene are selected prior to analysis has a crucial impact on this success, which has resulted in analyses conventionally filtering variants according to their consequence. This study investigates whether an alternative approach to filtering, using annotations from recently developed bioinformatics tools, can aid these types of analyses in comparison to conventional approaches. METHODS &Entities:
Mesh:
Year: 2016 PMID: 27128317 PMCID: PMC4851421 DOI: 10.1371/journal.pone.0154181
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Results of gene-level low frequency variant association tests using various variant filters (MAF ≤ 5%).
| Nonsynonymous variants | Loss-of-Function variants | CADD variants (C-Score ≥ 15) | |||||
|---|---|---|---|---|---|---|---|
| Gene | Lipid trait | nVars | P-value | nVars | P-value | nVars | P-value |
| HDL | 30 | 0.85 | 3 | 0.81 | 57 | 0.18 | |
| HDL | 20 | 0.54 | 4 | 0.13 | 12 | 0.81 | |
| HDL | 11 | 0.17 | 2 | 0.98 | 5 | 0.02 | |
| HDL | 2 | 0.89 | |||||
| HDL | 23 | 0.15 | 5 | 0.28 | 29 | 0.80 | |
| HDL | 15 | 0.11 | 4 | 0.37 | |||
| LDL | 21 | 0.21 | 3 | 0.09 | 11 | 0.82 | |
| LDL | 36 | 0.28 | 7 | ||||
| LDL | 16 | 0.86 | 5 | 0.77 | 3 | 0.39 | |
| LDL | 2 | 0.08 | 8 | 0.12 | |||
| LDL | 43 | 0.93 | 10 | 0.94 | 11 | 0.29 | |
| TG | 11 | 0.25 | |||||
| TG | 20 | 0.83 | 4 | 0.51 | 12 | 0.56 | |
nVars = number of variants analysed, HDL = High Density Lipoproteins, LDL = Low Density Lipoproteins, TG = Triglycerides, MAF = Minor Allele Frequency. Results in bold represent p-values ≤ 0.05. No multiple testing threshold was applied to the results of this analysis as the purpose was to compare filtering approaches. All p-values were calculated using SKAT.
Results of gene-level rare variant association tests using various variant filters (MAF ≤ 1%).
| Nonsynonymous variants | Loss-of-Function variants | CADD variants (C-Score ≥ 15) | |||||
|---|---|---|---|---|---|---|---|
| Gene | Lipid trait | nVars | P-value | nVars | P-value | nVars | P-value |
| HDL | 27 | 0.32 | 2 | 0.40 | 54 | 0.37 | |
| HDL | 18 | 0.12 | 4 | 0.13 | 11 | 0.54 | |
| HDL | 11 | 0.17 | 2 | 0.98 | |||
| HDL | 15 | 0.07 | 2 | 0.89 | |||
| HDL | 21 | 0.32 | 5 | 0.28 | 28 | 0.60 | |
| HDL | 14 | 0.10 | 3 | 0.28 | |||
| LDL | 19 | 0. 70 | 2 | 0.72 | 11 | 0.82 | |
| LDL | 32 | 0.16 | 7 | ||||
| LDL | 12 | 0.57 | 4 | 0.43 | 2 | 0.26 | |
| LDL | 1 | N/A | 8 | 0.13 | |||
| LDL | 41 | 0.92 | 9 | 0.93 | 11 | 0.28 | |
| TG | 11 | 0.25 | |||||
| TG | 18 | 0.45 | 4 | 0.51 | 11 | 0.26 | |
nVars = number of variants analysed, HDL = High Density Lipoproteins, LDL = Low Density Lipoproteins, TG = Triglycerides, MAF = Minor Allele Frequency. Results in bold represent p-values ≤ 0.05. No multiple testing threshold was applied to the results of this analysis as the purpose was to compare filtering approaches. All p-values were calculated using SKAT.
Fig 1Hexbin plots representing gene-based SKAT analyses for all genes across the genome using a MAF cutoff of 5% with 4 lipid traits.
The x-axis represents the–log10 transformed p-value from the analysis after filtering according to CADD annotations. The y-axis represents the–log10 transformed p-value from the analysis after filtering according to ‘nonsynonymous’ annotations. Only gene regions which had at least 2 variants in them after filtering by both methods were plotted.
Fig 2Hexbin plots representing gene-based SKAT analyses for all genes across the genome using a MAF cutoff of 1% with 4 lipid traits.
The x-axis represents the–log10 transformed p-value from the analysis after filtering according to CADD annotations. The y-axis represents the–log10 transformed p-value from the analysis after filtering according to ‘nonsynonymous’ annotations. Only gene regions which had at least 2 variants in them after filtering by both methods were plotted.