| Literature DB >> 35041074 |
Qingbo S Wang1,2,3,4, Hailiang Huang5,6,7.
Abstract
Although genome-wide association studies (GWAS) have identified thousands of loci in the human genome that are associated with different traits, understanding the biological mechanisms underlying the association signals identified in GWAS remains challenging. Statistical fine-mapping is a method aiming to refine GWAS signals by evaluating which variant(s) are truly causal to the phenotype. Here, we review the types of statistical fine-mapping methods that have been widely used to date, with a focus on recently developed functionally informed fine-mapping (FIFM) methods that utilize functional annotations. We then systematically review the applications of statistical fine-mapping in autoimmune disease studies to highlight the value of statistical fine-mapping in biological contexts.Entities:
Keywords: Autoimmune disorders; Bayesian; Functionally informed fine-mapping; IBD genetics; Inflammatory bowel diseases; Statistical fine-mapping
Mesh:
Year: 2022 PMID: 35041074 PMCID: PMC8837575 DOI: 10.1007/s00281-021-00902-8
Source DB: PubMed Journal: Semin Immunopathol ISSN: 1863-2297 Impact factor: 9.623
Fig. 1Schematic overview of the statistical fine-mapping methods with uniform or functionally informed prior, in comparison with direct experimental approaches. a Downstream experiments following GWAS without statistical fine-mapping often assume the variant with the most significant p-value (“lead” variant) as the causal variant and proceed to perturbation of the lead variant and/or nearby gene(s). b Statistical fine-mapping is utilized to prioritize a small number of variants for downstream perturbation, which can be different from what p-value in GWAS suggests. This facilitates variant-level interpretation of GWAS results. c In a functionally informed fine-mapping (FIFM) framework, functional annotations are used (often together with the GWAS data) to form a prior. FIFM often results in an increase of power in prioritizing putative causal variants, which is typically characterized by higher maximum posterior inclusion probability (PIP) and/or lower credible set size[14]. The functional annotations used to form the prior are often directly used to interpret the biological mechanisms of causal variant(s)
Fig. 2Two simplified examples where marginal p-value fails to prioritize the true causal variants. a The non-causal variant (center), frequently tagging one of the two true causal variants, has the most significant association p-value (in F-test) as well as the highest marginal effect size (7.8 vs 5.8 and 5.3). b Two nearby causal variants in LD harboring high true effect sizes to the opposite direction, both have limited marginal association p-values that do not reach the statistical significance under multiple test correction (p = 0.06 and 0.007). Synthetic samples of n = 300 for a and n = 200 for b were generated, with true = 10 and drawn from a normal distribution with SD = 5 for simplicity (therefore, y axis has no unit). r = 0.317, 0.317, and 0.0353 for a and 0.734 for b The code is
available at http://github.com/QingboWang/fm-toy
GWAS and fine-mapping analyses across ten autoimmune disorders. All studies were performed on European subjects except for SLE, which combined European and East Asian subjects in the fine-mapping analysis
| Disorder | Abbreviation | Heritability (CIs) (c) | GWAS | Fine-mapping | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| # case | # loci | PMID | # case | # loci | 1-SNP set | 5-SNP set | Method | PMID | |||
| Ankylosing spondylitis | AS | 0.97 (0.92–0.99) | 10,417 | 48 | 26974007 | 10,619 | 28 | 0 | - | PICS | 25363779 |
| Autoimmune thyroid disease | THY | 0.79 | 30,234 | 93 | 32581359 | 2,747 | 10 | 0 | - | PICS | 25363779 |
| Celiac disease | CeD | 0.57 (0.32–0.93) | 12,041 | 40 | 22057235 | 12,041 | 40 | 1 | - | PICS | 25363779 |
| Inflammatory bowel diseases—Crohn’s disease (a) | IBD—CD | 1.00 (0.34–1.00) | 25,042 | 240 | 28067908 | 20,155 | 94 | 18 | 42 | - | 28658209 |
| Inflammatory bowel diseases—Ulcerative colitis (a) | IBD—UC | 0.67 ± 0.13 | 15,191 | ||||||||
| Multiple sclerosis | MS | 0.25 (0.00–0.88) | 47,429 | 233 | 31604244 | 14,498 | 87 | 2 | - | PICS | 25363779 |
| Psoriasis | PSOR | 0.66 (0.52–0.77) | 19,032 | 63 | 28537254 | 10,588 | 36 | 7 | - | PICS | 25363779 |
| Rheumatoid arthritis | RA | 0.68 (0.55–0.79) | 22,628 | 121 | 33310728 | 11,475 | 46 | 0 | 5 | ABF | 30224649 |
| Systemic lupus erythematosus | SLE | 0.66 | 11,283 | 132 | 33536424 | 11,283 | 132 | 5 | 17 | PAINTOR | 33536424 |
| Type 1 diabetes (b) | T1D | 0.88 (0.78–0.94) | 11,644 | 51 | 25751624 | 9,334 | 49 | 1 | 10 | ABF | 30224649 |
aCD and UC are two subtypes of IBD and are often analyzed together for their extensively shared genetic architecture
bThe GWAS study included both case–control and family samples
cHeritability estimates compiled from multiple sources with detailed provided in Maria Gutierrez-Arcelus et al. Nature Reviews Genetics 2016 (PMID: 26907721)
Putative causal variants with PIP > 95% for autoimmune disorders. See Table 1 for trait abbreviations
| Trait | Variant | Gene | Function | PIP |
|---|---|---|---|---|
| CD | rs2066844 | R702W | 99.9% | |
| CD | rs2066845 | G908R | 99.9% | |
| CD | rs5743293 | Fs1007insC | 99.9% | |
| CD | rs61839660 | Intronic | 99.9% | |
| CD | rs7307562 | Intronic | 99.9% | |
| CD | rs5743271 | N289S | 99.3% | |
| CD | rs72796367 | Intronic | 98.3% | |
| CD | rs41313262 | V362I | 97.3% | |
| CD | rs28701841 | Intergenic | 97.1% | |
| UC | rs6017342 | Intergenic | 99.9% | |
| UC | rs35667974 | I923V | 99.4% | |
| UC | rs4676408 | Intergenic | 99.4% | |
| IBD | rs6062496 | Intronic | 99.6% | |
| IBD | rs141992399 | 1434 + 1G > C | 99.5% | |
| IBD | rs74465132 | Intergenic | 99.4% | |
| IBD | rs10748781 | Intergenic | 99.0% | |
| IBD | rs35874463 | I170V | 98.9% | |
| IBD | rs1887428 | Intergenic | 97.4% | |
| SLE | rs2736100 | TERT | Intronic | 100.0% |
| SLE | rs2431697 | Intergenic | 99.9% | |
| SLE | rs2297550 | TF binding site | 99.7% | |
| SLE | rs7097397 | Arg1816Gln | 99.3% | |
| SLE | rs2205960 | Intergenic | 95.7% | |
| T1D | rs34536443 | P1104A | 100.0% | |
| MS | rs533259 | Intronic | 100.0% | |
| MS | rs733724 | Intronic | 98.0% | |
| PSOR | rs17716942 | Intronic | 100.0% | |
| PSOR | rs12188300 | Intergenic | 100.0% | |
| PSOR | rs33980500 | D10N | 100.0% | |
| PSOR | rs11795343 | Intronic | 99.7% | |
| PSOR | rs8016947 | Intergenic | 100.0% | |
| PSOR | rs28998802 | Intronic | 100.0% | |
| PSOR | rs34536443 | P1104A | 99.6%% | |
| CeD | rs1893592 | Intronic | 98.0% |