| Literature DB >> 35110524 |
Gyda Bjornsdottir1, Lilja Stefansdottir2, Gudmar Thorleifsson2, Patrick Sulem2, Kristjan Norland2, Egil Ferkingstad2, Asmundur Oddsson2, Florian Zink2, Sigrun H Lund2, Muhammad S Nawaz2,3, G Bragi Walters2,3, Astros Th Skuladottir2, Sigurjon A Gudjonsson2, Gudmundur Einarsson2, Gisli H Halldorsson2,4, Valgerdur Bjarnadottir5, Gardar Sveinbjornsson2, Anna Helgadottir2, Unnur Styrkarsdottir2, Larus J Gudmundsson2, Ole B Pedersen6,7, Thomas Folkmann Hansen8,9, Thomas Werge7,10,11, Karina Banasik9, Anders Troelsen7,12, Soren T Skou13,14, Lise Wegner Thørner15, Christian Erikstrup16, Kaspar Rene Nielsen17, Susan Mikkelsen16, Ingileif Jonsdottir2,3, Aron Bjornsson18, Ingvar H Olafsson18, Elfar Ulfarsson18, Josep Blondal19, Arnor Vikingsson5, Soren Brunak9, Sisse R Ostrowski7,15, Henrik Ullum20, Unnur Thorsteinsdottir2,3, Hreinn Stefansson2, Daniel F Gudbjartsson2,4, Thorgeir E Thorgeirsson21, Kari Stefansson22,23.
Abstract
Back pain is a common and debilitating disorder with largely unknown underlying biology. Here we report a genome-wide association study of back pain using diagnoses assigned in clinical practice; dorsalgia (119,100 cases, 909,847 controls) and intervertebral disc disorder (IDD) (58,854 cases, 922,958 controls). We identify 41 variants at 33 loci. The most significant association (ORIDD = 0.92, P = 1.6 × 10-39; ORdorsalgia = 0.92, P = 7.2 × 10-15) is with a 3'UTR variant (rs1871452-T) in CHST3, encoding a sulfotransferase enzyme expressed in intervertebral discs. The largest effects on IDD are conferred by rare (MAF = 0.07 - 0.32%) loss-of-function (LoF) variants in SLC13A1, encoding a sodium-sulfate co-transporter (LoF burden OR = 1.44, P = 3.1 × 10-11); variants that also associate with reduced serum sulfate. Genes implicated by this study are involved in cartilage and bone biology, as well as neurological and inflammatory processes.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35110524 PMCID: PMC8810832 DOI: 10.1038/s41467-022-28167-1
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
a) Sequence variants associated with IDD (Ncase = 58,854, Nctrl = 922,958). b) Sequence variants associated with Dorsalgia (Ncase = 119,110, Nctrl = 909,847).
| a) IDD Loci | Position (hg38) | rs name | EAa | OAa | Close gene | Annotation | Frqb | OR (95% CI)c | ||
|---|---|---|---|---|---|---|---|---|---|---|
| chr1:102875460 | rs4907985 | A | T | Downstream | 49.8 | 1.04 (1.03, 1.06) | 3.9E−10 | 0.0044 | ||
| 1q23.5 | chr1:183974675 | rs3010044 | C | A | Intron | 23.8 | 1.05 (1.04, 1.07) | 2.0E−11 | 0.00045 | |
| 1q32.1 | chr1:198841735 | rs71663412 | T | TGA | Indel | 20.3 | 0.95 (0.93, 0.97) | 4.7E−10 | 0.011 | |
| 2p13.3 | chr2:69345897 | rs6722492* | T | C | Splice region | 41.5 | 1.05 (1.04, 1.07) | 2.2E−17 | 2.2E−11 | |
| chr2:70489467 | rs2902345* | T | C | Intron | 45.5 | 1.05 (1.04, 1.07) | 1.1E−17 | 2.6E−10 | ||
| chr4:1171342 | rs11247975* | G | T | Missense | 32.5 | 0.96 (0.94, 0.97) | 6.5E−11 | 6.6E−05 | ||
| chr4:1794909 | rs3135842* | C | G | Intron | 27.1 | 0.95 (0.93, 0.96) | 4.0E−14 | 4.5E−07 | ||
| chr6:34578783 | rs2814982 | T | C | Intergenic | 11.9 | 1.09 (1.07, 1.11) | 2.4E−17 | 1.6E−09 | ||
| 6p21.1 | chr6:44478351 | rs6929734 | G | T | Intergenic | 44.5 | 0.96 (0.94, 0.97) | 2.6E−11 | 0.00059 | |
| 7p21.1 | chr7:19554541 | rs2192477 | G | A | Intergenic | 34.4 | 1.05 (1.04, 1.07) | 5.3E−14 | 1.2E−06 | |
| 7p12.3 | chr7:45988978 | rs1723939 | T | C | Intergenic | 48.2 | 1.06 (1.05, 1.07) | 1.6E−18 | 3.6E−11 | |
| 7q31.32 | chr7:123199913 | rs28364172 | A | G | Stop gained | 0.23 | 1.41 (1.25, 1.60) | 2.5E−08 | 0.0053 | |
| 8q13.2 | chr8:68665402 | rs16934882 | A | C | Intron | 19.9 | 1.06 (1.04, 1.07) | 5.7E−12 | 0.00013 | |
| chr8:129707875 | rs10110842*e | C | T | Regulatory region | 27.4 | 1.07 (1.05, 1.09) | 5.4E−08 | >0.05 | ||
| chr8:129726726 | rs7826493* | G | A | Regulatory region | 20.0 | 0.91 (0.89, 0.92) | 2.8E−16 | 2.6E−22 | ||
| 9q22.32 | chr9:93911476 | rs58723578 | T | C | Intergenic | 10.0 | 1.08 (1.06, 1.10) | 1.4E−11 | 0.00033 | |
| 10p12.1 | chr10:27612430 | rs2637326 | G | T | Intergenic | 51.1 | 0.95 (0.94, 0.96) | 1.1E−15 | 2.6E−08 | |
| chr10:72012903 | rs1871452 | T | A | 3′UTR | 39.1 | 0.92 (0.90, 0.93) | 1.6E−39 | 1.8E−32 | ||
| 10q24.32 | chr10:102868477 | rs7098825 | C | T | Upstream | 10.2 | 0.94 (0.92, 0.96) | 4.1E−09 | 0.047 | |
| 11p15.3 | chr11:13275014 | rs11022742 | C | T | Upstream | 27.3 | 0.95 (0.94, 0.96) | 4.3E−12 | 4.8E−05 | |
| 11p15.2 | chr11:15693077 | rs4757353 | C | T | Intron | 22.4 | 1.06 (1.04, 1.07) | 2.4E−12 | 5.4E−05 | |
| 12p12.1 | chr12:23822285 | rs12310519 | T | C | Intron | 15.7 | 1.11 (1.10, 1.13) | 4.8E−35 | 3.2E−27 | |
| 14q13.3 | chr14:36988829 | rs28487989 | C | T | Intron | 21.1 | 0.95 (0.93, 0.96) | 3.2E−12 | 7.4E−05 | |
| 14q32.13 | chr14:94378610 | rs28929474 | T | C | Missense | 1.83 | 0.87 (0.83, 0.92) | 1.1E−08 | 0.011 | |
| 15q22.33 | chr15:67078168 | rs12901372* | G | C | Intron | 47.0 | 0.94 (0.93, 0.95) | 9.0E−17 | 2.0E−09 | |
| chr15:67083662 | rs4776881*e | C | T | Intron | 45.3 | 1.05 (1.03, 1.06) | 7.2E−08 | >0.05 | ||
| 17q23.3 | chr17:63921519 | rs2040347 | A | G | Upstream | 34.9 | 0.96 (0.94, 0.97) | 9.8E−11 | 0.0011 | |
| chr19:45877067 | rs35318830 | G | T | Downstream | 9.41 | 1.08 (1.05, 1.10) | 7.3E−12 | 8.2E−05 | ||
| 20q11.22 | chr20:35437976 | rs143384 | G | A | 5′UTR | 45.7 | 1.04 (1.03, 1.06) | 1.2E−10 | 0.0014 |
Bold are loci with marker significant in both IDD (Table 1a) and dorsalgia (Table 1b).
aEffect allele (EA) and other allele (OA).
bAverage frequency of effect allele in the four cohorts.
cOR and P value for an inverse-variance weighted meta-analysis of association results for the four cohorts.
dP value after a variant class-specific Bonferroni adjustment[18]. *For this variant the P value and OR presented is adjusted for the effect of the other variant at the locus through conditional analysis.
eSecond signal is not genome-wide significant. Results per cohort are in Supplementary Data 3, 4. Associations of these and correlated variants (r2≥ 0.8) with various traits listed in the GWAS catalog (https://www.ebi.ac.uk/gwas/) are in Supplementary Data 16 (IDD variants) and Supplementary Data 17 (dorsalgia variants).
Fig. 1Manhattan plots showing results for meta-analyses of Intervertebral disc disorders (M51), dorsalgia (M54) and lumbar discectomy (LDHsurg).
The P values (−log 10) from meta-analyses of the studied phenotypes are plotted (y-axis) against their respective positions on each chromosome (x-axis). a Intervertebral disc disorders IDD (M51), additive model (four cohorts; 58,854 cases, 922,958 controls), (b) Dorsalgia (M54), additive model (four cohorts, 119,110 cases, 909,847 controls), and (c) severe lumbar IDD defined by surgery (LDHsurg) (three cohorts; 9188 cases, 780,233 controls). P values are two sided and derived from a likelihood-ratio test. The gray and black dots represent SNPs not reaching genome-wide significance threshold weighted for variant impact[18]. The yellow dots represent genome-wide significant SNPs and the red dots represent genome-wide significant SNPs with moderate or high impact[18] (Methods).
Fig. 2Genes likely to associate with (a) IDD and (b) Dorsalgia.
Sequence variants associated with (a) IDD and (b) Dorsalgia for which functional evidence supports implication of genes in back pain. The variants listed are either protein-coding variants or affect mRNA expression (top cis-eQTL) as depicted by gray boxes (lof loss-of-function) (Supplementary Data 5–8). *Variants also associated in cis with mRNA of other genes (Supplementary Data 7,8). The meta-analyses were performed using logistic regression, the risk (odds ratio OR in yellow) of (a) IDD and (b) Dorsalgia are here shown for the risk-increasing allele and significance in blue. COL11A1 and GSDMC are not included in the figure as evidence for their association with back pain was derived differently as described in results.
Variants associating with LDHsurg in GWAS meta-analysis of three cohorts; Iceland, UK Biobank and Finland (Ncases = 9188, Ncontrols = 780,323) compared to association with IDD (M51) in all four cohorts.
| Loci | rs name | EAa | Close gene | Annotation | Frqb | ORLDHsurg (95% CI)c | OR IDD (95% CI)c | ||
|---|---|---|---|---|---|---|---|---|---|
| 8q24.21 | rs7833174 | C | Regulatory region | 23.4 | 0.85 (0.82, 0.88) | 2.1 × 10−16 | 0.96 (0.95, 0.97) | 7.2 × 10−14 | |
| 10q22.1 | rs4148948 | G | 3′UTR | 38.2 | 0.88 (0.85, 0.91) | 4.1 × 10−14 | 0.92 (0.90, 0.93) | 1.6 × 10−39 | |
| 7p12.3 | rs1723939 | T | Regulatory region | 50.2 | 1.12 (1.08, 1.15) | 1.4 × 10−11 | 1.06 (1.05, 1.07) | 1.6 × 10−18 |
aEffect allele (EA).
bAverage frequency of effect allele in the three cohorts for which the surgical phenotype was available (Iceland, UKB, and Denmark).
cOR and P value for an inverse-variance weighted meta-analysis of association results for three cohorts (LDHsurg) and all four cohorts (IDD).
Fig. 3Mendelian randomization (MR) analyses of the genetic relationship between IDD and dorsalgia in terms of causality.
a shows effects of variants associating with Dorsalgia at genome-wide significance, on IDD and dorsalgia. b shows the effects of variants associating with IDD at genome-wide significance, on IDD and dorsalgia. Effects are expressed as logarithms of odds ratios (log(OR)) and black crosses indicate 95% confidence intervals (CI) around effects. To avoid sample overlap, exposure effects are from the cohorts from Iceland (IS), Denmark (DK) and Finland (FIN), while outcome effects are from UK-Biobank. The dashed blue lines show the linear regression fit through the origin, weighting variants according to the square of the standard error of their effect estimates (also known as inverse-variance weighted, IVW)a)). The IVW-MR method is a multiplicative random effects model, where the test statistic is from a t-distribution, the test is two sided. No multiple comparison adjustments were made. The dashed red lines show the weighted linear regression fit not constrained to go through the origin (also known as MR Eggerb)). For the IDD variants, the slopes of both regression lines are different from zero. For the dorsalgia variants, the slope of the regression line (IVW) through the origin is different from zero, but not the slope of the unconstrained regression line (MR Egger). Further, the effects of the dorsalgia variants deviate substantially more from the regression lines than the IDD variants. These results are not sensitive to outlier removal (Methods, Supplementary Fig. 3).