| Literature DB >> 28492228 |
Tom Parks1, Mariana M Mirabel2, Joseph Kado3,4, Kathryn Auckland1, Jaroslaw Nowak5, Anna Rautanen1, Alexander J Mentzer1, Eloi Marijon2,6, Xavier Jouven2,6, Mai Ling Perman4, Tuliana Cua7, John K Kauwe8, John B Allen8, Henry Taylor9, Kathryn J Robson10, Charlotte M Deane5, Andrew C Steer11,12, Adrian V S Hill1.
Abstract
The indigenous populations of the South Pacific experience a high burden of rheumatic heart disease (RHD). Here we report a genome-wide association study (GWAS) of RHD susceptibility in 2,852 individuals recruited in eight Oceanian countries. Stratifying by ancestry, we analysed genotyped and imputed variants in Melanesians (607 cases and 1,229 controls) before follow-up of suggestive loci in three further ancestral groups: Polynesians, South Asians and Mixed or other populations (totalling 399 cases and 617 controls). We identify a novel susceptibility signal in the immunoglobulin heavy chain (IGH) locus centring on a haplotype of nonsynonymous variants in the IGHV4-61 gene segment corresponding to the IGHV4-61*02 allele. We show each copy of IGHV4-61*02 is associated with a 1.4-fold increase in the risk of RHD (odds ratio 1.43, 95% confidence intervals 1.27-1.61, P=4.1 × 10-9). These findings provide new insight into the role of germline variation in the IGH locus in disease susceptibility.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28492228 PMCID: PMC5437274 DOI: 10.1038/ncomms14946
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Oceanian study population.
(a) Approximate location where genotyped cases (red) and controls (black) were sampled. (b) Projection of the samples on to the first and second (left) and first and third (right) principal components (PCs) of genetic variation coloured by self-reported ancestry (MEL, Melanesians; POL, Polynesian; IND, Fijian Indian; MIX, Mixed and other) with cases indicated by empty squares and controls by empty diamonds. Selected samples from the Human Genome Diversity Project Panel (NGH, Papuan; EAS, South East Asian; EUR, European; CSA, Central South Asian) are superimposed for comparison and indicated by filled circles. (c) Estimates of admixture proportions from four source populations grouped by self-reported ancestry, with selected samples of Papuan and European ancestry shown at the far left and right, respectively, for comparison.
Figure 2Genome-wide meta-analysis for RHD susceptibility.
For each variant, the negative common logarithm of the P value from an inverse-variance weighted fixed-effects meta-analysis is plotted against genomic position. The blue horizontal line indicates suggestive significance (FE meta-analysis, P=10−5) and the red horizontal line indicates genome-wide significance (FE meta-analysis, P=5 × 10−8).
Figure 3Association of the IGHV4-61 locus with RHD susceptibility.
(a) For each variant in the 99% credible set, the common logarithm of the Bayes' factor is plotted against genomic position. Variants are coloured by linkage disequilibrium with the most associated variant averaged across the entire data set (estimated r2: dark blue, 0–0.2; light blue, 0.2–0.4; green, 0.4–0.6; yellow, 0.6–0.8; red, 0.8–1.0). A vertical blue line indicates the position of the four nonsynonymous variants in IGHV4-61 and locations of expressed IGH gene segments are indicated by blue rectangles below the x axis. (b) Forest plot for the IGHV4-61*02 allele under an additive genetic model with association statistics from LMM analysis in each strata combined by FE meta-analysis. Individual and combined odds ratio estimates with confidence intervals are shown on a logarithmic scale. (c) Structural model of an antibody that includes the IGHV4-61 heavy variable domain (Protein Databank 4FQQ) showing both heavy (blue) and light (white) chains with both the first (CDR-H1, green) and second (CDR-H2, violet) heavy chain complementarity determining loops and the heavy chain interface framework loop (HIFL, red) highlighted. The positions that distinguish IGHV4-61*01 from IGHV4-61*02 are shown as spheres labelled with the amino acids found in IGHV4-61*01.
Association of the IGHV4-61*02 allele with RHD susceptibility by ancestry and country.
| Ancestry | Melanesian | |||||||||||
| iTaukei | All | 307 | 553 | 790 | 0.32 | 0.24 | LMM | 1.02 | 1.34 (1.07–1.67) | 0.011 | ||
| Matched | 296 | 296 | 592 | 0.32 | 0.25 | LMM | 1.01 | 1.49 (1.15–1.94) | 0.003 | |||
| LR | 1.01 | 1.51 (1.15–1.98) | 0.0024 | |||||||||
| Kanak | All | 280 | 154 | 397 | 0.31 | 0.22 | LMM | 1.03 | 1.80 (1.30–2.49) | 0.00039 | ||
| Matched | 153 | 153 | 306 | 0.29 | 0.19 | LMM | 1.09 | 1.77 (1.22–2.58) | 0.0028 | |||
| LR | 1.07 | 1.80 (1.21–2.69) | 0.0028 | |||||||||
| Polynesian | ||||||||||||
| Samoan | All | 61 | 74 | 134 | 0.32 | 0.18 | LMM | 0.99 | 2.07 (1.23–3.50) | 0.0062 | ||
| Matched | 55 | 55 | 110 | 0.35 | 0.18 | LMM | 0.99 | 2.16 (1.25–3.75) | 0.0061 | |||
| LR | 1.04 | 2.24 (1.21–4.15) | 0.0066 | |||||||||
| Fijian Indian | ||||||||||||
| All | Matched | 142 | 142 | 284 | 0.18 | 0.11 | LMM | 1 | 1.99 (1.18–3.36) | 0.0096 | ||
| LR | 1.00 | 2.02 (1.18–3.49) | 0.0092 | |||||||||
| Mixed and other | ||||||||||||
| Country | Fiji Islands | All | All | 532 | 751 | 1245 | 0.27 | 0.22 | LMM | 1.02 | 1.39 (1.16–1.67) | 0.00043 |
| New Caledonia | All | All | 422 | 362 | 779 | 0.28 | 0.18 | LMM | 1.02 | 1.57 (1.25–1.97) | 8.6 × 10−5 | |
| Samoa | All | All | 61 | 75 | 135 | 0.32 | 0.18 | LMM | 0.99 | 2.10 (1.25–3.54) | 0.0054 | |
CI, confidence interval; freq., frequency; LMM, linear mixed model; LR, logistic regression; OR, odds ratio; RHD, rheumatic heart disease; Subpopn, subpopulation.
Lines highlighted in bold refer to the initial discovery and replication analyses, while other lines refer to subsequent sensitivity analyses. The genomic control factor (λ) was calculated from a genome-wide analysis using the analytical method indicated.