Literature DB >> 36007525

Variation in CFHR3 determines susceptibility to meningococcal disease by controlling factor H concentrations.

Vikrant Kumar1, Richard B Pouw2, Matias I Autio3, Manfred G Sagmeister4, Zai Yang Phua5, Lisa Borghini6, Victoria J Wright7, Clive Hoggart7, Bangfen Pan3, Antson Kiat Yee Tan8, Alexander Binder4, Mieke C Brouwer9, Ellie Pinnock10, Ronald De Groot11, Jan Hazelzet12, Marieke Emonts13, Michiel Van Der Flier14, Karl Reiter15, Markus M Nöthen16, Per Hoffmann16, Luregn J Schlapbach17, Evangelos Bellos7, Suzanne Anderson18, Fatou Secka18, Federico Martinón-Torres19, Antonio Salas20, Colin Fink10, Enitan D Carrol21, Andrew J Pollard22, Lachlan J Coin23, Werner Zenz4, Diana Wouters9, Lay Teng Ang8, Martin L Hibberd24, Michael Levin7, Taco W Kuijpers25, Sonia Davila26.   

Abstract

Neisseria meningitidis protects itself from complement-mediated killing by binding complement factor H (FH). Previous studies associated susceptibility to meningococcal disease (MD) with variation in CFH, but the causal variants and underlying mechanism remained unknown. Here we attempted to define the association more accurately by sequencing the CFH-CFHR locus and imputing missing genotypes in previously obtained GWAS datasets of MD-affected individuals of European ancestry and matched controls. We identified a CFHR3 SNP that provides protection from MD (rs75703017, p value = 1.1 × 10-16) by decreasing the concentration of FH in the blood (p value = 1.4 × 10-11). We subsequently used dual-luciferase studies and CRISPR gene editing to establish that deletion of rs75703017 increased FH expression in hepatocyte by preventing promotor inhibition. Our data suggest that reduced concentrations of FH in the blood confer protection from MD; with reduced access to FH, N. meningitidis is less able to shield itself from complement-mediated killing.
Copyright © 2022 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  CFH; CFHR3; Complement; Factor H-related protein; Meningococcal disease; Neisseria meningitidis; factor H; genotyping; in-depth sequencing; infectious disease

Mesh:

Substances:

Year:  2022        PMID: 36007525      PMCID: PMC9502058          DOI: 10.1016/j.ajhg.2022.08.001

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.043


Introduction

Meningitis and sepsis caused by Neisseria meningitidis remain amongst the most feared bacterial infections world-wide. Although immunization has decreased the incidence of invasive meningococcal disease (MD) in some countries, there are no vaccines effective against all serogroups, and the emergence of new serogroups and strains, poses new challenges to international vaccination strategies. Epidemics and outbreaks continue to occur in many countries, particularly in the meningitis belt of sub-Saharan Africa.3, 4, 5, 6 A remarkable feature of N. meningitidis is that it is a harmless commensal for the majority of the world’s population and is carried in the nasopharynx repeatedly throughout life. Invasive disease occurs in 0.16–20 per 100,000 people in developed countries, but there is wide variation in incidence, and epidemics occur.,,, There is good evidence that genetic factors play a role in MD.9, 10, 11 Rare Mendelian defects in complement genes are associated with familial MD., Our previous genome-wide association study (GWAS) identified an association between MD and a broad genomic region spanning complement factor H (CFH [MIM: 134370]) and the complement factor H-related protein (in genetic order; CFHR3 [MIM: 605336], CFHR1 [MIM: 134371], CFHR4 [MIM: 605337], CFHR2 [MIM: 600889], and CFHR5 [MIM: 608593]) genes. Identification of the causal gene and characterization of the functional variant(s) have been difficult because of the complexity of the region; CFH shows sequence similarity to the five adjacent CFHR genes on human chromosome 1. Factor H (FH) is a serum glycoprotein that is synthesized mostly in the liver and acts as a negative regulator of the alternative complement activation pathway. FH is a crucial factor in preventing host cell damage by uncontrolled complement activation, and genetic variation in CFH or the CFHR genes is associated with several diseases, including systemic lupus erythematosus (SLE [MIM: 152700]), glomerulonephritis, IgA nephropathy, atypical hemolytic uremic syndrome (aHUS [MIM: 235400]) and age-related macular degeneration (AMD [MIM: 603075),, although the mechanistic process leading to disease is unclear for all these diseases. N. meningitidis expresses several membrane proteins that bind human FH; Neisserial surface protein A (NspA), Porin B2 (PorB2), Porin B3 (PorB3), and FH-binding protein (fHbp) and is believed to survive and replicate in human blood by using the surface bound FH in a “Trojan horse” process to inhibit complement-mediated killing. Genetically regulated differences in FH plasma concentrations might thus alter susceptibility to N. meningitidis. Furthermore, inhibition of complement by “hijacking” FH has been adopted as an immune evasion strategy by several pathogens, including fungi, parasites, and viruses next to bacteria (reviewed in). We aimed to identify the mechanism underlying the association of variants within the CFH-CFHR region with susceptibility and resistance to MD.

Methods

Study sample sets

The design for our study and the composition of clinical cohorts are shown in Figure S1 and Table S1. Clinical details of individuals with MD in UK, Spanish, and other European cohorts have been reported previously, as have the diagnostic criteria, recruitment procedure, and ethical approvals, (supplementary Appendix). 238 individuals with MD and 237 controls from the Central European cohort (CEC) were used for deep sequencing the CFH-CFHR region. Replication of the most significant SNPs was undertaken in 1,522 individuals with MD and 2,672 controls (755 individuals with MD and 1,253 controls from the UK, 279 individuals with MD and 395 controls from Central Europe, and 488 individuals with MD and 1,024 controls from Spain)., Previously genome-wide-genotyped cohorts totaling 1,246 individuals with MD and 7,197 controls (472 individuals with MD and 4,614 controls from the UK; 358 individuals with MD and 1,770 controls from Central Europe; and 416 individuals with MD and 813 controls from Spain,) were newly imputed, and the data were used for a subsequent meta-analysis. Convalescent serum was available from 367 individuals with MD (308 UK, 59 Dutch) and 124 healthy, unrelated Dutch controls for measurement of FH and FHR-3 concentrations; of the 308 UK individuals with MD, 295 were included in protein quantitative trait loci (pQTL) analysis, together with 56 healthy, unrelated controls from Central Europe.

Sequencing and genotyping of the CFH-CFHR region

To identify functional variants driving the association with MD susceptibility, we devised a capture-targeted sequencing strategy with tiling arrays (designed by Roche NimbleGen) covering more than 85% of the CFH-CFHR region spanning 359 kb on chromosome 1 (chr1: 196,620,000–196,979,000, GRCh37/hg19) and then performed sequencing with Illumina HiSeq 2000 by using 100 bp paired-end reads (stage 1, see supplemental information). The average depth of sequencing was 227× (Figure S2). We validated the most significant SNPs (stage 2, see supplemental information) by using a Sequenom Multiplex MassArray (San Diego, USA). Genetic association testing was carried out with Fisher’s exact test for rare SNPs (MAF < 1%) and logistic regression analysis for common SNPs and copy-number variants (CNVs) under an additive genetic model. To mitigate the effect of population stratification, we analyzed association of SNPs with MD separately in all three replication cohorts under the additive model and performed meta-analysis for both SNPs and CNVs by combining summary statistics of stage 1 (deep sequencing) and stage 2 (Sequenom validation) by using the Cochran-Mantel-Haenszel (CMH) test. For CNVs, all samples were combined and analyzed under a genotypic and additive model.

Detection of copy-number variation

CNVs were detected in the resequencing dataset (238 individuals with MD and 237 controls from Central Europe) with cnvCapSeq (version 0.1.230) and cross-validated by quantitative PCR in the same cohort (Taqman qPCR, Table S2). In the second-stage validation of the 51 SNPs across the Central Europe, UK, and Spanish cohorts, detection of CNVs was done with the Taqman qPCR assays. For pQTL data, multiplex ligation-dependent probe amplification (MLPA) and Taqman assays were used for identifying CNVs (supplemental information). Genotype-phenotype correlation of SNPs and CNVs were analyzed for FH and FHR-3 concentrations via linear regression analysis. We used ANCOVA, with sex as a covariate, to estimate the overall difference in the protein concentrations across six genotype groups. Differences in protein concentrations between two genotype groups were evaluated by t test.

Imputation of genome-wide genotyped data

To confirm our resequencing analysis by using current genome assemblies, we re-analyzed our original UK GWAS data, including newly genome-wide-genotyped cohorts from Central Europe and Spain (stage 3, see supplemental information). After pre-processing (supplemental information), we used BEAGLE (version 5.1) to perform haplotype estimation and imputation of missing genotypes by utilizing alternately the Haplotype Reference Consortium (HRC [http://www.haplotype-reference-consortium.org]; HRC release 1.1 [https://ega-archive.org/datasets/EGAD00001002729]) and the 1000 Genomes Project phase 3 (1KGP [http://www.internationalgenome.org]) as reference genomes. After extraction of the individually calculated allele dose, which is the sum of the two allele probabilities based on a hidden Markov model, we applied a univariate linear mixed-model algorithm (uLMM) using a centered relatedness matrix implemented in GEMMAsoftware (version 0.98.1) to perform genetic association testing for quantitative traits under an additive model. To additionally account for population stratification, we used the first two or four principal components (PCs; Figure S4) as covariates in each individual cohort. Furthermore, the genomic control function implemented in the GWAMA software (version 2.2.2) was used for the subsequent meta-analysis of the single summary statistics, resulting in an overall genomic control lambda (λ) of 1.002 (95% CI 0.094–1.010) when all variants were used and 1.007 (95% CI 0.971–1.0432) when only genotyped variants were used (Figure S5).

Serum concentrations of FH and FHR-3

FH and FHR-3 concentrations were determined by specific ELISAs as previously described. In brief, the antigen was captured with monoclonal antibody anti-FH.16 and anti-FHR-3.1 for FH and FHR-3, respectively (Sanquin Research, Amsterdam, The Netherlands). Bound FH was subsequently detected by the use of polyclonal goat anti-human FH antibodies, and bound FHR-3 was detected with monoclonal anti-FHR-3.4 (Sanquin Research).

Differentiation of human embryonic stem cells

Wild-type and CRISPR/Cas-targeted H1 human embryonic stem (hES) cells were differentiated to hepatocyte-like cells over 18 days as previously described.

Genome editing of differentiated hepatocytes by CRISPR/Cas9

Guide RNAs flanking the liver-specific regulatory region of interest in CFHR3 were designed, incorporated in plasmids, and transfected via electroporation into H1 hES cells (Table S3). After incubation for two days, Clover+ cells were seeded at 500–1,000 cells per well of a 6-well plate. After 2-3 weeks culture, single colonies were picked and expanded for screening. Deletion of liver specific regulatory region was determined by PCR with primers spanning the targeted region (Table S3). Confirmed deletion clones and wild-type controls were used for detecting RNA expression by RT-PCR (see supplemental information).

Results

Fine mapping of the CFH-CFHR region identifies CFHR3 as the lead association

Deep sequencing of the CFH-CFHR region in 238 individuals with MD and 237 healthy controls identified 4,369 SNPs after application of stringent quality-control filters (Table S4). The strongest signal of association was identified on CFHR3 in a region with high linkage disequilibrium (LD, D′ = 0.92) with the previously reported lead variant, rs1065489 in CFH.,, The 51 SNPs with the strongest association with MD were selected for validation, and 44 SNPs were successfully typed (Table S5) in the UK, Spanish, and Central European cohorts (n = 4,194). 13 SNPs, in a tight LD block within CFHR3, achieved genome-wide significance in the meta-analysis (Figure 1, Table 1), confirming the genetic association with CFHR3. The lead SNP (Table 1), rs75703017 (p value = 1.1 × 10−16), located in intron 1 of CFHR3, showed consistent odds ratios (OR = 0.62) for susceptibility to MD in all cohorts, indicating a protective effect (Figure 2).
Figure 1

Fine mapping by sequencing of the CFH-CFHR locus

Plot showing association results of all the SNPs (arranged according to their GRCh37/hg19 build chromosomal position on the x axis) from deep sequencing (circle) and from meta-analysis (square) with combined stage 1 and stage 2 cohorts. The top SNP from the analysis is labeled (rs75703017). The color intensity of each symbol reflects the extent of LD with the top GWAS SNP.

Table 1

Unadjusted p values and ORs of top SNPs from sequencing, genotyping, and the combined analysis

SNP IDSequence
Replication
Combined CMHa
Central Europe
Central Europe
UK
Spain
MAF MDMAF conp valueORMAF MDMAF Conp valueORMAF MDMAF conp valueORMAF MDMAF Conp valueORp valueOR
rs757030170.120.213.11 × 10−30.620.140.213.60 × 10−30.680.140.218.34 × 10−60.710.180.262.26 × 10−40.721.11 × 10−160.62
rs6200150.130.216.18 × 10−30.650.140.223.62 × 10−30.680.150.212.27 × 10−50.730.200.261.01 × 10−30.759.55 × 10−150.64
rs3871070.140.219.35 × 10−30.670.140.214.49 × 10−30.680.150.214.97 × 10−50.740.200.273.11 × 10−40.731.24 × 10−140.65
rs3853900.140.226.04 × 10−30.650.150.229.74 × 10−30.710.150.211.50 × 10−50.720.200.277.11 × 10−40.751.35 × 10−140.65
rs124095710.130.215.04 × 10−30.640.140.218.20 × 10−30.700.150.201.10 × 10−40.750.190.262.46 × 10−40.732.38 × 10−140.65
rs4255240.130.217.46 × 10−30.650.360.454.97 × 10−30.730.310.422.63 × 10−90.660.460.509.96 × 10−20.878.26 × 10−140.69
rs4011880.130.215.98 × 10−30.650.220.253.36 × 10−10.870.200.341.14 × 10−130.460.340.383.22 × 10−20.763.32 × 10−130.64
rs17387410.140.219.35 × 10−30.670.140.211.06 × 10−20.710.150.211.39 × 10−50.720.220.271.43 × 10−20.811.54 × 10−120.67
rs3768410.130.214.42 × 10−30.640.100.161.55 × 10−30.560.100.151.29 × 10−50.630.140.184.74 × 10−40.653.06 × 10−110.66
rs13294230.190.274.10 × 10−30.630.210.263.43 × 10−20.750.210.259.72 × 10−40.780.230.307.31 × 10−50.705.07 × 10−100.73
rs118079970.130.215.12 × 10−30.640.190.188.97 × 10−11.020.150.211.24 × 10−40.740.190.251.12 × 10−30.751.34 × 10−90.71
rs124084460.130.207.96 × 10−30.650.200.185.19 × 10−11.080.150.218.96 × 10−50.740.190.251.36 × 10−30.766.86 × 10−90.72
rs1162490580.140.217.53 × 10−30.660.120.142.54 × 10−10.830.120.151.74 × 10−20.810.140.218.85 × 10−50.679.59 × 10−90.70

CMH = Cochran-Mantel-Haenszel test; MD = individuals with meningococcal disease; con = healthy controls; MAF = minor-allele frequency; OR = odds ratio.

Figure 2

Forest plot of the top SNP, rs75703017

Plot showing odds ratios (ORs) and confidence intervals in each of the four cohorts; the summary estimate is shown below. The dotted vertical line indicates no effect. Cohort names with numbers of MD individuals and healthy controls are on the left, ORs and 95% CI are on the right.

Fine mapping by sequencing of the CFH-CFHR locus Plot showing association results of all the SNPs (arranged according to their GRCh37/hg19 build chromosomal position on the x axis) from deep sequencing (circle) and from meta-analysis (square) with combined stage 1 and stage 2 cohorts. The top SNP from the analysis is labeled (rs75703017). The color intensity of each symbol reflects the extent of LD with the top GWAS SNP. Unadjusted p values and ORs of top SNPs from sequencing, genotyping, and the combined analysis CMH = Cochran-Mantel-Haenszel test; MD = individuals with meningococcal disease; con = healthy controls; MAF = minor-allele frequency; OR = odds ratio. Forest plot of the top SNP, rs75703017 Plot showing odds ratios (ORs) and confidence intervals in each of the four cohorts; the summary estimate is shown below. The dotted vertical line indicates no effect. Cohort names with numbers of MD individuals and healthy controls are on the left, ORs and 95% CI are on the right.

Imputation across the CFH-CFHR region confirms a broad region of association

Imputation of genome-wide genotyped data in three different European cohorts including 1,246 individuals with MD and 7,197 controls was complicated by the reported CNVs nsv3888824 (deletion) spanning 84,671 bases and resulting in a hybrid CFH/CFHR1 gene and nsv4649133 (deletion) spanning 79,989 bases and resulting in a complete deletion of CFHR3 and CFHR1 (Figure 3C). Using a subset of the HRC reference panel identified only four of the 13 SNPs found within and adjacent to CFHR3 by our resequencing work, whereas use of the 1KGP reference panel, which identified 11 SNPs with high confidence in a tight LD block closely around the genome-wide significance level of 5 × 10−8 (Table 2 and Figure 3). The usage of updated references and bioinformatic tools for the association mapping (Figure 3) allowed us to impute variants within the complex region in and adjacent to CFHR3, but we were still unable to refine the location of the causative variant within the CFH-CFHR locus as a result of the apparent tight LD. Indeed, we observed discrepancies between the FH associations reported by Sun et al. and our findings based on the 1KGP controls (Table S6); we suspect these discrepancies are due to differences in the imputation methods used for estimating the CNVs. Using imputation for the CNVs, we found peak association with MD outside the CNVs; whreas direct measurement of the CNVs shows the peak association to be within them (Figure 1).
Figure 3

Fine mapping of the CFH-CFHR locus by GWAS

(A) Known variants reported in the NHGRI-EBI catalog of human genome-wide association studies. An asterisk represents the location of the rs426736 SNP within the CNVs and annotated as associated with MD.

(B) The plot represents the genes located in the captured region (ranging from CFH to CFHR5) of the sequencing approach and shows association results of all variants (SNPs and InDels arranged according to their GRCh37/hg19 build chromosomal position on the x axis) from GWAS meta-analysis with the lead SNP, rs1065489, set as a reference variant (purple diamond). The color intensity of each symbol reflects the extent of LD with the top GWAS SNP.

(C) dbVar (https://www.ncbi.nlm.nih.gov/dbvar/)-annotated common CNVs with partial (nsv3888824 results in a CFH/CFHR1 hybrid gene) or complete (nsv4649133) deletion of CFHR3 and CFHR1.

(D) Plot showing association results of all variants (SNPs and InDels arranged according to their GRCh37/hg19 build chromosomal position on the x axis) from a GWAS meta-analysis with the lead SNP, rs75703017, from stages 1+2 set as a reference variant (purple diamond) mapping to a smaller genetic area focused on the start of the CNVs. Variants, which were either previously reported, or notable findings from this study (stages 1–3) are annotated within the plot. #Annotated variants represent small InDels within the CNVs. Violet vertical and green lines represent the start of the CNVs nsv3888824 and nsv4649133, respectively.

Table 2

p values and ORs of the top 20 variants from the GWAS meta-analysis and of the 11 SNPs from the sequencing and genotyping

Variant IDBPVariantAltRefMAFORpmetaI2DR2
rs1065489a196,709,774missenseTG0.170.691.25 × 10−100.74gt
rs200384682196,739,608bindelCAC0.180.691.81 × 10−100.720.90
rs431408196,764,663bintronGT0.200.692.62 × 10−100.610.91
rs3753396196,695,742synonymousGA0.170.693.21 × 10−100.75gt
rs380424196,763,939bdownstreamCT0.180.693.38 × 10−100.700.97
rs72482676196,730,755bintergenicCT0.160.684.21 × 10−100.730.96
rs11582939a196,710,157intronTC0.170.704.99 × 10−100.72gt
rs742855a196,705,520intronCT0.170.705.66 × 10−100.76gt
rs141408533196,690,281intronTTA0.170.706.77 × 10−100.751.00
rs377298196,758,541b3′ UTRCA0.190.706.98 × 10−100.610.93
rs77302817196,698,082indelCCTCTG0.170.707.32 × 10−100.761.00
rs12402808196,691,625intronAC0.170.707.65 × 10−100.761.00
rs11799380196,708,455intronGA0.170.707.91 × 10−100.731.00
rs2336221196,708,891intronTG0.170.707.91 × 10−100.731.00
rs11801630196,692,148intronTC0.170.708.03 × 10−100.75gt
rs1048663196,674,982intronAG0.170.709.14 × 10−100.751.00
rs74861068196,825,380intronAG0.130.659.60 × 10−100.360.99
rs74213209196,679,010intronGA0.170.709.65 × 10−100.751.00
rs201034534196,720,267bindelAAAAAC0.170.701.00 × 10−90.740.99
rs10489456a196,687,515intronAG0.170.701.05 × 10−90.731.00
rs12409571196,768,726bintergenicGA0.200.735.80 × 10−80.570.87
rs116249058196,767,218bdownstreamGA0.200.746.37 × 10−80.530.87
rs75703017196,744,699bintronAC0.200.746.80 × 10−80.560.88
rs387107196,757,881bmissenseTG0.210.747.19 × 10−80.510.87
rs11807997196,743,213bupstreamGA0.200.748.68 × 10−80.560.87
rs401188196,757,083bintronTC0.210.741.30 × 10−70.500.87
rs12408446196,741,197bupstreamAG0.210.741.38 × 10−70.530.88
rs620015196,748,676bintronGA0.210.741.47 × 10−70.510.87
rs376841196,746,600bintronCT0.210.751.53 × 10−70.510.87
rs385390196,743,927b5′ UTRCA0.210.752.36 × 10−70.500.87
rs1329423196,646,387exonCT0.260.794.09 × 10−70.710.99

BP = base position (GRCh37/hg19); MAF = minor-allele frequency; OR = odds ratio (estimated from LMM beta effects according to https://shiny.cnsgenomics.com/LMOR/); DR2 = mean dosage R-squared from the three single cohorts. gt = genotyped, no DR2 score. SNPs from the sequencing and genotyping are indicated in italics.

Previously reported as associated with MD susceptibility.

Within a CNV (nsv3888824, nsv4649133).

Fine mapping of the CFH-CFHR locus by GWAS (A) Known variants reported in the NHGRI-EBI catalog of human genome-wide association studies. An asterisk represents the location of the rs426736 SNP within the CNVs and annotated as associated with MD. (B) The plot represents the genes located in the captured region (ranging from CFH to CFHR5) of the sequencing approach and shows association results of all variants (SNPs and InDels arranged according to their GRCh37/hg19 build chromosomal position on the x axis) from GWAS meta-analysis with the lead SNP, rs1065489, set as a reference variant (purple diamond). The color intensity of each symbol reflects the extent of LD with the top GWAS SNP. (C) dbVar (https://www.ncbi.nlm.nih.gov/dbvar/)-annotated common CNVs with partial (nsv3888824 results in a CFH/CFHR1 hybrid gene) or complete (nsv4649133) deletion of CFHR3 and CFHR1. (D) Plot showing association results of all variants (SNPs and InDels arranged according to their GRCh37/hg19 build chromosomal position on the x axis) from a GWAS meta-analysis with the lead SNP, rs75703017, from stages 1+2 set as a reference variant (purple diamond) mapping to a smaller genetic area focused on the start of the CNVs. Variants, which were either previously reported, or notable findings from this study (stages 1–3) are annotated within the plot. #Annotated variants represent small InDels within the CNVs. Violet vertical and green lines represent the start of the CNVs nsv3888824 and nsv4649133, respectively. p values and ORs of the top 20 variants from the GWAS meta-analysis and of the 11 SNPs from the sequencing and genotyping BP = base position (GRCh37/hg19); MAF = minor-allele frequency; OR = odds ratio (estimated from LMM beta effects according to https://shiny.cnsgenomics.com/LMOR/); DR2 = mean dosage R-squared from the three single cohorts. gt = genotyped, no DR2 score. SNPs from the sequencing and genotyping are indicated in italics. Previously reported as associated with MD susceptibility. Within a CNV (nsv3888824, nsv4649133).

Serum concentrations of FH, but not FHR-3, are lower in controls than in individuals who survived MD

To explore the relationship between serum concentrations of FH, FHR-3, and MD, we measured concentrations in serum from individuals who survived invasive MD, at least six months after the acute illness. Serum concentrations of FH were significantly lower in healthy controls than in those with MD (Figure 4A). In contrast, FHR-3 serum concentrations were not significantly different between MD survivors and controls (Figure 4A).
Figure 4

FH and FHR-3 concentrations in MD survivors

(A) Box plots with 95% range of FH serum concentrations (left panel), determined by ELISA. FH serum concentrations are increased (p < 0.0001, Mann-Whitney test) in MD survivors (n = 367) compared to healthy controls (n = 124), whereas FHR-3 serum concentrations are not significantly different between the two groups.

(B) Violin plotsof FH serum concentrations delineated by genotype for the SNP (rs75703017) most associated with FH concentrations; p = 1.41 × 10−11.

(C) Violin plot of FHR-3 serum concentrations delineated by genotype for the SNP (rs75703017), p value < 2 × 10−16.

For both (B) and (C), the x and y axes indicate genotypes and protein concentration (µg/mL), repsectively, with D = CFHR3/CFHR1 deletion, A = minor allele, and C = major allele for rs75703017. The white dots indicate median concentrations, the thick black bars indicate the interquartile range, and the thin black bars represent total range. p values were estimated by ANCOVA.

FH and FHR-3 concentrations in MD survivors (A) Box plots with 95% range of FH serum concentrations (left panel), determined by ELISA. FH serum concentrations are increased (p < 0.0001, Mann-Whitney test) in MD survivors (n = 367) compared to healthy controls (n = 124), whereas FHR-3 serum concentrations are not significantly different between the two groups. (B) Violin plotsof FH serum concentrations delineated by genotype for the SNP (rs75703017) most associated with FH concentrations; p = 1.41 × 10−11. (C) Violin plot of FHR-3 serum concentrations delineated by genotype for the SNP (rs75703017), p value < 2 × 10−16. For both (B) and (C), the x and y axes indicate genotypes and protein concentration (µg/mL), repsectively, with D = CFHR3/CFHR1 deletion, A = minor allele, and C = major allele for rs75703017. The white dots indicate median concentrations, the thick black bars indicate the interquartile range, and the thin black bars represent total range. p values were estimated by ANCOVA.

Low serum concentrations of FH are associated with both SNPs and CNVs in CFHR3

To investigate the effect of the top associated SNPs on the serum concentrations of FH and FHR-3, we undertook pQTL analysis, relating protein concentrations and genotype. The minor allele (A) of the lead SNP (rs75703017), shown to confer protection against MD (OR = 0.63), also showed the most significant association with lower FH serum concentrations (p = 1.4 × 10−11, Figure 4B), as confirmed in a pairwise comparison between genotypes carrying the minor allele (A) of rs75703017 (Figures 4B and 4C and Tables S7 and S11). A common deletion spanning CFHR3 and CFHR1 (including rs75703017) has been shown to influence susceptibility to several inflammatory diseases.,,, To establish whether this deletion was also associated with susceptibility to MD, we determined CNVs in the sequenced individuals with MD by using cnvcapSeq, which permits detection of CNVs in long-range targeted sequencing data. We then validated the findings by MLPA or qPCR analysis in a subset of samples (1,302 individuals with MD, 1,463 controls) from three European cohorts. Meta-analysis of CNV data revealed an overall lack of association (p = 0.76) between the CFHR3/CFHR1 deletion and susceptibility to MD (Table S8), as previously reported., Considering that the pQTL data indicated a dominant effect of the minor allele (A) of rs75703017, we performed a second comparison consisting of minor-allele carriers (A) vs. CFHR3/CFHR1 deletion allele carriers. In contrast to the initial overall lack of association, this second comparison revealed that deletion of CFHR3/CFHR1 was in fact associated with higher genetic risk of MD (p = 0.0081, Table S9) and increased FH serum concentrations. This positive genetic association with the CFHR3/CFHR1 deletion was only detected when the combination of three alleles (wild type allele C, minor allele A, and deletion D) in rs75703017 were taken into account. Meta-analysis of the quantitative-trait association removing all samples with the CFHR3/CFHR1 deletion did not modify the results (Figure S3), indicating that the association of rs75703017 persists regardless of CFHR3/CFHR1 deletion status (Table S11).

CFHR3 controls CFH expression through epigenetic long-range interaction

Having established the correlation between “protective genotypes” and lower serum concentrations of FH and between “risk genotypes” and higher concentrations of FH, we next investigated the epigenetic histone marks in various cell lines to provide information on the putative regulatory role of the potential functional SNP in CFHR3. Histone marks (H3K4me3 and H3K9ac) from the Roadmap epigenomics database indicated that all investigated hepatic cell lines have an active regulatory site within CFHR3. Furthermore, no other cell types (non-hepatic) tested showed any indication of regulatory regions, suggesting that this functional site might be specifically active in liver cells,, which is concordant with the liver’s being the main FH-producing organ. In line with our hypothesis that there is a regulatory interaction between CFHR3 and CFH, we examined whether the homozygous deletion of CFHR3/CFHR1, carried by 3% of the European population, affected FH protein concentrations. Indeed, the deletion of CFHR3/CFHR1, identified by the lack of FHR-3 in serum, was associated with significantly higher FH protein concentrations (Figure 4B).

Dual-luciferase assays confirm liver-specific activity

To confirm the role of the rs75703017 minor allele identified in our fine mapping in regulating CFH activation, we compared luciferase activity of a liver cell line (HepG2) and of a line originating from embryonic kidney (HEK293T). We compared cells containing an empty vector (pGL3-empty) and three constructs containing the following: rs75703017 major allele C (pGL3-C); rs75703017 minor allele A (pGL3-A); and rs75703017 minor allele A together with minor alleles of two SNPs in close proximity (A of rs446868 and C of rs385390, pGL3-AAC) (Figure S7A, Table S12). Differential expression of a test reporter was detected in HepG2 (pGL3-A vs. pGL3-empty; p < 0.0001, Figure 5A) whereas HEK293T showed no significant change in expression (Figure S7B), supporting the liver-specific activity of the regulatory region.
Figure 5

Functional validation of the top associated variant

(A) Effect of 2.8 kb regulatory sequence and the lead SNP, rs75703017, on reporter (firefly luciferase) activity in the HepG2 human liver cell line. pGL3-empty is an empty construct only containing a promoter, whereas pGL3-C contains the 2.8 kb regulatory sequence with major allele C at rs75703017, pGL3-A contains minor allele A at rs75703017, and pGL3-AAC contains minor allele A at rs75703017 and minor allele C at rs446868 and rs385390. Firefly luciferase concentrations were normalized to renilla luciferase activity for each sample, and all values were plotted relative to the pGL3-empty construct. The graph is representative of six independent experiments, and error bars represent means with standard deviation. Level of significance, calculated by t test, is indicated.

(B) Schematic depiction of CRISPR/Cas9 targeting of the CFHR3 liver-specific regulatory region comprising 2,844 base pairs (chr1: 196,743,825–196,746,668). Excision sites of each guide RNA, located around the SNP of interest (rs75703017), are indicated by scissors. The position of screening primers designed for selection of positive clones with excision of the targeted region are indicated by red arrows.

(C) n-fold change, relative to the wild type, of CFH transcript expression levels of CRISPR-edited CFHR3 (CRISPR-edited H1; genotype DD) carrying one copy of CFHR3 with allele C (WT-H1; genotype DC) in liver-differentiated H1 human embryonic stem cells. Expression was measured by qRT-PCR. The graph represents three independent experiments with two biological replicates (different sets of gRNA were used for targeting; KO1 WT-Cas9 gRNA 1 and 3 and KO2 nickase-Cas9 gRNA 1, 2, 3, and 4; see Table S2) and one technical replicate of KO1. Error bars represent means with standard deviation. Level of significance, calculated by t test, is indicated. (D = CFHR3/CFHR1 deletion, and C = major allele rs75703017).

Functional validation of the top associated variant (A) Effect of 2.8 kb regulatory sequence and the lead SNP, rs75703017, on reporter (firefly luciferase) activity in the HepG2 human liver cell line. pGL3-empty is an empty construct only containing a promoter, whereas pGL3-C contains the 2.8 kb regulatory sequence with major allele C at rs75703017, pGL3-A contains minor allele A at rs75703017, and pGL3-AAC contains minor allele A at rs75703017 and minor allele C at rs446868 and rs385390. Firefly luciferase concentrations were normalized to renilla luciferase activity for each sample, and all values were plotted relative to the pGL3-empty construct. The graph is representative of six independent experiments, and error bars represent means with standard deviation. Level of significance, calculated by t test, is indicated. (B) Schematic depiction of CRISPR/Cas9 targeting of the CFHR3 liver-specific regulatory region comprising 2,844 base pairs (chr1: 196,743,825–196,746,668). Excision sites of each guide RNA, located around the SNP of interest (rs75703017), are indicated by scissors. The position of screening primers designed for selection of positive clones with excision of the targeted region are indicated by red arrows. (C) n-fold change, relative to the wild type, of CFH transcript expression levels of CRISPR-edited CFHR3 (CRISPR-edited H1; genotype DD) carrying one copy of CFHR3 with allele C (WT-H1; genotype DC) in liver-differentiated H1 human embryonic stem cells. Expression was measured by qRT-PCR. The graph represents three independent experiments with two biological replicates (different sets of gRNA were used for targeting; KO1 WT-Cas9 gRNA 1 and 3 and KO2 nickase-Cas9 gRNA 1, 2, 3, and 4; see Table S2) and one technical replicate of KO1. Error bars represent means with standard deviation. Level of significance, calculated by t test, is indicated. (D = CFHR3/CFHR1 deletion, and C = major allele rs75703017).

Genome editing of the CFHR3 region via CRISPR/Cas9 confirms its regulatory role in FH expression

To confirm that the identified CFHR3 region regulates FH expression, we undertook genome editing by using CRISPR/Cas technology (Figures 5B and 5C, and supplemental Information). This required a liver cell line that constitutively expressed FH and carried at least one copy of CFHR3. Because none of the tested cell lines complied with both requirements, we differentiated human embryonic stem cells (H1 cell line) to hepatocytes. H1 cells do not express FH or FHR-3 and carry only one copy of CFHR3. Upon differentiation to hepatocytes (Figure S8), we detected FH expression (Figure 5C, Table S13) supporting the liver-specific expression reported previously. Deletion of a 2.8 kb region (chr1: 196,743,825–196,746,668) within CFHR3 containing rs75703017 via CRISPR/Cas9 in H1 cells (Figure 5B), followed by differentiation to hepatocytes, revealed enhanced FH expression, confirming the regulatory function of this region (Figure 5C). This finding is consistent with our t-test analysis (Table S7) of rs75703017 genotypes showing significant differential FH expression between deletion/major allele C (DC) and homozygous deletion (DD) genotypes (p = 6.8 × 10−3) and is further supported by Hi-C sequencing data, a strategy by which one can study three-dimensional architecture of the genome by coupling proximity-based ligation with massive parallel sequencing and that allows identification of long-range genomic interactions. In two cell lines a long-range interaction could be observed between CFH and the association interval in CFHR3 (Figure S9). Moreover, these results were concordant with in vivo data of individuals who were homozygous for the CFHR3/CFHR1 deletion and who showed increased FH concentrations (Figure 4B).

Discussion

Genetic variants within CFH and the CFHR genes have been associated with genetic susceptibility to a range of human diseases.17, 18, 19, 20, 21, Concordant with our work, deletion of CFHR3-CFHR1 has been reported to alter FH concentrations in serum and modify genetic susceptibility to disease,,, suggesting that a regulatory region controlling FH concentrations might exist at this locus., Identification of the causal variants underlying these associations has been difficult because of the complexity of the region; CNVs and sequence homology hamper genotyping and sequencing efforts. Thus, previous reports relied on surrogate markers to identify the deletion. Our strategy here allowed us to type the CNV and polymorphisms in the CFH-CFHR region, to narrow the regulatory element to a short sequence in intron 1 of CFHR3, and to identify the complex interplay of six possible genotypes at one SNP locus, including the lead SNP and copy-number variant, with FH serum concentrations. Recent development of specific monoclonal antibodies for FH and FH-related proteins allowed for an accurate detection of serum concentrations of FH and FHR-3. We have fine mapped the complex CFH-CFHR region in individuals of European ancestry with MD and found that susceptibility and resistance to the disease is associated with a single SNP locus within intron 1 of CFHR3. This locus is affected by a well-known copy-number variant. Furthermore, by accounting for the protective effect of the minor allele (A) and the risk effect of the wild-type allele (C), we now demonstrate that the CFHR3 deletion does associate, although to a lesser extent than the identified SNP, with increased susceptibility for MD. Previous studies, have missed this effect because their deletion analysis has combined the protective and risk alleles. Interestingly, the intronic lead SNP in CFHR3, rs75703017 (p = 1.1 × 10−16, OR = 0.63, 95% CI 0.55–0.71) lies in a liver-specific regulatory region that has been shown to loop and interact with CFH at the genomic level. This interaction seems to regulate CFH transcription activity. Protective homozygous rs75703017 A allele CFHR3 genotypes were associated with low FH serum concentrations (p = 1.41 × 10−11), the homozygous rs75703017C allele genotype had higher FH serum concentrations. In our analyses, deletion of this region through genome editing in human embryonic stem cells differentiated to hepatocytes also showed a substantial increase (p value < 0.05) of CFH transcript concentrations and expression of FH protein. We showed that individuals surviving MD had higher serum concentrations of FH than controls and that low concentrations of FH were protective for MD. This is concordant with our previous report showing that addition of excess FH to blood increases the survival of N. meningitidis. Our data demonstrate that FH is a critical complement regulatory protein associated with MD susceptibility and that its serum concentrations are controlled through a cis-regulatory element in intron 1 of CFHR3, independent of FHR-3 concentrations. Whereas previous studies have suggested that competition between FHR-3 and FH for the fHbp on the surface of N. meningitidis could be the mechanism controlling susceptibility to MD, we suggest that serum concentrations of FHR-3 are too low to affect binding of the (on average) 132-fold more abundant FH to fHbp. Our genetic analysis confirms this assumption. In fact, our data indicate that the effect on MD susceptibility is predominantly defined by regulation of FH concentrations in serum by genetic variation in CFHR3, irrespective of serum FHR-3 concentrations. A schematic explanation of the inhibition of meningococcal bactericidal activity of complement in human blood by FH and its regulation by genetic variation in CFHR3 is shown in Figure 6. Importantly, our strongest genetic association is between low concentrations of serum FH and protection from disease, whereas high protein concentrations were less strongly associated with susceptibility. This suggests that N. meningitidis is able to harvest sufficient FH to prevent complement activity (thus ensuring serum survival) in most individuals and that high serum concentrations of FH only offer marginal additional bacterial protection as compared to average concentrations.
Figure 6

Schematic model of the effect of CFHR3/CFHR1 deletion and SNP (rs75703017) on FH concentrations, interactions with N. meningitidis, and susceptibility to MD

The schematic diagram shows the structure of the gene region containing CFH and five CFHR genes. Carriers of the minor allele, A, on rs75703017 show the lowest FH concentrations. Increased concentrations of FH are found with the major allele, C, on rs7570317, whereas deletion of CFHR3/CFHR1 is associated with the highest concentrations of FH. Susceptibility to MD is driven by FH serum availability, which increases binding to the meningococcal surface protein fHbp. This binding results in FH’s impairing complement-mediated killing and allowing survival and growth of N. meningitidis in blood.

Schematic model of the effect of CFHR3/CFHR1 deletion and SNP (rs75703017) on FH concentrations, interactions with N. meningitidis, and susceptibility to MD The schematic diagram shows the structure of the gene region containing CFH and five CFHR genes. Carriers of the minor allele, A, on rs75703017 show the lowest FH concentrations. Increased concentrations of FH are found with the major allele, C, on rs7570317, whereas deletion of CFHR3/CFHR1 is associated with the highest concentrations of FH. Susceptibility to MD is driven by FH serum availability, which increases binding to the meningococcal surface protein fHbp. This binding results in FH’s impairing complement-mediated killing and allowing survival and growth of N. meningitidis in blood. Our findings show that serum concentrations of FH are genetically regulated by a locus within CFHR3. Complement activation is an important immune protection mechanism against infections, but uncontrolled or excessive complement activation is potentially damaging to host cells and tissues. FH is a major regulator of complement-mediated damage to host cells as highlighted by the severe diseases associated with inadequate concentration or function of FH; such diseases include TTP/aHUS,, glomerulonephritis, other inflammatory diseases, and AMD. Next to N. meningitidis, many other pathogens (see also Moore et al.), including Streptococcus pneumoniae, group A streptococcus, Borrelia burgdorferi, and Plasmodium falciparum possess FH-binding proteins and might use FH to evade complement-mediated killing. The genomic regulation of serum FH concentration that we have identified through genetic variation in CFHR3 may thus be relevant to many other infectious and inflammatory diseases.
  46 in total

1.  Binding of complement factor H to PorB3 and NspA enhances resistance of Neisseria meningitidis to anti-factor H binding protein bactericidal activity.

Authors:  Serena Giuntini; Rolando Pajon; Sanjay Ram; Dan M Granoff
Journal:  Infect Immun       Date:  2015-02-02       Impact factor: 3.441

Review 2.  Potential influences of complement factor H in autoimmune inflammatory and thrombotic disorders.

Authors:  Janez Ferluga; Lubna Kouser; Valarmathy Murugaiah; Robert B Sim; Uday Kishore
Journal:  Mol Immunol       Date:  2017-02-16       Impact factor: 4.407

3.  Genome-wide association study identifies variants in the CFH region associated with host susceptibility to meningococcal disease.

Authors:  Sonia Davila; Victoria J Wright; Chiea Chuen Khor; Kar Seng Sim; Alexander Binder; Willemijn B Breunis; David Inwald; Simon Nadel; Helen Betts; Enitan D Carrol; Ronald de Groot; Peter W M Hermans; Jan Hazelzet; Marieke Emonts; Chui Chin Lim; Taco W Kuijpers; Federico Martinon-Torres; Antonio Salas; Werner Zenz; Michael Levin; Martin L Hibberd
Journal:  Nat Genet       Date:  2010-08-08       Impact factor: 38.330

Review 4.  Factor H family proteins and human diseases.

Authors:  Mihály Józsi; Peter F Zipfel
Journal:  Trends Immunol       Date:  2008-07-02       Impact factor: 16.687

5.  cnvCapSeq: detecting copy number variation in long-range targeted resequencing data.

Authors:  Evangelos Bellos; Vikrant Kumar; Clarabelle Lin; Jordi Maggi; Zai Yang Phua; Ching-Yu Cheng; Chui Ming Gemmy Cheung; Martin L Hibberd; Tien Yin Wong; Lachlan J M Coin; Sonia Davila
Journal:  Nucleic Acids Res       Date:  2014-09-16       Impact factor: 16.971

6.  Genome-wide efficient mixed-model analysis for association studies.

Authors:  Xiang Zhou; Matthew Stephens
Journal:  Nat Genet       Date:  2012-06-17       Impact factor: 38.330

7.  A Roadmap for Human Liver Differentiation from Pluripotent Stem Cells.

Authors:  Lay Teng Ang; Antson Kiat Yee Tan; Matias I Autio; Su Hua Goh; Siew Hua Choo; Kian Leong Lee; Jianmin Tan; Bangfen Pan; Jane Jia Hui Lee; Jen Jen Lum; Christina Ying Yan Lim; Isabelle Kai Xin Yeo; Chloe Jin Yee Wong; Min Liu; Jueween Ling Li Oh; Cheryl Pei Lynn Chia; Chet Hong Loh; Angela Chen; Qingfeng Chen; Irving L Weissman; Kyle M Loh; Bing Lim
Journal:  Cell Rep       Date:  2018-02-20       Impact factor: 9.423

8.  Deletion of complement factor H-related genes CFHR1 and CFHR3 is associated with atypical hemolytic uremic syndrome.

Authors:  Peter F Zipfel; Matthew Edey; Stefan Heinen; Mihály Józsi; Heiko Richter; Joachim Misselwitz; Bernd Hoppe; Danny Routledge; Lisa Strain; Anne E Hughes; Judith A Goodship; Christoph Licht; Timothy H J Goodship; Christine Skerka
Journal:  PLoS Genet       Date:  2007-02-01       Impact factor: 5.917

9.  Spontaneous hemolytic uremic syndrome triggered by complement factor H lacking surface recognition domains.

Authors:  Matthew C Pickering; Elena Goicoechea de Jorge; Rubén Martinez-Barricarte; Sergio Recalde; Alfredo Garcia-Layana; Kirsten L Rose; Jill Moss; Mark J Walport; H Terence Cook; Santiago Rodriguez de Córdoba; Marina Botto
Journal:  J Exp Med       Date:  2007-05-21       Impact factor: 14.307

Review 10.  Complement factor H in host defense and immune evasion.

Authors:  Raffaella Parente; Simon J Clark; Antonio Inforzato; Anthony J Day
Journal:  Cell Mol Life Sci       Date:  2016-12-10       Impact factor: 9.261

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.