Literature DB >> 29177435

De novo mutations implicate novel genes in systemic lupus erythematosus.

Venu Pullabhatla1, Amy L Roberts2, Myles J Lewis3, Daniele Mauro3, David L Morris2, Christopher A Odhams2, Philip Tombleson2, Ulrika Liljedahl4, Simon Vyse2, Michael A Simpson2, Sascha Sauer5, Emanuele de Rinaldis1, Ann-Christine Syvänen4, Timothy J Vyse2.   

Abstract

The omnigenic model of complex disease stipulates that the majority of the heritability will be explained by the effects of common variation on genes in the periphery of core disease pathways. Rare variant associations, expected to explain far less of the heritability, may be enriched in core disease genes and thus will be instrumental in the understanding of complex disease pathogenesis and their potential therapeutic targets. Here, using complementary whole-exome sequencing, high-density imputation, and in vitro cellular assays, we identify candidate core genes in the pathogenesis of systemic lupus erythematosus (SLE). Using extreme-phenotype sampling, we sequenced the exomes of 30 SLE parent-affected-offspring trios and identified 14 genes with missense de novo mutations (DNM), none of which are within the >80 SLE susceptibility loci implicated through genome-wide association studies. In a follow-up cohort of 10, 995 individuals of matched European ancestry, we imputed genotype data to the density of the combined UK10K-1000 genomes Phase III reference panel across the 14 candidate genes. Gene-level analyses indicate three functional candidates: DNMT3A, PRKCD, and C1QTNF4. We identify a burden of rare variants across PRKCD associated with SLE risk (P = 0.0028), and across DNMT3A associated with two severe disease prognosis sub-phenotypes (P = 0.0005 and P = 0.0033). We further characterise the TNF-dependent functions of the third candidate gene C1QTNF4 on NF-κB activation and apoptosis, which are inhibited by the p.His198Gln DNM. Our results identify three novel genes in SLE susceptibility and support extreme-phenotype sampling and DNM gene discovery to aid the search for core disease genes implicated through rare variation.
© The Author(s) 2017. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29177435      PMCID: PMC5886157          DOI: 10.1093/hmg/ddx407

Source DB:  PubMed          Journal:  Hum Mol Genet        ISSN: 0964-6906            Impact factor:   6.150


Introduction

Considerable progress has been made in elucidating the genetic basis of complex diseases. The vast majority of identified disease-associated genetic polymorphisms are common in the population and the risk alleles impart a modest individual increment to the likelihood of developing disease. Although large-scale genome-wide association studies (GWAS) have so far explained less of the heritability than originally predicted (1), much of the ‘missing heritability’ is expected to be accounted for by common variants with effect sizes below the genome-wide significance threshold (2). However, under the newly proposed omnigenic model of complex traits, the majority of associated common variants—both identified and unidentified—will primarily be found in periphery genes expressed in relevant cell types but not necessarily biologically relevant to disease (3). In contrast, the role of rare variants in complex disease is largely unknown and often dismissed. A recent study, however, with an extremely large sample size, identified rare and low frequency variants contributing to the genetic variance of adult human height (4)—a polygenic trait with a genetic architecture similar to that of complex diseases (5)—suggesting previous complex disease studies with seemingly large sample sizes were perhaps still insufficiently powered to detect rare variant associations (6). Furthermore, studies of rare variants typically find gene sets enriched in biologically relevant functions/pathways (3,7,8). Therefore, although estimated to explain less of the heritable disease risk at a population level than common variants, identifying rare and low frequency variants is of paramount importance to understanding disease pathogenesis as they are likely to implicate biologically relevant core genes (3). The underrepresentation of rare variant associations within GWAS loci supports the theory that a discrete set of genes will be implicated through rare variants (9). Exome-wide searches, which provides a highly enriched source of potential disease-causing mutations (10), have revealed limited numbers of rare variation associated with complex diseases. Even though greater statistical power is achieved by gene-level analyses whereby aggregated variants are tested for an allelic burden of collective rare variation, widely used gene-based association tests have been shown to lack power at the exome-wide level (11). Coupled with the insufficient sample sizes currently available in the study of most complex diseases, hypothesis-free searches for core genes with rare variant associations are unlikely to be fruitful. Our strategy to address this problem in autoimmune disease SLE (SLE; MIM 152700), is outlined here and summarised in Figure 1. Using a discovery cohort of 30 unrelated SLE cases with a severe disease (young age of onset and clinical features associated with poorer outcome), we hypothesized that these individuals would exhibit unique mutation events in their protein-coding DNA that may predispose to disease risk. We undertook whole-exome sequencing (WES) in 30 family trios (both parents and affected offspring) and scrutinized the data for non-inherited de novo mutations (DNM) in the individual with SLE to identify a group of candidate genes for an independent follow-up rare variant analysis. This method allowed the identification of novel loci harbouring disease risk through collective rare variation, and emphasises the value of phenotypic extremes in the search for core genes in multifactorial disorders (12).
Figure 1.

Overview of study. De novo mutations (DNM) in a discovery cohort revealed candidate genes for imputation-based rare variant burden testing using a follow-up cohort. Independent functional analyses demonstrate the functional effects of one DNM in a candidate gene.

Overview of study. De novo mutations (DNM) in a discovery cohort revealed candidate genes for imputation-based rare variant burden testing using a follow-up cohort. Independent functional analyses demonstrate the functional effects of one DNM in a candidate gene.

Results

Identification of DNM in extreme-phenotype SLE cases

We screened for DNM by WES of 30 family trios with an affected offspring with more severe SLE (Supplementary Material, Fig. S1). A total of 584 798 variants (≥20X), including single nucleotide variants and indels, were identified in the 30 affected probands. Using three bioinformatic tools and employing conservative parameters, 17 putative missense DNM were identified across 17 genes (Supplementary Material, Table S1, Fig. S2). We also analysed the SLE proband WES data alone, without the unaffected parents. This revealed 1194 non-silent, heterozygous, rare variants in 1, 067 genes distributed across the genome, which would make prioritization for downstream analysis a difficult task, highlighting the benefit of parent-offspring trio sequencing (Supplementary Material, Fig. S3). Sanger sequencing confirmed 14 true positive non-silent DNM (Table 1; Supplementary Material, Table S2), present in the SLE proband but absent in both parents and any unaffected siblings, in 11 of the 30 probands (36.7%) for further analysis. No DNM was found in any of the >80 known SLE-associated genes. Of the three false positive DNM (11.7%; Supplementary Material, Table S1) one, within LAMC2, is likely a result of germline mosaicism because, although not observed in either parent, it is observed in an unaffected sibling in addition to the SLE proband (13), and the other two variants are within KRTAP10–2 and KLRC1—both members of highly homologous gene families. Such sequence identity may have caused false positive identification of DNM in the WES analysis and suggests our NGS error-prone genes (NEPG) filter, which removes loci known to be problematic for genome mapping during NGS analyses, should have been more conservative. Indeed the KRLC1 p.Ile225Met missense variant appears to be a polymorphic Paralogous Sequence Variant (PSV)—the paralogous variant being p.Met223Ile in KLRC2.
Table 1.

De novo mutations in SLE probands with extreme phenotypes

FamilyMutation (chr: position ref: alt)GeneGene descriptionExonAmino acidMAF in ExACaCADD PhredMutation typeb
SLE075122: 38336799 C: TMICALL1MICAL-like 116Arg852Cys1.5 × 10−435Ti CpG
SLE04963: 53223122 G: APRKCDprotein kinase C, delta16Gly535Arg34Ti CpG
SLE067912: 57588368 C: TLRP1Low-density lipoprotein receptor-related protein 150Arg2693Cys8.3 × 10−434Ti CpG
SLE05926: 36260896 G: APNPLA1patatin-like phospholipase domain containing 13Arg166His5.8 × 10−533Ti CpG
SLE02962: 25457236 G: ADNMT3ADNA (cytosine-5-)-methyltransferase 3 alpha19Ala695Val32Ti CpG
SLE05714: 79512728 G: TANXA3annexin A37Ser145Ile25.2Tv
SLE06793: 171431716 G: APLD1phospholipase D1, phosphatidylcholine-specific9Thr293Met5.8 × 10−525.1Ti CpG
SLE04115: 179743769 C: TGFPT2glutamine-fructose-6-phosphate transaminase 212Val383Met2.6 × 10−523.4Ti CpG
SLE06797: 138968784 C: AUBN2ubinuclein 215Pro1045Thr18.46Tv
SLE008016: 2812426 C: TSRRM2serine/arginine repetitive matrix 211Arg633Cys14.32Ti CpG
SLE085211: 47611769 G: CC1QTNF4C1q and tumor necrosis factor related protein 42His198Gln12.29Tv
SLE032118: 61621642 G: AHMSDhistocompatibility (minor) serpin domain containing3Ala25Thr9.732Ti
SLE039012: 32369376 G: CBICD1bicaudal D homolog 1 (Drosophila)2Val137Leu8.673Tv
SLE03211: 35251125 C: GGJB3gap junction protein, beta 32Asp254Glu0.002Tv

The mutations are ordered by level of severity, from most to least, predicted by CADD score.

Frequencies are presented from all 61 468 multiethnic individuals in ExAC because the de novo mutations observed in ExAC are likely to be identity-by-state not identity-by-descent.

Tv = Transversion; Ti = Transition; Ti CpG = Transition within a CpG dinucleotide.

De novo mutations in SLE probands with extreme phenotypes The mutations are ordered by level of severity, from most to least, predicted by CADD score. Frequencies are presented from all 61 468 multiethnic individuals in ExAC because the de novo mutations observed in ExAC are likely to be identity-by-state not identity-by-descent. Tv = Transversion; Ti = Transition; Ti CpG = Transition within a CpG dinucleotide.

Variant- and gene-level functional characterization of DNM

In order to best predict the phenotypic effect of the 14 DNM, we used both variant-level and gene-level metrics (14). We used the ExAC database (15) and Combined Annotation Dependent Depletion (CADD) scores (16) to characterise the frequency and predicted functional effects, respectively, of the variants. Five of the 14 DNM—found in MICALL1, LRP1, PNPLA1, PLD1, and GFTP2—have been observed, at very rare frequencies, in the ∼60 000 exomes documented in ExAC (Table 1). All five mutations are CpG transitions and therefore likely to be identity-by-state, reflecting the higher mutability rate of these sites. Within the mutation set, five (35.7%)—found in DNMT3A, PRKCD, MICALL1, LRP1, and PNPLA1—have CADD Phred scores >30, placing them in the top 0.1% of possible damaging mutations in the human genome (Table 1). We further explored the function, expression (BioGPS), existing autoimmunity associations (ImmunoBase), and gene-level constraint against missense mutations (ExAC), of the DNM genes to build a profile of a priori evidence of a role in SLE pathogenesis. None of the candidate genes have been previously associated with SLE through GWAS in any population (17). We also identify candidate genes through known/predicted function and expression profiles (C1QTNF4, SRRM2, HMSD), and four genes (PRKCD, DNMT3A, C1QTNF4 and LRP1) with a significant (Z > 3.09) constraint against missense variants (Table 2). However, across the entire gene set, there was no difference in the median Z-score (0.50) compared with the median Z-score across all genes in ExAC (0.51).
Table 2.

Evidence for role of de novo mutation gene in autoimmunity

GeneFunctional candidateaAssociation with SLEbAssociations with other AIDbImmune cell type with highest expressioncMissense constraintd
PRKCDB cell signaling and self-antigen induced B cell tolerance inductionMonogenic forms30IBD, UC, CD28Dendritic3.75*
DNMT3ADNA methyltransferaseCandidate gene study35CD294.31*
C1QTNF4Pro-inflammatory cytokineCD34+3.17*
SRRM2Spliceosome-associated pre-mRNA splicingCD8+No data
LRP1Endo/Phagocytosis of apoptotic cells10.60*
HMSDMinor histocompatibility antigenn/a0.25
UBN2DNA binding0.01
ANXA3RA17−0.37
PLD1Lymphoblasts−0.73
PNPLA10.27
GFPT21.59
BICD12.12
GJB3−0.81
MICALL10.50

Genes appear in descending order of supporting evidence. UC = ulcerative colitis, CD = Crohn’s Disease, IBD = inflammatory bowel disease, RA = Rheumatoid Arthritis.

See Supplementary Material, Table S5.

See Supplementary Material, Table S6.

See Supplementary Material, Figure S4. Data from BioGPS. If gene expression is highest in immune cells compared with all other cells, the immune cell type with highest expression is listed.

Gene-wise ExAC Constraint Z-scores. Genes with significant restraint against missense variants are highlighted with an asterisk.

Evidence for role of de novo mutation gene in autoimmunity Genes appear in descending order of supporting evidence. UC = ulcerative colitis, CD = Crohn’s Disease, IBD = inflammatory bowel disease, RA = Rheumatoid Arthritis. See Supplementary Material, Table S5. See Supplementary Material, Table S6. See Supplementary Material, Figure S4. Data from BioGPS. If gene expression is highest in immune cells compared with all other cells, the immune cell type with highest expression is listed. Gene-wise ExAC Constraint Z-scores. Genes with significant restraint against missense variants are highlighted with an asterisk.

PRKCD and DNMT3A are associated with SLE through collective rare variation

Although the variant- and gene-level metric analyses suggested intriguing functional candidates, we took a comprehensive approach and tested each locus for an allelic burden of rare variation. We hypothesized that, while some observed DNM were random background variation as present in the exome of every individual regardless of disease status (18), others may be reflecting a hitherto unknown gene contributing to SLE risk, and this may be shown through rare variant burden. Therefore, genotype data were imputed (Supplementary Material, Figs S6 and S7) to the density of the combined UK10K and 1000 genomes Phase III reference panel (UK10K-1000GP3) across all 14 DNM genes in a follow-up cohort of 10 995 individuals of matched European ancestry previously genotyped on the Illumina HumanOmni1 BeadChip (19). Under the hypothesis that rare variants at these loci would be causal and not protective, we employed a one-tailed collapsing burden test (20) to survey each of the 14 genes for an excess of aggregated rare (MAF < 1%) exonic variants in SLE cases compared with healthy controls. We identify an association of PRKCD rare variants with SLE (Supplementary Material, Table S3; P = 0.0028; ncases=4036). In sub-phenotype analyses, we identify collective rare exonic variants in DNMT3A associated with both anti-dsDNA (Supplementary Material, Table S3; P = 0.0005; ncases=1261) and renal involvement with hypocomplementemia (Supplementary Material, Table S3; P = 0.0033; ncases=186), both of which are markers of more severe disease. We also collapsed all exons from the 14 genes together to test for an overall burden of rare variants across these loci. These analyses revealed no excess of rare exonic variants across the grouped genes, reflecting the hypothesis that some/most genes will not be relevant to disease status because the observed DNM are random background variation only. These data reflect the results of our gene-level constraint metric, in which the aggregated gene set do not have a significant mutation constraint. Together, these results suggest further prioritization based on gene-level metrics would not have resulted in true positive associations being excluded from analyses.

Implication of C1QTNF4 in SLE through functional effect of DNM p.His198Gln

Although no rare variant association was found at the novel candidate gene C1QTNF4, its potential role in disease is supported by gene-level metrics—it is a compelling functional candidate and one of four genes constrained against missense variants (ExAC gene-level constraints Z = 3.17, Table 2). Although gene coding length does not correlate with missense constraint scores (15), the small (<1Kb) coding sequence of this candidate gene may have contributed to insufficient power to detect a rare variant association in the burden testing. On the variant-level, the DNM in C1QTNF4 generates a p.His198Gln sequence change with a modest CADD score of 12.3 (Table 1). Although useful in the absence of suitable functional assays, the sensitivity of bioinformatic prediction tools is known to be suboptimal. Where functional assays are available, previous studies have also demonstrated functional effects of variants predicted to be tolerated/benign (21). We therefore pursued a functional analysis of the p.His198Gln DNM detected in the C1QTNF4 gene as an alternative method to add support for its potential role in disease. Although its function is rather poorly understood, the protein product, C1QTNF4 (CTRP4) is secreted and may act as a cytokine, as it has homology with TNF and the complement component C1q (Fig. 2). C1QTNF4 has been shown to influence NF-κB activation (22), a pathway known to be implicated in SLE pathogenesis, therefore we looked for an effect of the p.His198Gln mutation on NF-κB production. Using a HEK293-NF-κB reporter cell line, we showed that C1QTNF4 p.His198Gln mutant protein was expressed and that it inhibited the NF-κB activation generated by exposure to TNF (Fig. 2). Furthermore, we showed that the fibroblast L929 cell line, which is sensitive to TNF-induced cell death, was rescued by exposure to C1QTNF4 p.His198Gln, but not by wild type C1QTNF4. Thus, the mutant form of C1QTNF4 appears to inhibit some of the actions of TNF (23–25).
Figure 2.

Structural and functional characterization of C1QTNF4 p.His198Gln substitution. (A) Domain organization of human C1QTNF4, showing signal peptide (yellow), first C1q domain (green), second C1q domain (blue) and linker peptides (grey). Arrow highlights substitution site. (B) 3D structure prediction of C1QTNF4 and C1QTNF4 p.His198Gln using Phyre2 (47). Ribbons show the interaction between the positively charged Histidine 198 and Proline 196 lost in C1QTNF4 p.His198Gln due to the substitution of Histidine with Glutamine. (C) Immunoblot demonstrating that p.His198Gln does not affect secretion of C1QTNF4 in HEK293 supernatants. (D) Size exclusion chromatography profile showing no difference in oligomerization between supernatant containing C1QTNF4 (blue) and C1QTNF4 p.His198Gln (red). (E) Luciferase assay in HEK293-NF-κB reporter cell line showing that C1QTNF4 p.His198Gln inhibits NF-κB activation in response to 4 h stimulation with 5 ng/ml TNFα. Error bars represent standard error of the mean. (F) Inhibition of L929 induced cell death by C1QTNF4 p.His198Gln after 24h of stimulation with 0.45 ng/ml TNFα in presence of Actinomycin 1 μg/ml. EV = empty vector.

Structural and functional characterization of C1QTNF4 p.His198Gln substitution. (A) Domain organization of human C1QTNF4, showing signal peptide (yellow), first C1q domain (green), second C1q domain (blue) and linker peptides (grey). Arrow highlights substitution site. (B) 3D structure prediction of C1QTNF4 and C1QTNF4 p.His198Gln using Phyre2 (47). Ribbons show the interaction between the positively charged Histidine 198 and Proline 196 lost in C1QTNF4 p.His198Gln due to the substitution of Histidine with Glutamine. (C) Immunoblot demonstrating that p.His198Gln does not affect secretion of C1QTNF4 in HEK293 supernatants. (D) Size exclusion chromatography profile showing no difference in oligomerization between supernatant containing C1QTNF4 (blue) and C1QTNF4 p.His198Gln (red). (E) Luciferase assay in HEK293-NF-κB reporter cell line showing that C1QTNF4 p.His198Gln inhibits NF-κB activation in response to 4 h stimulation with 5 ng/ml TNFα. Error bars represent standard error of the mean. (F) Inhibition of L929 induced cell death by C1QTNF4 p.His198Gln after 24h of stimulation with 0.45 ng/ml TNFα in presence of Actinomycin 1 μg/ml. EV = empty vector.

DNM genes do not harbour common variant associations

We next tested for additional common variant associations at these 14 loci using the high-density UK10K-1000GP3 imputed data. No significant association at any locus was observed with overall risk in a case-control comparison (ncases=4036), nor with anti-dsDNA (ncases=1261) or renal involvement with hypocomplementemia (ncases=186) sub-phenotypes (Supplementary Material, Table S4). The lack of an associated common variant within PRKCD and DNMT3A supports the hypothesis that discrete gene sets will be identified through rare and common variant associations, with the former expecting to be enriched for core disease genes (3).

Discussion

To fully understand the pathogenesis of complex diseases we must analyse the full frequency spectrum of genetic variants (4). The study of rare variants associated with disease is of paramount importance to the discovery of core genes that have the potential to be therapeutic targets (12). Our data support the omnigenic hypothesis that rare genetic risk may be found in a discrete set of non-canonical susceptibility genes, as we report an association of collective rare variation across PRKCD and DNMT3A, and found no evidence of an association with common variants across these loci. This, to the best of our knowledge, is the first WES study in polygenic cases of autoimmune disease to use DNM discovery to identify candidate genes for rare variant analyses. Furthermore, our study supports the importance of phenotypic extremes in elucidating the genetic basis of multifactorial disorders (26). Searching GWAS-identified canonical disease susceptibility genes for additional rare variant risk has not been fruitful. Although there are examples—and perhaps more to discover—of canonical disease genes harbouring both common and rare risk alleles (27), the vast majority of such loci do not. Indeed the common variant associated loci which have also been shown to harbour rare coding variant risk are often those distinct minority of loci where the common polymorphisms are non-silent coding variants [e.g. NCF2 (9)]. It is important to note, however, that the separation of periphery and core genes may not necessarily be binary (3). DNMT3A and PRKCD, although hitherto not associated with polygenic SLE, are known autoimmunity susceptibility loci; DNMT3A is associated with Crohn’s disease (CD) (28) and PRKCD is associated with both CD and ulcerative colitis (UC) (29). The notion that a locus could harbour common variants contributing to one autoimmune disease and rare variants contributing to another is intriguing, and could provide further hypothesis-driven searches in the hunt for disease-specific core genes. A functional missense variant p.G510S (c.G1528A) in PRKCD has previously been reported in a consanguineous family with monogenic SLE (30). It was demonstrated that the PRKCD-encoded protein, PRCδ, was essential in the regulation of B cell tolerance and affected family members with the homozygous mutation had increased numbers of immature B cells. Our study implicates the role of rare variants in PRKCD in the broader context of SLE susceptibility, beyond a monogenic recessive disease model. Indeed the analysis of rare and low frequency variants contributing to human height found significant overlap with genes mutated in monogenic growth disorders (4). Furthermore, PRKCB, another member of the protein kinase C gene family, has been implicated in SLE risk (31). DNMT3A, a DNA methyltransferase, is a very intriguing candidate gene for SLE as altered patterns of DNA methylation are reported in autoimmune diseases (32), and hypomethylation of apoptotic DNA has been reported to induce autoantibody production in SLE (33). DNA methylation changes are also associated with monozygotic twin discordance in SLE (34). A candidate gene study previously reported a trend of association between the common DNMT3A intronic SNP rs1550117 (MAF∼7%) and SLE in a European cohort (35). Our analysis did not replicate this finding (P = 0.23) and found no evidence of a common variant association at this locus. Instead, we find an association of collective rare variants and SLE sub-phenotypes and emphasises the importance of deep phenotyping and the potential role of rare variants in specific sub-phenotype, or indeed autoimmune, manifestations. Despite progress with diagnosis and treatment, particular SLE sub-phenotypes—including those used in this study—are still associated with reduced life expectancy. Therefore, elucidating the specific underlying genetic risk is of paramount importance. Through two in vitro assays, we demonstrated the functional effect of a DNM, p.His198Gln in candidate gene C1QTNF4, despite this mutation being predicted to be of little functional importance across variant-level prediction tools. We showed the mutated protein product of C1QTNF4, C1QTNF4, inhibits some TNF-mediated cellular responses, including activation of NF-κB and TNF-induced apoptosis. The role of TNF in SLE is complex and incompletely understood, although, in this context, it is noteworthy that TNF inhibition may promote antinuclear autoimmunity (24). Gene-level metrics for C1QTNF4 were supportive of a role in disease and our result support the importance of combined gene- and variant-level metrics, and the dangers of relying heavily on variant-level metrics alone, when interpreting the potential role of mutations (14). C1QTNF6 is a known susceptibility locus for Type 1 Diabetes and is implicated in Rheumatoid Arthritis (36,37), and a suggestive association with SLE has recently been described in a transancestral Immunochip analysis (38). Together, these data suggest a potential role of the hitherto understudied C1QTNF superfamily of genes in autoimmunity. Although our study allowed a comprehensive approach to test all DNM genes for allelic burden of rare variants, our results show that filtering based on gene- or variant-level metrics would not have resulted in true associations of DNMT3A and PRKCD being missed. When larger datasets require further prioritization of genes, we suggest both variant- and gene-level metrics are used. Each human—regardless of the disease status—is estimated to have one DNM in their exome (18). The simple presence of a provisionally functional DNM in a proband is therefore not sufficient evidence that it contributes to disease risk. A major challenge of WES studies, therefore, is how to differentiate between variants truly important to disease and background variation (39). In light of recent studies which have demonstrated the limitations of large-scale exome-wide case-control studies in detecting rare variant associations (6,40), despite such associations being found when no limitation on sample size exists (4), our results support extreme-phenotype sampling and DNM discovery to aid a hypothesis-driven search for rare variant associations with complex diseases, in the hunt to determine core disease genes.

Materials and Methods

Selection of trios for sequencing

SLE patients of European ancestry—as determined by genome-wide genotyping as part of a GWAS (19)—were selected from the UK SLE genetic repository assembled in the Vyse laboratory on the following criteria: age of onset of SLE < 25 years (median age 21 years); more marked disease phenotype as shown by either evidence for renal involvement as per standard classification criteria and/or the presence of hypocomplementemia and anti-dsDNA autoantibodies; and DNA available from both unaffected parents. The 30 trios (90 individuals) were exome sequenced, as described in SI Methods. Ethical approval for the research was granted by the NRES Committee London (12/LO/1273 and 06/MRE02/9).

DNM calling

Three bioinformatics tools with conservative parameters were used for DNM screening: BCFtools (41), DeNovoGear (42) and DeNovoCheck (43). A detailed description of the methods applied can be found in SI Methods. Briefly, 454 variants were identified with BCFtools and DeNovoGear and eight additional variants were identified by DeNovoCheck and validated by IGV, resulting in a total of 462 variants, which map to 257 genes. The variants were next filtered sequentially filtered (Supplementary Material, Fig. S2): (A) Removal of NEPG; (B) Fulfil a Het: Ref: Ref for Child: Father: Mother de novo pattern of inheritance and further selected variants that did not contain any trace of alternate allele in any of the parents; (C) Non-silent variant annotation. This process resulted in a total of 17 variants in 17 genes (Supplementary Material, Table S1).

Analysis of WES in cases only

584, 798 variants with ≥20X coverage depth and within Gencode capture regions were identified in the analysis of 30 SLE probands only. Stringent filters were applied for variant refinement, described in full in SI Methods, resulting in 1194 variants in 1067 genes (Supplementary Material, Fig. S3).

Sanger sequencing confirmation

Primers were designed using Primer 3. 10ng of DNA from SLE probands, any unaffected siblings and both parents was amplified with Hot Start Taq polymerase. PCR products were first purified with EXO-SAP before BigDye labelling in a linear PCR and sequenced on an ABI 3300XL. Primers and PCR conditions available on request. The reads were analysed using Chromas Lite (v.2.1.1).

Imputation

Illumina HumanOmni1 BeadChip genotype data from 6995 controls and 4036 SLE patients of matched European ancestry were used, which had undergone quality control as previously described including Principal Component Analysis (PCA) to account for population structure (19). The UK10K (REL-2012–06-02) plus 1000 Genomes Project Phase3 data (release 20131101.v5) merged reference panel (UK10K-1000GP3) was accessed through the European Genome-phenome Archive (EGAD00001000776). The genotype data were imputed using the UK10K-1000GP3 reference panel across the coding regions of the 14 DNM genes plus a 2Mb flanking region. To increase the accuracy of imputed genotype calls, a full imputation without pre-phasing was conducted using IMPUTE2 (44,45). Imputed genotypes were filtered for confidence using an info score (IMPUTE2) threshold of 0.3 (Supplementary Material, Figs S6 and S7). The most likely genotype from IMPUTE2 was taken if its probability was > 0.5. If the probability fell below this threshold, it was set as missing. Variants with >10% missing genotype calls were removed for further analysis. All individuals had <8% missing genotype data.

Rare variant burden tests

Imputed data were filtered, using Plink v1.9, to include only variants mapping to coding exons of hg19 RefSeq transcripts. Plink/SEQv1.0 (20) was used to run gene-wise one-tailed burden testing with a MAF < 1% threshold. A 5% false discovery rate was used for multiple testing correction for 14 genes.

Common variant association tests

SNPTEST 2.5.2 (46) was used to test for associated variants with MAF > 1% across the region spanning the encoded gene. The first four covariates from the original GWAS were included (19). Bonferroni correction was used for 3000 tests across the loci (q = 1.66E-5).

Plasmids

Myc-Flag-tagged C1QTNF4 on the pCMV6 vector and the empty pCMV6 vector were used (OriGene). The mutant pCMV6-C1QTNF4 C594G (p.His198Gln) was generated by site-directed mutagenesis (Quikchange II XL; Stratagene) according the manufacturer’s instructions: mutagenic primer: 5’-GCGAGTGGTTGCTGCCGCGGCCC-3’ (Sigma-Aldrich). The plasmids production was carried out in XL10-Gold Ultracompetent cells, isolated and purified using EndoFree Maxi Prep kit (Qiagen) and plasmid ORFs were confirmed by full Sanger sequencing (GATC-Biotech). The expression and secretion of the flagged proteins was confirmed by western blot on cell lysates and supernatants with monoclonal anti-FLAG antibody (clone M2; Sigma-Aldrich).

Luciferase assays and TNF-induced programmed cell death

GloResponse NF-κB-RE-luc2P HEK293 cell line (Promega) and TNF-sensitive L929 fibrosarcoma cell line (ATCC) were cultured in Dulbecco's Modified Eagle Medium (DMEM) supplemented with 10% fetal bovine serum (FBS) and 1% Penicillin/Streptomycin at 37 °C, 5% CO2. HEK293 were seeded 24 h before transfection in antibiotic free DMEM in 96 wells plate (2 × 104 cells/well), transfected with either C1QTNF4, C1QTNF4 C594G or Empty Vector via Fugene HD (Promega). Forty eight hours after transfection the cell were left unstimulated or stimulated with TNFα 5 ng/ml (PeproTech) for 4 h. Luciferase activity was assayed by One-Glo (Promega) on Berthold Orion luminometer, the values were normalized to cell viability measured by CellTiter Glo (Promega). L929 were challenged with TNFα 0.45 ng/ml and Actinomycin D 1 μg/ml (R&D) for 24 h in presence of C1QTNF4 or C1QTNF4 p.His198Gln containing media, cell viability was measured by CellTiter Glo.

Size exclusion chromatography

Supernatants (750 µl) of HEK293 producing C1QTNF4 or C1QTNF4 p.His198Gln were buffer exchanged in PBS on Zeba Spin Desalting Columns (Thermo Fisher) and 0.5 ml loaded on an AKTA FPLC with a Superdex 200 10/300 GL column (GE Healthcare). Absorbance was normalized to the maximum peak of each sample.

Supplementary Material

Supplementary Material is available at HMG online. Click here for additional data file.
  43 in total

1.  Protein structure prediction on the Web: a case study using the Phyre server.

Authors:  Lawrence A Kelley; Michael J E Sternberg
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

Review 2.  Genetic advances in systemic lupus erythematosus: an update.

Authors:  Lingyan Chen; David L Morris; Timothy J Vyse
Journal:  Curr Opin Rheumatol       Date:  2017-09       Impact factor: 5.006

Review 3.  De novo mutations in human genetic disease.

Authors:  Joris A Veltman; Han G Brunner
Journal:  Nat Rev Genet       Date:  2012-07-18       Impact factor: 53.242

Review 4.  Revealing rate-limiting steps in complex disease biology: The crucial importance of studying rare, extreme-phenotype families.

Authors:  Aravinda Chakravarti; Tychele N Turner
Journal:  Bioessays       Date:  2016-04-08       Impact factor: 4.345

5.  Identification of C1qTNF-related protein 4 as a potential cytokine that stimulates the STAT3 and NF-κB pathways and promotes cell survival in human cancer cells.

Authors:  Qi Li; Lanlan Wang; Weifeng Tan; Zhi Peng; Yang Luo; Yingmei Zhang; Guoying Zhang; Daxiang Na; Peng Jin; Taiping Shi; Dalong Ma; Lu Wang
Journal:  Cancer Lett       Date:  2011-06-11       Impact factor: 8.679

6.  Common SNPs explain a large proportion of the heritability for human height.

Authors:  Jian Yang; Beben Benyamin; Brian P McEvoy; Scott Gordon; Anjali K Henders; Dale R Nyholt; Pamela A Madden; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael E Goddard; Peter M Visscher
Journal:  Nat Genet       Date:  2010-06-20       Impact factor: 38.330

7.  Formation of antinuclear and double-strand DNA antibodies and frequency of lupus-like syndrome in anti-TNF-α antibody-treated patients with inflammatory bowel disease.

Authors:  Florian Beigel; Fabian Schnitzler; Rüdiger Paul Laubender; Simone Pfennig; Maria Weidinger; Burkhard Göke; Julia Seiderer; Thomas Ochsenkühn; Stephan Brand
Journal:  Inflamm Bowel Dis       Date:  2011-01       Impact factor: 5.325

8.  Rare and common variants in CARD14, encoding an epidermal regulator of NF-kappaB, in psoriasis.

Authors:  Catherine T Jordan; Li Cao; Elisha D O Roberson; Shenghui Duan; Cynthia A Helms; Rajan P Nair; Kristina Callis Duffin; Philip E Stuart; David Goldgar; Genki Hayashi; Emily H Olfson; Bing-Jian Feng; Clive R Pullinger; John P Kane; Carol A Wise; Raphaela Goldbach-Mansky; Michelle A Lowes; Lynette Peddle; Vinod Chandran; Wilson Liao; Proton Rahman; Gerald G Krueger; Dafna Gladman; James T Elder; Alan Menter; Anne M Bowcock
Journal:  Am J Hum Genet       Date:  2012-04-19       Impact factor: 11.025

9.  Genetic association analyses implicate aberrant regulation of innate and adaptive immunity genes in the pathogenesis of systemic lupus erythematosus.

Authors:  James Bentham; David L Morris; Deborah S Cunninghame Graham; Christopher L Pinder; Philip Tombleson; Timothy W Behrens; Javier Martín; Benjamin P Fairfax; Julian C Knight; Lingyan Chen; Joseph Replogle; Ann-Christine Syvänen; Lars Rönnblom; Robert R Graham; Joan E Wither; John D Rioux; Marta E Alarcón-Riquelme; Timothy J Vyse
Journal:  Nat Genet       Date:  2015-10-26       Impact factor: 38.330

10.  Exploring the genetic architecture of inflammatory bowel disease by whole-genome sequencing identifies association at ADCY7.

Authors:  Yang Luo; Katrina M de Lange; Luke Jostins; Loukas Moutsianas; Joshua Randall; Nicholas A Kennedy; Christopher A Lamb; Shane McCarthy; Tariq Ahmad; Cathryn Edwards; Eva Goncalves Serra; Ailsa Hart; Chris Hawkey; John C Mansfield; Craig Mowat; William G Newman; Sam Nichols; Martin Pollard; Jack Satsangi; Alison Simmons; Mark Tremelling; Holm Uhlig; David C Wilson; James C Lee; Natalie J Prescott; Charlie W Lees; Christopher G Mathew; Miles Parkes; Jeffrey C Barrett; Carl A Anderson
Journal:  Nat Genet       Date:  2017-01-09       Impact factor: 41.307

View more
  13 in total

Review 1.  'There and Back Again'-Forward Genetics and Reverse Phenotyping in Pulmonary Arterial Hypertension.

Authors:  Emilia M Swietlik; Matina Prapa; Jennifer M Martin; Divya Pandya; Kathryn Auckland; Nicholas W Morrell; Stefan Gräf
Journal:  Genes (Basel)       Date:  2020-11-26       Impact factor: 4.096

2.  Systemic lupus erythematosus as a genetic disease.

Authors:  Isaac T W Harley; Amr H Sawalha
Journal:  Clin Immunol       Date:  2022-02-09       Impact factor: 10.190

Review 3.  Monogenic systemic lupus erythematosus: insights in pathophysiology.

Authors:  Ezgi Deniz Batu
Journal:  Rheumatol Int       Date:  2018-05-15       Impact factor: 2.631

4.  One for all and all for One: Improving replication of genetic studies through network diffusion.

Authors:  Daniel Lancour; Adam Naj; Richard Mayeux; Jonathan L Haines; Margaret A Pericak-Vance; Gerard D Schellenberg; Mark Crovella; Lindsay A Farrer; Simon Kasif
Journal:  PLoS Genet       Date:  2018-04-23       Impact factor: 5.917

5.  Exploring Impact of Rare Variation in Systemic Lupus Erythematosus by a Genome Wide Imputation Approach.

Authors:  Manuel Martínez-Bueno; Marta E Alarcón-Riquelme
Journal:  Front Immunol       Date:  2019-02-26       Impact factor: 7.561

6.  Molecular pathways in patients with systemic lupus erythematosus revealed by gene-centred DNA sequencing.

Authors:  Johanna K Sandling; Pascal Pucholt; Lina Hultin Rosenberg; Fabiana H G Farias; Sergey V Kozyrev; Maija-Leena Eloranta; Andrei Alexsson; Matteo Bianchi; Leonid Padyukov; Christine Bengtsson; Roland Jonsson; Roald Omdal; Benedicte A Lie; Laura Massarenti; Rudi Steffensen; Marianne A Jakobsen; Søren T Lillevang; Karoline Lerang; Øyvind Molberg; Anne Voss; Anne Troldborg; Søren Jacobsen; Ann-Christine Syvänen; Andreas Jönsen; Iva Gunnarsson; Elisabet Svenungsson; Solbritt Rantapää-Dahlqvist; Anders A Bengtsson; Christopher Sjöwall; Dag Leonard; Kerstin Lindblad-Toh; Lars Rönnblom
Journal:  Ann Rheum Dis       Date:  2020-10-09       Impact factor: 19.103

Review 7.  Monogenic Lupus: A Developing Paradigm of Disease.

Authors:  Jessie M Alperin; Lourdes Ortiz-Fernández; Amr H Sawalha
Journal:  Front Immunol       Date:  2018-10-30       Impact factor: 7.561

8.  Whole-genome sequencing identifies complex contributions to genetic risk by variants in genes causing monogenic systemic lupus erythematosus.

Authors:  Jonas Carlsson Almlöf; Sara Nystedt; Dag Leonard; Maija-Leena Eloranta; Giorgia Grosso; Christopher Sjöwall; Anders A Bengtsson; Andreas Jönsen; Iva Gunnarsson; Elisabet Svenungsson; Lars Rönnblom; Johanna K Sandling; Ann-Christine Syvänen
Journal:  Hum Genet       Date:  2019-02-01       Impact factor: 4.132

Review 9.  New Horizons in the Genetic Etiology of Systemic Lupus Erythematosus and Lupus-Like Disease: Monogenic Lupus and Beyond.

Authors:  Erkan Demirkaya; Sezgin Sahin; Micol Romano; Qing Zhou; Ivona Aksentijevich
Journal:  J Clin Med       Date:  2020-03-05       Impact factor: 4.241

Review 10.  A Systematic Review of Extreme Phenotype Strategies to Search for Rare Variants in Genetic Studies of Complex Disorders.

Authors:  Sana Amanat; Teresa Requena; Jose Antonio Lopez-Escamez
Journal:  Genes (Basel)       Date:  2020-08-25       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.