| Literature DB >> 29884787 |
Angélica M Delgado-Vega1, Manuel Martínez-Bueno2, Nina Y Oparina3,4, David López Herráez5, Helga Kristjansdottir6, Kristján Steinsson6, Sergey V Kozyrev4, Marta E Alarcón-Riquelme7,8.
Abstract
In an effort to identify rare alleles associated with SLE, we have performed whole exome sequencing of the most distantly related affected individuals from two large Icelandic multicase SLE families followed by Ta targeted genotyping of additional relatives. We identified multiple rare likely pathogenic variants in nineteen genes co-segregating with the disease through multiple generations. Gene co-expression and protein-protein interaction analysis identified a network of highly connected genes comprising several loci previously implicated in autoimmune diseases. These genes were significantly enriched for immune system development, lymphocyte activation, DNA repair, and V(D)J gene recombination GO-categories. Furthermore, we found evidence of aggregate association and enrichment of rare variants at the FAM71E1/EMC10 locus in an independent set of 4,254 European SLE-cases and 4,349 controls. Our study presents evidence supporting that multiple rare likely pathogenic variants, in newly identified genes involved in known disease pathogenic pathways, segregate with SLE at the familial and population level.Entities:
Mesh:
Year: 2018 PMID: 29884787 PMCID: PMC5993790 DOI: 10.1038/s41598-018-26274-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Graphic summary of the study design. Step-by-step scheme shows initial variant detection, validation and filtering followed by in silico functional analysis of candidate loci and analysis of candidate genes in independent population-based data.
Segregation analysis and population frequency for exome-sequence variants.
| Variant number | Gene | Chromosome position, hg19 | Protein change | rsid | Top functional categories (pathways//GO biological process) | Expression in GTEX blood/immune samples | Family 6 carriers/cases | Family 8 carriers/cases | Linkage within 20 Mb Family 6 (Z-score) | Linkage within 20 Mb Iceland (Z-score) | Controls | Nordic controls | Maximum MAF in public databases (global population) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 |
| chr4:125593332 | T367M | rs140232140 | Transport; protein transport; retrograde transport, endosome to plasma membrane |
| 3/8 | — | — | — | 0.0024 | — | 0.0091 |
| 2 |
| chr17:7810250 | A1523T | rs148451716 | Activated PKN1 stimulates transcription of AR (androgen receptor) regulated genes KLK2 and KLK3; Chromatin organization; Gene expression//chromatin organization; chromatin assembly or disassembly; transcription, DNA-templated |
| 4/8 | — | — | 1.79 | 0 | — | 0.0003 |
| 3 |
| chr4:126238305 | P247T | rs191329848 | Hippo signaling pathway//branching involved in ureteric bud morphogenesis; kidney development; heart morphogenesis; plasma membrane organization | No | 3/8 | — | — | — | 0.0016 | 0.0061 | 0.0042 |
| 4 |
| chr19:55358686 | I255L | rs112697729 |
| 6/8 | 1/7 | — | 2.06 | 0 | 0.1689 | — | |
| 5 |
| chr9:134027138 | I765V | rs61756081 |
| 3/8 | — | 1.3 | — | — | 0.012 | 0.0097 | |
| 6 |
| chr4:96762158 | R286P | rs147966234 | Citrate cycle (TCA cycle); Glucose metabolism; Carbon metabolism//carbohydrate metabolic process; glucose metabolic process | No | 3/8 | — | — | — | 0.0071 | 0.0241 | 0.0091 |
| 7 |
| chr1:48694594 | G103R | rs61746559 | Transport of glucose and other sugars, bile salts and organic acids, metal ions and amine compounds; Hexose transport//transport; ion transport | No | 4/8 | 2/7 | 1.13 | 2.4 | 0.0071 | 0.006 | 0.0525 |
| 8 |
| chr12:58350618 | A229V | rs117230607 | Double-strand break repair via nonhomologous end joining; protein phosphorylation; proteolysis |
| 6/8 | — | — | — | 0.0063 | 0.0183 | 0.0053 |
| 9 |
| chr3:127292588 | E300K | rs372625321 | Lipid metabolic process; G-protein coupled receptor signaling pathway; aging; negative regulation of mitotic cell cycle phase transition |
| — | 3/7 | — | — | — | — | 0.00008 |
| 10 |
| chr17:39261693 | D18V | rs113059833 | Aging; keratinization; hair cycle | No | 8/8 | 4/7 | — | — | 0 | 0.3765 | 0.1879 |
| 11 |
| chr13:20224319 | E499K | rs147594834 | Transcription, DNA-templated; regulation of transcription, DNA-templated; regulation of DNA methylation; negative regulation of transcription, DNA-templated |
| — | 3/7 | — | — | 0.008 | — | 0.003 |
| 12 |
| chr9:139404360 | D932N | rs758642073 |
| — | 5/7 | — | — | 0 | — | 0.0002 | |
| 13 |
| chr13:25670676 | A114T | rs117014540 | mRNA surveillance pathway; Deadenylation-dependent mRNA decay; RNA transport//mRNA metabolic process | No | — | 3/7 | — | — | 0.0094 | 0.012 | 0.0044 |
| 14 |
| chr14:100847878 | R206H | rs146976933 | _//_ |
| — | 3/7 | — | — | 0.0031 | — | 0.008 |
| 15 |
| chr19:40225031 | N65K | rs146776010 |
|
| — | 5/7 | — | 2.06 | 0.0071 | 0.006 | 0.0176 |
| 16 |
| chr10:14970085 | H283N | rs772438042 |
| — | 5/7 | 1 | — | 0 | — | 0.000008 | |
| 17 |
| chr19:50978724 | L7F | rs185418641 | _//_ | No | — | 4/7 | — | 2.06 | 0.0063 | 0.0061 | 0.0073 |
| 18 |
| chr12:1702929 | N102H | rs117331652 |
| — | 5/7 | — | — | 0.0055 | — | 0.0049 | |
| 19 |
| chr6:17601340 | G234R | rs202036280 | _//_ |
| — | 3/7 | — | — | — | — | 0.00005 |
Table 1 shows the results from the segregation analysis for the variants identified by WES as the number of family members who were variant carriers affected with SLE (family 8) or any autoimmune disease (family 6), over the total number of cases in each family. Z-score is indicated for the variants located in regions linked to SLE in Iceland according to ref.[27]. MAF (minor allelic frequency) of the variants in 642 internal whole-genome sequenced European and 83 Nordic controls. The maximum MAF corresponds to the highest frequency of allele in ExAc, 1000 Genomes and GO-ESP data for global population. Variant numbers correspond to those seen in Fig. 3. Functional annotation is shown according to GeneCards SuperPathways and Gene Ontology top biological process categories. The presence of the particular gene transcripts in GTEX blood/immune samples is marked according to Fig. 3. Additional annotation of the variants shown in Table S3.
Figure 3Graphic summary of annotation-based filtering for the variants identified by whole exome sequencing of patients with SLE from multicase families. The Venn diagram shows the genes harbouring non-synonymous variants (nsSNV) with a likely deleterious effect on protein function based on: (Left) a predicted deleterious effect by ≥3 with standard algorithms (SIFT, PolyPhen2, FATHMM, LRT, MetaLR, MutationAccessor, MutationTester, Provean) - genes for which nsSNVs were also predicted as deleterious by one-to-one orthologue-specific SNPDryad algorithm (score > 0.5) are shown in bold - (See detailed scores in Table S3); and (right) a maximum minor allelic frequency of 1% in European populations (internal sequencing and genotyping controls, 1000 Genomes EUR, ESP6500 EurAm and ExAc Eur non-Finish populations) (See Tables S3 and 1). Genes with a significant aggregate association and enrichment of rare variants (top) have further genetic evidence of being implicated in SLE. Genes associated after removing linked variants (r2 < 0.1), adjustment by 10 principal components and genomic control, and multiple test correction are highlighted with a star (See Table 2).
Figure 2Tissue-specific expression patterns for genes with nsSNVs. The heat plot shows GTEX RNAseq expression levels based on FPKM values. Immune-relevant samples, reproductive system and gastrointestinal tract are outlined in orange, blue and green boxes. For the bidirectional FAM71E1-EMC10 locus expression pattern of both genes is shown.
Rare-variant Association Analysis.
|
|
| Enrichment test | SKAT | ||
|---|---|---|---|---|---|
|
|
|
|
| ||
|
| 58 |
| 6.51E-02 | 5.50E-01 | 6.54E-01 |
|
| 41 |
|
|
|
|
|
| 19 |
|
|
|
|
Gene-based case-control association analysis of unlinked rare variants (MAF < 1% and r² < 0.1) of the genes identified by exome sequencing in an independent imputed genome-wide association scan from a set of 4,254 SLE patients and 4,349 controls with European ancestry[25]. Two procedures were used, an enrichment case-control association test and the sequence kernel association test (SKAT). Correction for multiple testing was run through a bootstrapping permutation process (Pmult). All tests were corrected for stratification by adjusting for the first 10 principal components (PC) and Genomic Inflation Control (λ).
Figure 5GIANT network for tissue-specific gene connectivity. The connections are shown for “all tissues”, lymph node, blood and for separate blood cell populations and correspond to one or more common partner genes between each of the studied genes. The connecting lines coloured according to the specificity.
Figure 4Global pairwise gene interaction network of the exome identified genes. Gene co-expression and direct protein-protein interactions are shown as a combined network. Family 8-specific genes are shown in red ovals, family 6 - in blue ovals. The names of the genes with significant enrichment or aggregated association of rare variants in SLE cases are shown in red italic. Genes associated after removing linked variants (r2 < 0.1), adjustment by 10 principal components and genomic control, and multiple test correction are highlighted with a star. Note the high connectivity within the network of the SLE-associated genes. Blue-filled ovals correspond to the genes with known immunity-related Mendelian disorders (OMIM data), green-filled - to the genes with published genome-wide significant associations with autoimmune and autoinflammatory disorders, yellow-filled - with other genome-wide significant immunity-related traits (Table S4).
Immune-related disorders and traits associated with genes identified through exome sequencing and their partners.
| Trait/disorder name | Exome sequencing genes | Potential partner genes |
|---|---|---|
| Acute lymphoblastic leukemia | ||
| Acute myeloid leukemia | ||
| Alloimmunization response to red blood cell transfusion in sickle cell anemia |
| |
| Antibody status in Tripanosoma cruzi seropositivity |
| |
| Asthma |
| |
| Chronic lymphocytic leukemia |
| |
| Chronic myeloid leukemia | ||
| Clozapine-induced agranulocytosis |
| |
| Crohn’s disease |
|
|
| Familial cold autoinflammatory syndrome 2 | ||
| Hennekam lymphangiectasia-lymphedema syndrome 2 | ||
| HIV-1 control |
| |
| IgA nephropathy |
| |
| IgG glycosylation |
| |
| Immune response to smallpox vaccine |
| |
| Inflammatory bowel disease |
| |
| Multiple sclerosis |
| |
| Omenn syndrome | ||
| Platelet count |
| |
| Primary biliary cirrhosis |
| |
| Psoriasis |
| |
| Psoriasis and Crohn’s disease combined |
| |
| Response to tocilizumab in rheumatoid arthritis |
| |
| Rheumatoid arthritis |
|
|
| Selective immunoglobulin A deficiency |
| |
| Self-reported allergy |
| |
| Severe combined immunodeficiency with sensitivity to ionizing radiation | ||
|
|
| |
|
|
| |
| Type 1 diabetes |
|
|
| Ulcerative colitis |
|
|
| Wegener’s granulomatosis |
|
Genes associated with SLE (in bold), autoimmune diseases, and other immune-related phenotypes according to published GWAS studies. Genes related to diseases according to OMIM Morbid are indicated with a star (*). MIM phenotype numbers are specified between brackets. AD = autosomal dominant, AR = autosomal recessive, S = somatic. See details and references in Table S6.
Figure 6GO categories enrichment for genes with SLE-associated nsSNVs in multicase families and their interaction partners according to TOPPGENE. The overrepresented GO terms were summarized by REVIGO to representative subsets of terms using a simple clustering algorithm that relies on semantic similarity measures. The scatterplots are shown for Biological Processes and Cellular Component, the circles correspond to the cluster representatives (i.e. terms remaining after the redundancy reduction) in a two dimensional space derived by applying multidimensional scaling to a matrix of the GO terms’ semantic similarities with an allowed similarity threshold = 0.9. The colour code and the size of circles reflect p-value of the overrepresented term (Table S7).