Literature DB >> 32830257

A Genome-wide Association Study Identifies SERPINB10, CRLF3, STX7, LAMP3, IFNG-AS1, and KRT80 As Risk Loci Contributing to Cutaneous Leishmaniasis in Brazil.

Léa C Castellucci1,2, Lucas Almeida1,2, Svetlana Cherlin3, Michaela Fakiola4, Richard W Francis5, Edgar M Carvalho1, Anadílton Santos da Hora2, Tainã Souza do Lago2, Amanda B Figueiredo6, Clara M Cavalcanti6, Natalia S Alves6, Katia L P Morais6, Andréa Teixeira-Carvalho7, Walderez O Dutra1,8, Kenneth J Gollob1,6,9, Heather J Cordell3, Jenefer M Blackwell5,10.   

Abstract

BACKGROUND: Our goal was to identify genetic risk factors for cutaneous leishmaniasis (CL) caused by Leishmania braziliensis.
METHODS: Genotyping 2066 CL cases and 2046 controls using Illumina HumanCoreExomeBeadChips provided data for 4 498 586 imputed single-nucleotide variants (SNVs). A genome-wide association study (GWAS) using linear mixed models took account of genetic diversity/ethnicity/admixture. Post-GWAS positional, expression quantitative trait locus (eQTL) and chromatin interaction mapping was performed in Functional Mapping and Annotation (FUMA). Transcriptional data were compared between lesions and normal skin, and cytokines measured using flow cytometry and Bioplex assay.
RESULTS: Positional mapping identified 32 genomic loci associated with CL, none achieving genome-wide significance (P < 5 × 10-8). Lead SNVs at 23 loci occurred at protein coding or noncoding RNA genes, 15 with eQTLs for functionally relevant cells/tissues and/or showing differential expression in lesions. Of these, the 6 most plausible genetic risk loci were SERPINB10 (Pimputed_1000G = 2.67 × 10-6), CRLF3 (Pimputed_1000G = 5.12 × 10-6), STX7 (Pimputed_1000G = 6.06 × 10-6), KRT80 (Pimputed_1000G = 6.58 × 10-6), LAMP3 (Pimputed_1000G = 6.54 × 10-6), and IFNG-AS1 (Pimputed_1000G = 1.32 × 10-5). LAMP3 (Padjusted = 9.25 × 10-12; +6-fold), STX7 (Padjusted = 7.62 × 10-3; +1.3-fold), and CRLF3 (Padjusted = 9.19 × 10-9; +1.97-fold) were expressed more highly in CL biopsies compared to normal skin; KRT80 (Padjusted = 3.07 × 10-8; -3-fold) was lower. Multiple cis-eQTLs across SERPINB10 mapped to chromatin interaction regions of transcriptional/enhancer activity in neutrophils, monocytes, B cells, and hematopoietic stem cells. Those at IFNG-AS1 mapped to transcriptional/enhancer regions in T, natural killer, and B cells. The percentage of peripheral blood CD3+ T cells making antigen-specific interferon-γ differed significantly by IFNG-AS1 genotype.
CONCLUSIONS: This first GWAS for CL identified multiple genetic risk loci including a novel lead to understanding CL pathogenesis through regulation of interferon-γ by IFNG antisense RNA 1.
© The Author(s) 2020. Published by Oxford University Press for the Infectious Diseases Society of America.

Entities:  

Keywords:  zzm321990 IFNG-AS1zzm321990 ; zzm321990 Leishmaniazzm321990 ; GWAS; interferon-γ; post-GWAS integrated analysis

Mesh:

Substances:

Year:  2021        PMID: 32830257      PMCID: PMC8130031          DOI: 10.1093/cid/ciaa1230

Source DB:  PubMed          Journal:  Clin Infect Dis        ISSN: 1058-4838            Impact factor:   9.079


American cutaneous leishmaniasis (ACL) caused by Leishmania braziliensis has multiple presentations including cutaneous (CL), mucosal (ML), and disseminated (DL) leishmaniasis. ML and DL are generally preceded by CL. The common CL form of disease is associated with localized skin lesions, mainly ulcers, on exposed body parts. While normally self-limiting, the degree of pathology and rate of healing varies, with lesions leaving lifelong scars. Not all infected individuals go on to develop disease. Subclinical infection is associated with Leishmania-specific cellular immune responses, measured as delayed-type hypersensitivity (DTH) skin test responses [1]. Leishmania antigen-stimulated peripheral blood lymphocytes also produce interferon gamma (IFN-γ) and tumor necrosis factor (TNF) in subclinical infection, but at lower levels than CL [1]. In a longitudinal study, IFN-γ was associated with protection, but a positive skin-test response was not [2]. Indeed, a positive DTH response has high sensitivity for diagnosis of L. braziliensis CL [1, 3]. All forms of ACL are associated with exaggerated cellular immunity. In CL, there is a positive correlation between the frequency of CD4+ T cells expressing IFN-γ and TNF and lesion size [4], with higher levels in ML than CL [5]. The outcome of L. braziliensis infection is determined by a fine balance between proinflammatory IFN-γ and TNF and anti-inflammatory interleukin-10 (IL-10) [3, 5]. One question is whether host genetics influence these responses. Racial differences, familial clustering, and murine studies support genetic control of leishmaniasis (reviewed in [6]). Human family-based genetic epidemiology of CL caused by Leishmania peruviana, a member of the L. braziliensis species complex, was consistent with a gene by environment multifactorial model, a 2-locus model of inheritance providing best fit [7]. This suggested that major genetic risk factors might be found for CL. Candidate gene studies [8-16] of L. braziliensis complex suggest that multiple genes associated with pro- and anti-inflammatory responses (TNFA, SLC11A1, CXCR1, IL6, IL10, CCL2/MCP1, IFNG) and/or wound healing (FLI1, CTGF, TGFBR2, SMAD2, SMAD3, SMAD7, COL1A1) influence CL or ML disease. Although frequently underpinned by functional data [12–14, 16] and/or supported by prior immunological studies [17, 18], these studies have generally lacked statistical power. Here we perform the first well-powered genome-wide association study for L. braziliensis CL, combining analysis across 2 cohorts comprising 2066 cases and 2046 controls. Integrative post-GWAS analysis [19] is used to positionally map genomic loci associated with CL, with functional annotation and experimental studies used to identify plausible genes that act as genetic risk factors for CL.

MATERIALS AND METHODS

Ethical Considerations, Sampling, and Clinical Data Collection

The study, approved by the Hospital Universitário Professor Edgard Santos Ethical Committee (018/2008 and 22/2012) and the Brazilian National Ethical Committee (CONEP–305/2007; CONEP–1258513.1.000.5537), complied with principles of the Helsinki declaration. All participants or parents/guardians signed written consents. Post-quality control (QC) genotype data are lodged in the European Genome-Phenome Archive (accession number EGAS00001004596). CL cases were ascertained at the Public Health Post, Corte de Pedra, Bahia, Brazil, where L. braziliensis is the confirmed species [9-12]. CL is defined as presence of chronic ulcerative lesions without mucosal involvement (ML) or dissemination to ≥10 sites (DL). ML and DL cases were excluded due to insufficient power. All CL cases had confirmed parasite detection and/or minimally met 2 of 3 criteria: positive leishmania-specific DTH, positive leishmania serology, and leishmania histopathology. Endemic controls were attendants of cases with no current/previous history of CL, DL, or ML, including no scars. Samples were collected in 2 phases: 2008–2010 and 2016–2017. Blood bank controls were collected during 2015–2017 at the HEMOBA Foundation, Salvador. Demographic data (age, sex) were recorded. Blood (8 mL) was taken by venipuncture into dodecyl citrate acid–containing vacutainers (Becton Dickinson). Genomic DNA was prepared using proteinase K and salting-out and shipped to the United Kingdom for genotyping at Cambridge Genomic Services.

Array Genotyping and Marker QC

DNAs were genotyped on Illumina Infinium HumanCoreExome Beadchips (Illumina, San Diego, California) with probes for 551 004 single-nucleotide variants (SNVs): 282 373 informative across ancestries; 268 631 exome-focused. Human genome build 37 (hg19) was used. Exclusions were individuals with missing data rate >5%, SNVs with genotype missingness >5%, minor allele frequency <0.01, or deviation from Hardy-Weinberg equilibrium (threshold P < 1.0 × 10−8). Post-QC datasets comprised 312 503 genotyped SNVs, 956 CL cases, 868 controls for phase 1; and 298 919 SNVs, 1110 CL cases, and 1178 controls for phase 2. Phases 1 and 2 had 52% and 81% power, respectively, the combined sample 99% power, to detect genome-wide significance (P < 5 × 10−8 [20]), assuming a disease allele frequency of 0.25, effect size (genotype relative risk) 1.5, and disease prevalence 2%.

SNV Imputation and GWAS

Imputation was performed using the multiethnic 1000 Genomes Project phase 3 reference panel (1000G): 84.8 million variants, 2504 samples, and 26 populations. The 293 563 post-QC genotyped SNVs common across phases 1 and 2 were imputed using the Michigan Imputation Server version 1.0.4 [21]. Imputed SNVs with information metric <0.8 or genotype probability <0.9 were excluded. Remaining variants were converted to genotype calls and filtered for <5% missingness and minor allele frequency >0.005. Imputation accuracy was assessed as the squared Pearson correlation between imputed SNV dosage and known allele dosage (r2 > 0.5). Genome-wide association analysis was performed using a linear mixed model in FaST-LMM version 2.07 under an additive model [22]. Population structure and relatedness were controlled using the genetic similarity matrix, computed from 32 696 phase 1 and 45 569 phase 2 linkage disequilibrium (LD)–pruned array variants. Systematic confounding was assessed using quantile-quantile (Q-Q) plots and an inflation factor (denoted λ; median observed/median theoretical χ 2 distributions). Manhattan plots were generated in R using mhtplot in the genetic analysis package “gap.” Regional association plots were created using LocusZoom [23]. The 32 696 phase 1 and 45 569 phase 2 LD-pruned variants were matched to HapMap populations and Principal Component Analysis plots prepared in R.

Post-GWAS Annotation in FUMA

Functional Mapping and Annotation (FUMA) [19] was used to characterize regions of association based on positional, expression quantitative trait loci (eQTL) and chromatin interaction mapping. Summary statistics from the combined GWAS were loaded into FUMA. SNP2GENE was used to identify independent significant SNVs based on 1000G multiethnic LD data. SNP2GENE mapping used the default GWAS P < 10−5 plus 1 manually entered seed hit at P = 1.32 × 10−5. Independent significant SNVs and SNVs in LD with them were annotated for consequences on gene function using ANNOVAR, potential regulatory functions (Regulome DB score), and 15-core chromatin state predicted by ChromHMM for 127 tissue/cell types. Effects of SNVs on gene expression were determined using eQTLs from multiple tissue/cell types of healthy donors from databases: eQTLgen (44 different tissue types); BIOSQTL (BIO_eQTL_gene level, whole peripheral blood, 2116 healthy donors); DICE (B and T cells, monocytes, natural killer cells); and GTEx version 8 (whole blood; cultured fibroblasts; skin exposed and not sun exposed).

Expression Analysis in CL Lesions

RNA expression for mapped genes was examined using published microarray data [24] comparing CL lesion biopsies (n = 25) with normal skin (n = 10) from nonendemic unexposed donors (GEO database: GSE55664). Between-group comparisons were made on log-transformed data using the GEO2R tool with Benjamini and Hochberg false discovery rate–adjusted P values.

Cytokine and Antigen-stimulated T-Cell Responses

Plasma IFN-γ was measured using BioPlex-220 (Bio-Rad Laboratories) with Cytokine Grp-I-panel 27-plex. Peripheral blood mononuclear cells from a subset (n = 40) of untreated phase 2 CL patients were separated from heparinized blood over Ficoll and used to examine T-cell responses by IFNG-AS1 genotype. Cells (1 × 106 cells/mL) were stimulated: 37°C/5% CO2, 10 μg/mL L. braziliensis (strain MHOM/BR/2001) log-phase promastigote soluble Leishmania antigen, 1 μg/mL purified NA/LE anti-human CD28 (clone CD28.2, BD Biosciences, San Jose, California) for 15 hours, then brefeldin A (BD Biosciences) for 4 hours. Washed cells (phosphate-buffered saline/0.2% bovine serum albumin) were incubated (4°C; 30 minutes) with BUV661 anti-human CD3 monoclonal antibody (UCHT1 clone, BD Biosciences). Cells were fixed, washed, permeabilized using BD Cytofix/Cytoperm, and incubated (4°C; 30 minutes) with BV605 anti-human IFN-γ (B27 clone) and BUV395 anti-human TNF-α (MAB11 clone, BD Biosciences) in permeabilization buffer. Live/dead cells were distinguished using Fixable Viability Stain 575V (BD Biosciences). Data were acquired by BD FACSymphony A5 flow cytometry and analyzed using FlowJo 10.6.1 software (BD Biosciences), and differences between genotypes determined using nonparametric Kruskal-Wallis analysis of variance (ANOVA) with multiple comparisons.

RESULTS

Characteristics of the Study Population

Demographic and clinical details comparing cases and controls are provided in Supplementary Table 1. The younger age of endemic controls was counterbalanced by older age of blood bank controls. Phase 1 and 2 CL cases were matched for lesion number/size and DTH, with no correlation between lesion and DTH sizes. Blood bank controls fell within genetic heterogeneity of endemic controls (Supplementary Figures 1 and 2), with all controls matched to cases. A few outliers occurred in phase 1 endemic controls, which also showed greater heterogeneity in phase 2. Comparison against HapMap populations showed predominant admixture between white and African ethnicities. Linear mixed models used in association analyses take account of genetic heterogeneity.

Genome-wide Association Study

Manhattan and Q-Q plots for genotyped data for phases 1/2 (Supplementary Figures 3 and 4) showed no systematic bias (λs 0.998/1). A Manhattan plot for the combined imputed genotype data (Figure 1) shows no hits at P < 5 × 10−8. Four approaches were used to identify susceptibility genes: (1) integrative post-GWAS annotation in FUMA [19]; (2) analysis of transcriptional data comparing lesions with normal skin [24]; (3) review of gene function for relevance to parasite biology/immunopathology; and (4) analysis of genotypic differences in T-cell responses.
Figure 1.

Manhattan plot of results from the combined analysis for the 4.46M high-quality 1000G imputed single-nucleotide variants (SNVs) common to phase 1 and phase 2 samples. Data are for analysis in FastLMM looking for association between SNVs and cutaneous leishmaniasis. The y-axis indicates −log10P values for association; the x-axis indicates the positions across each chromosome. The dotted line indicates the P = 5 × 10−5 cutoff used to look for suggestive associations.

Manhattan plot of results from the combined analysis for the 4.46M high-quality 1000G imputed single-nucleotide variants (SNVs) common to phase 1 and phase 2 samples. Data are for analysis in FastLMM looking for association between SNVs and cutaneous leishmaniasis. The y-axis indicates −log10P values for association; the x-axis indicates the positions across each chromosome. The dotted line indicates the P = 5 × 10−5 cutoff used to look for suggestive associations.

Integrative Post-GWAS Mapping and Annotation in FUMA

SNP2GENE identified 32 genomic loci associated with CL (Table 1; Supplementary Table 2). Positionally mapped SNVs localized to noncoding sequence, 58% intronic, 21% intergenic, 7% intronic in noncoding RNA genes, and 4% other. Most genomic loci (29/32) had a single lead SNV, with 5 additional independent significant SNVs at loci 9, 13, and 14. Top GWAS hits (=lead SNVs) at 23 loci were taken forward (Table 1): 18 at/near protein coding genes (21 genes: 15 intronic; 6 upstream/downstream), 5 intronic in noncoding RNA genes, and 1 intergenic <5 kb. Nine lead SNVs intergenic at >5 kb from the nearest gene were excluded from further consideration.
Table 1.

Summary of SNP2GENE Results for Lead Genome-wide Association Study Single-nucleotide Variants and Associated Gene Information

Genomic LocusLead IndSigSNParsIDNearest GenebType of GeneDistance From GeneFunctional LocationNo. of Posc Mapped SNVsNo. of eQTL SNVseQTL DatabaseeQTL TypedLesion vs Normal SkineFold- changef
11:175280806rs12753656RP3-518E13.2: TNRAntisense: protein coding0ncRNA intronic intronic60(TNR) NS
21:238427203rs139144273RP11-136B18.1lincRNA4590Intergenic10ND
42:50706764NoneNRXN1Protein coding0Intronic400NS
53:22117736rs1383086ZNF385DProtein coding0Intronic306.28 × 10–4−1.8
63:149314267rs536034590WWTR1Protein coding0Intronic110NS
73:182857261s74285558 MCCC1 Protein coding231Upstream412eQTLGen GTEx/v8 GTEx/v8 BIOSQTLcis_eQTLs Skin SE Skin NSE Gene-level7.98 × 10–10−2.1
73:182857261s74285558 LAMP3 Protein coding0Intronic141eQTLGencis_eQTLs9.25 × 10–125.9
96:132815562rs144488134STX7Protein coding0Intronic600.0081.3
127:93065079rs143586968CALCRProtein coding0Intronic803.62 × 10–41.7
138:40245200rs125676CTA-392C11.2lincRNA0ncRNA intronic470ND
148:52628820rs13261618 PXDNL Protein coding0Intronic620NS
148:52628820rs13261618 PCMTD1 Protein coding0Downstream3770eQTLGenGTEx/v8cis_eQTLs Fibroblasts6.20 × 10–9−1.9
1611:80470102NoneRP11-686G23.2lincRNA0ncRNA intronic60ND
1812:3397404rs77563142TSPAN9Protein coding1673Downstream302.54 × 10–5−1.8
1912:52590004rs10783496KRT80Protein coding4219Upstream3231GTEx/v8; GTEx/v8; GTEx/v8Fibroblasts Skin SE Skin NSE3.07 × 10–8−3.0
2012:68407845rs4913269IFNG-AS1Antisense0ncRNA intronic1010eQTLGenGTEx/v8 BIOSQTLcis_eQTLs Blood Gene-levelND
2214:47560881rs6572403MDGA2:MDGA2Protein coding0Intronic40NS
2314:53686046rs1255253AL163953.3lincRNA0ncRNA intronic520ND
2517:29136126rs75270613CRLF3Protein coding0Intronic809.19 × 10–92.0
2618:46766154rs4939853DYMProtein coding0Intronic414390eQTLGenGTEx/v8 BIOSQTLcis_eQTLs Blood Gene-levelNS
2718:52955675rs8090418TCF4Protein coding0Intronic226eQTLGencis_eQTLsNS
2818:61598763rs8084306SERPINB10Protein coding0Intronic3128eQTLGenGTEx/v8GTEx/v8cis_eQTLs Skin SE Skin NSENS
2919:55746886rs12709949PPP6R1Protein coding0Intronic1906.09 × 10–71.7
3121:48023640rs201555201S100BProtein coding0Intronic1434eQTLGenGTEx/v8 GTEx/v8 BIOSQTLcis_eQTLs Skin SE Skin NSE Gene-levelNS
3222:51038824rs112974449 CHKB Protein coding0Downstream2802.60 × 10–51.4
3222:51038824rs112974449 MAPK8IP2 Protein coding0Upstream3229GTEx/v8 GTEX/v8Skin SE Skin NSENS

Full details of genomic loci are provided in Supplementary Table 2.

Abbreviations: blood, whole blood; eQTL, expression quantitative trait locus; fibroblasts, cultured fibroblasts; ncRNA, noncoding RNA; ND, not done, NS, not significant; skin NSE, skin not sun exposed suprapubic; rsID, reference SNP cluster identity; SE, skin sun exposed lower leg; SNV, single-nucleotide variant; TNR, tenascin receptor.

aLead IndSigSNP is the genome-wide association study top SNV.

bBold indicates pairs of genes mapped with respect to the same Lead IndSigSNP.

cPos indicates positionally mapped SNVs from SNP2GENE analysis.

deQTL type/tissue-cell type.

eAnalyzed from data in GEO database GSE55664 using the GEO2R tool with Benjamini and Hochberg false discovery rate–adjusted P values.

fFold-change for GEO2R lesion vs normal skin analysis.

Summary of SNP2GENE Results for Lead Genome-wide Association Study Single-nucleotide Variants and Associated Gene Information Full details of genomic loci are provided in Supplementary Table 2. Abbreviations: blood, whole blood; eQTL, expression quantitative trait locus; fibroblasts, cultured fibroblasts; ncRNA, noncoding RNA; ND, not done, NS, not significant; skin NSE, skin not sun exposed suprapubic; rsID, reference SNP cluster identity; SE, skin sun exposed lower leg; SNV, single-nucleotide variant; TNR, tenascin receptor. aLead IndSigSNP is the genome-wide association study top SNV. bBold indicates pairs of genes mapped with respect to the same Lead IndSigSNP. cPos indicates positionally mapped SNVs from SNP2GENE analysis. deQTL type/tissue-cell type. eAnalyzed from data in GEO database GSE55664 using the GEO2R tool with Benjamini and Hochberg false discovery rate–adjusted P values. fFold-change for GEO2R lesion vs normal skin analysis. GWAS SNVs are generally enriched for eQTLs [25]. Focusing on data from tissues (whole blood, skin) and cell types (immune cells) relevant to CL (Table 1), SNP2GENE mapped eQTLs associated with expression of MCCC1/LAMP3, PCMTD1, KRT80, IFNG-AS1, DYM, SERPINB10, S100B, and MAPK8IP2.

Transcriptional Analyses of Putative Susceptibility Genes

Additional evidence (Table 1) to support genes as candidates was sought by comparing expression in CL lesions vs normal skin [24]. Six genes were expressed at higher level in lesions (LAMP3, STX7, CALCR, CRLF3, PPP6R1, CHKB), 5 genes at lower levels (ZNF385D, MCCC1, PCMTD1, TSPAN9, KRT80). For 5 differentially expressed genes, ZNF385D, STX7, CALCR, TSPAN9, PPP6R1, SNP2GENE eQTL mapping provided no evidence for SNVs associated with expression in selected tissues (whole blood, skin) or cell types (fibroblasts, immune cells). A role for GWAS SNVs regulating expression of KRT80 was supported by eQTL and chromatin interaction mapping (Figure 2). The lead SNV and others in strong LD lie upstream of KRT80 in a region of strong transcriptional/enhancer activity and act as eQTLs in cultured fibroblasts, sun and non-sun-exposed skin. Other genes supported by both eQTL and lesion expression were in tandem with genes positionally mapped to the same lead SNV (Table 1), namely LAMP3/MCCC1, PXDNL/PCMTD1, and CHKB/MAPK8IP2. For PXDNL/PCMTD1, the lead SNV was intronic in PXDNL but neither it nor numerous mapped SNVs in LD with it were eQTL for PXDNL itself. Rather, they acted as eQTLs for PCMTD1 expression in fibroblasts (Supplementary Figure 5). For CHKB/MAPK8IP2, the lead SNV lies upstream of both genes transcribed in opposing directions. Mapped SNV act as eQTLs for MAPK8IP2 in sun and not-sun-exposed skin but not for CHKB (Supplementary Figure 6). For LAMP3/MCCC1, the lead SNV and SNVs in LD with it map predominantly within LAMP3 (Supplementary Figure 7). While they act as cis-eQTLs for MCCC1 expression, data from CL lesions (Table 1) suggest stronger upregulation of LAMP3 compared to downregulation of MCCC1.
Figure 2.

Results of positional, chromatin interaction, and expression quantitative trait locus (eQTL) activity mapping in functional mapping and annotation for KRT80. A, Map showing the top lead single-nucleotide variant (SNV), and SNVs in linkage disequilibrium with it according to the r2 color-coded key, across the 2 genes. There were no additional independent significant SNVs. B, Chromatin-15 states color coded for transcriptional/enhancer activity as shown in the key. The y-axis color coding relates to cell/tissue types in which chromatin interaction was mapped. C, eQTL activity for genes (y-axis) in different cells/tissues from public domain databases as shown in the key. Full explanation of keys provided as preamble to the supplementary figures. Color figure available online.

Results of positional, chromatin interaction, and expression quantitative trait locus (eQTL) activity mapping in functional mapping and annotation for KRT80. A, Map showing the top lead single-nucleotide variant (SNV), and SNVs in linkage disequilibrium with it according to the r2 color-coded key, across the 2 genes. There were no additional independent significant SNVs. B, Chromatin-15 states color coded for transcriptional/enhancer activity as shown in the key. The y-axis color coding relates to cell/tissue types in which chromatin interaction was mapped. C, eQTL activity for genes (y-axis) in different cells/tissues from public domain databases as shown in the key. Full explanation of keys provided as preamble to the supplementary figures. Color figure available online. Ten genes showed no differential expression in lesions (Table 1). This included SERPINB10 across which multiple cis-eQTLs mapped to chromatin states of transcriptional/enhancer activity in neutrophils, monocytes, B cells, and hematopoietic stem cells (Supplementary Figure 8). Four noncoding RNA genes (Table 1) not present on the chips used for CL lesion data [24] included IFNG-AS1 which had 10 eQTLs across a chromatin state region of transcriptional/enhancer activity in immune cells (whole blood: T cells, B cells, hematopoietic stem cells) that were associated with expression of IFNG-AS1, IFNG, and IL26 (Figure 3).
Figure 3.

Results of positional, chromatin interaction, and expression quantitative trait locus (eQTL) activity mapping in functional mapping and annotation for IFNG-AS1. A, Map showing the top lead single-nucleotide variant (SNV), and SNVs in linkage disequilibrium with it according to the r2 color-coded key, across the 2 genes. There were no additional independent significant SNVs. B, Chromatin-15 states color coded for transcriptional/enhancer activity as shown in the key. The y-axis color coding relates to cell/tissue types in which chromatin interaction was mapped. C, eQTL activity for genes (y-axis) in different cells/tissues from public domain databases as shown in the key. Full explanation of keys provided as preamble to supplementary figures. Color figure available online.

Results of positional, chromatin interaction, and expression quantitative trait locus (eQTL) activity mapping in functional mapping and annotation for IFNG-AS1. A, Map showing the top lead single-nucleotide variant (SNV), and SNVs in linkage disequilibrium with it according to the r2 color-coded key, across the 2 genes. There were no additional independent significant SNVs. B, Chromatin-15 states color coded for transcriptional/enhancer activity as shown in the key. The y-axis color coding relates to cell/tissue types in which chromatin interaction was mapped. C, eQTL activity for genes (y-axis) in different cells/tissues from public domain databases as shown in the key. Full explanation of keys provided as preamble to supplementary figures. Color figure available online.

Relevance to the Biology and Immunopathology of CL Disease

In summary, of 32 positional mapped genomic loci, 23 occurred at protein coding or noncoding RNA genes of which 15 had eQTLs for expression in relevant cells/tissues and/or showed differential expression in CL lesions. To determine which genes in these 15 loci might act as CL susceptibility genes, we reviewed gene function in relation to parasite biology and CL immunopathology (Supplementary Table 3). A plausible functional role for 12 genes was not found, including PCMTD1 for which eQTL and lesion expression data were strong. A role for these genes cannot be discounted, but 6 genes had plausible links to CL pathogenesis (Table 2 and Figure 4): LAMP3 and STX7 play a role in lysosome function; KRT80 and CRLF3 relate to skin perturbations; SERPINB10 and IFNG-AS1 play central roles in immune responses.
Table 2.

Top Genome-wide Association Study Hits in Genes of Plausible Functional Interest as Genetic Risk Factors for Cutaneous Leishmaniasis Caused by Leishmania braziliensis

ChrPosition, bprsID P ValueOdds Ratio (95% CI)Beta (SE)AlleleaVariant OriginLocationGeneFunction
3182857261rs742855586.54 × 10–60.87 (.82–.92)−.034 (0.008)T (C/T)Globalintron LAMP3 Lysosomal associated membrane protein 3
6132815562rs1444881346.10 × 10–60.82 (.75–.89)−.034 (0.007)A (C/A)Africanintron STX7 Syntaxin 7
1252590004rs107834966.58 × 10–61.06 (1.03–1.09).035 (0.008)A (G/A)Globalintron KRT80 Keratin 80
1268407845rs49132691.32 × 10–51.06 (1.03–1.08).033 (0.008)G (C/G)Globalintron IFNG-AS1 IFNG antisense RNA 1
1729136126rs752706135.12 × 10–60.83 (.77–.90)−.034 (0.008)T (C/T)Africanintron CRLF3 Cytokine receptor like factor 3
1861598763rs80843061.56 × 10–61.07 (1.04–1.10).038 (0.008)C (T/C)Globalintron SERPINB10 Serpin family B member 10

Details of all post–genome-wide association study candidate genes are provided in Supplementary Table 3.

Abbreviations: Chr, chromosome; CI, confidence interval; rsID, reference SNP cluster identity; SE, standard error.

aAssociated allele (ancestral/minor) for risk or protection as indicated by the odds ratio.

Figure 4.

LocusZoom plots for genome-wide association study (GWAS) associations identified as plausible genetic risk factors for cutaneous leishmaniasis following post-GWAS annotation: LAMP3 (A), STX7 (B), KRT80 (C), CRLF3 (D), SERPINB10 (E), and IFNG-AS1 (F). The −log10P values (left y-axis) are shown in the top section of each plot. Dots representing individual single-nucleotide variants (SNVs) are color coded/grey-scale shaded (see key) based on their population-specific linkage disequilibrium r2 with the top SNV (annotated by rsID [reference SNP cluster identity]) in the region. The right y-axis is for recombination rate (blue line), based on HapMap data. The bottom section of each plot shows the positions of genes across the region.

Top Genome-wide Association Study Hits in Genes of Plausible Functional Interest as Genetic Risk Factors for Cutaneous Leishmaniasis Caused by Leishmania braziliensis Details of all post–genome-wide association study candidate genes are provided in Supplementary Table 3. Abbreviations: Chr, chromosome; CI, confidence interval; rsID, reference SNP cluster identity; SE, standard error. aAssociated allele (ancestral/minor) for risk or protection as indicated by the odds ratio. LocusZoom plots for genome-wide association study (GWAS) associations identified as plausible genetic risk factors for cutaneous leishmaniasis following post-GWAS annotation: LAMP3 (A), STX7 (B), KRT80 (C), CRLF3 (D), SERPINB10 (E), and IFNG-AS1 (F). The −log10P values (left y-axis) are shown in the top section of each plot. Dots representing individual single-nucleotide variants (SNVs) are color coded/grey-scale shaded (see key) based on their population-specific linkage disequilibrium r2 with the top SNV (annotated by rsID [reference SNP cluster identity]) in the region. The right y-axis is for recombination rate (blue line), based on HapMap data. The bottom section of each plot shows the positions of genes across the region.

Relating IFNG-AS1 Genotypes to Antigen-specific T-Cell Responses

The GWAS (Supplementary Data) showed modest support (P < .01) for some previous candidate genes, but not for IFNG variants associated with Leishmania guyanensis CL in Brazil [16], including no association with plasma IFN-γ (Figure 5A and 5B). IFNG-AS1 expression influences IFN-γ production [26]. While eQTL mapping supported SNVs at IFNG-AS1 acting as cis-eQTLs for IFNG and IL26, they were stronger eQTLs for IFNG-AS1 (Figure 3). Plasma IFN-γ did not differ by IFNG-AS1 genotype (Figure 5C), but a significant difference in the percentage of antigen-specific IFN-γ–producing T cells across genotypes was observed at rs4913269 (ANOVA Padjusted = .044) (Figure 5D). Individuals homozygous for the disease-associated G allele showed a significantly lower percentage of IFN-γ–producing T cells compared with heterozygotes. Similar genotype associations occurred for TNF-producing T cells (Figure 5E; ANOVA Padjusted = .021), which were strongly correlated with IFN-γ–producing T cells (Figure 5F; r2 = 0.31; P = .0003). Parallel observations were made for 6 other SNVs in strong LD (Figure 4D) with rs4913269.
Figure 5.

Plots examining IFNG and IFNG-AS1 genotypes by interferon gamma (IFN-γ) and tumor necrosis factor (TNF) responses. A–C, Results for plasma levels of IFN-γ. A and B, There is no association between plasma IFN-γ and genotypes for 2 single-nucleotide variants (SNVs) at IFNG, rs1861494 that was associated with cutaneous leishmaniasis (CL) disease for Leishmania guyanensis in a previous study [16] and rs2080414 that was in the strongest linkage disequilibrium with rs2069705 that was associated with CL disease and plasma IFN-γ in that study (rs2069705 was not genotyped or imputed in the present study). C, There is no association between plasma IFN-γ and rs4913269 at IFNG-AS1. D and E, Differences in percentages of antigen-stimulated IFN-γ and TNF-producing CD3+ T cells by IFNG-AS1 genotype for the top SNV rs4913269 at chromosome 12 bp position 68407845 associated with CL disease in our study. F, Correlation between percentage IFN-γ + and percentage TNF+ CD3+ T cells for individuals genotyped. Abbreviations: ANOVA, analysis of variance; IFN-γ, interferon gamma; NS, not significant; TNF, tumor necrosis factor.

Plots examining IFNG and IFNG-AS1 genotypes by interferon gamma (IFN-γ) and tumor necrosis factor (TNF) responses. A–C, Results for plasma levels of IFN-γ. A and B, There is no association between plasma IFN-γ and genotypes for 2 single-nucleotide variants (SNVs) at IFNG, rs1861494 that was associated with cutaneous leishmaniasis (CL) disease for Leishmania guyanensis in a previous study [16] and rs2080414 that was in the strongest linkage disequilibrium with rs2069705 that was associated with CL disease and plasma IFN-γ in that study (rs2069705 was not genotyped or imputed in the present study). C, There is no association between plasma IFN-γ and rs4913269 at IFNG-AS1. D and E, Differences in percentages of antigen-stimulated IFN-γ and TNF-producing CD3+ T cells by IFNG-AS1 genotype for the top SNV rs4913269 at chromosome 12 bp position 68407845 associated with CL disease in our study. F, Correlation between percentage IFN-γ + and percentage TNF+ CD3+ T cells for individuals genotyped. Abbreviations: ANOVA, analysis of variance; IFN-γ, interferon gamma; NS, not significant; TNF, tumor necrosis factor.

DISCUSSION

Our GWAS provides the first hypothesis-free insights into genetic risk factors for L. braziliensis CL. Despite prior evidence for genetic regulation of leishmaniasis [7], and the robust well-powered study undertaken, no signals of association achieved genome-wide significance (P < 5 × 10−8) and only modest support was found for previous candidate gene studies (Supplementary Data). We therefore employed integrative approaches [19] to prioritize 6 genes as plausible genetic risk loci for CL. Two genes relate to intracellular localization of Leishmania parasite in phagolysosomes [27]. LAMP3 encodes lysosomal-associated membrane protein 3, also known as dendritic cell LAMP or DCLAMP [28]. Expression of DCLAMP increases in activated dendritic cells, localizing to the MHC class II compartment immediately before translocation of class II to the cell surface [28]. LAMP3 was expressed at 5.9-fold higher levels in lesions compared to normal skin, supporting its role in CL. Since dendritic cells are the most potent antigen-presenting cells that induce primary T-cell responses, it is likely that variation at LAMP3 will relate to presentation of Leishmania antigens to T cells. STX7 encodes syntaxin 7, which influences vesicle trafficking to lysosomes, including phagosome-lysosome fusion [29]. Variants at STX7 could influence delivery of Leishmania phagosomes to lysosomes of macrophages. Two other susceptibility genes, KRT80 and CRLF3, likely relate to skin perturbations and/or the wound healing response. Keratin 80 is a type II epithelial keratin with biased expression in skin keratinocytes [30]. Pathogens invading skin cause keratinocytes to produce chemokines which attract monocytes, natural killer cells, T cells, and dendritic cells [31]. KRT80 and multiple other keratins (KRT77/81/4/39/32/33B) were expressed at lower levels in lesions compared to normal skin. Whether this reflects a paucity of keratinocytes in lesions, or specific downregulation of gene expression in keratinocytes within lesions, requires further investigation. Keratinocytes play a role in wound healing, are potent producers of IL-10 and transforming growth factor–β [32], and can change to a sclerotic phenotype by gene silencing of Fli1 [33]. Although association of human CL and FLI1 [9, 10] was not replicated here, Fli1 is a confirmed murine CL susceptibility gene [34]. Our novel associations continue to focus on molecules/cells involved in wound healing. In contrast, the cytokine receptor-like factor 3 CRLF3 is expressed in normal skin, and shows pathologically enhanced expression in premalignant actinic keratosis and malignant squamous cell carcinoma [35]. CRLF3 appears to be similarly dysregulated in CL lesions. SERPINB10 and IFNG-AS1 are highlighted as central regulators of immune responses. Serpin family B member 10 is a peptidase inhibitor expressed in bone marrow [36] in the monocytic lineage and can inhibit TNF-induced apoptosis [37]. Epithelial SERPINB10 contributes to allergic eosinophilic inflammation [38]. However, despite eQTL data showing SERPINB10 expressed in sun-exposed skin, we found no evidence for its expression in lesions, consistent with more central roles in immune regulation. IFNG antisense RNA 1 fine-tunes the magnitude of IFN-γ responses [26]. It is expressed in mouse and human T-helper1 cells and positively regulates Ifng expression [39, 40]. Transient overexpression of Ifng-as1 is associated with increased IFN-γ and reduced susceptibility to Salmonella enterica [39]. Conversely, deletion of Ifng-as1 in mice compromises host defence against Toxoplasma gondii by reducing Ifng expression. Discordant expression of IFNG and IFNG-AS1 is seen in long-lasting memory T cells, where high IFNG-AS1 associated with low IFNG suggests feedback inhibition [26]. We observed that the IFNG-AS1 genotype was associated with downstream effects on percentage of IFN-γ–producing CD3+ T cells and highly correlated TNF-producing CD3+ T cells following antigen stimulation. Individuals homozygous for the disease-associated allele at 7 IFNG-AS1 associated SNVs had significantly lower percentages of IFN-γ/TNF T cells, suggesting that lower IFN-γ and TNF regulated by IFNG-AS1 causes increased disease risk. These 2 proinflammatory cytokines are important activators of macrophages for anti-leishmanial activity. Our GWAS identified novel genetic risk factors for CL that provide interesting leads to further understanding CL pathogenesis, including through regulation of IFN-γ responses.

Supplementary Data

Supplementary materials are available at Clinical Infectious Diseases online. Consisting of data provided by the authors to benefit the reader, the posted materials are not copyedited and are the sole responsibility of the authors, so questions or comments should be addressed to the corresponding author. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  40 in total

1.  Evaluation of IFN-gamma and TNF-alpha as immunological markers of clinical outcome in cutaneous leishmaniasis.

Authors:  Argemiro D'Oliveira; Paulo Machado; Olívia Bacellar; Lay Har Cheng; Roque P Almeida; Edgar M Carvalho
Journal:  Rev Soc Bras Med Trop       Date:  2002 Jan-Feb       Impact factor: 1.581

2.  The (in)famous GWAS P-value threshold revisited and updated for low-frequency variants.

Authors:  João Fadista; Alisa K Manning; Jose C Florez; Leif Groop
Journal:  Eur J Hum Genet       Date:  2016-01-06       Impact factor: 4.246

3.  Apamin inhibits TNF-α- and IFN-γ-induced inflammatory cytokines and chemokines via suppressions of NF-κB signaling pathway and STAT in human keratinocytes.

Authors:  Woon-Hae Kim; Hyun-Jin An; Jung-Yeon Kim; Mi-Gyeong Gwon; Hyemin Gu; Sun-Jae Lee; Ji Y Park; Kyung-Duck Park; Sang-Mi Han; Min-Kyung Kim; Kwan-Kyu Park
Journal:  Pharmacol Rep       Date:  2017-04-18       Impact factor: 3.024

4.  Activated inflammatory T cells correlate with lesion size in human cutaneous leishmaniasis.

Authors:  Lis R V Antonelli; Walderez O Dutra; Roque P Almeida; Olivia Bacellar; Edgar M Carvalho; Kenneth J Gollob
Journal:  Immunol Lett       Date:  2005-11-15       Impact factor: 3.685

5.  Human syntaxin 7: a Pep12p/Vps6p homologue implicated in vesicle trafficking to lysosomes.

Authors:  H Wang; L Frelin; J Pevsner
Journal:  Gene       Date:  1997-10-15       Impact factor: 3.688

6.  IL6 -174 G/C promoter polymorphism influences susceptibility to mucosal but not localized cutaneous leishmaniasis in Brazil.

Authors:  Lea Castellucci; Eliane Menezes; Joyce Oliveira; Andrea Magalhaes; Luiz Henrique Guimaraes; Marcus Lessa; Silvana Ribeiro; Jeancarlo Reale; Elza Ferreira Noronha; Mary E Wilson; Priya Duggal; Terri H Beaty; Selma Jeronimo; Sarra E Jamieson; Ashlee Bales; Jenefer M Blackwell; Amelia Ribeiro de Jesus; Edgar M Carvalho
Journal:  J Infect Dis       Date:  2006-07-10       Impact factor: 5.226

7.  Immunological and genetic evidence for a crucial role of IL-10 in cutaneous lesions in humans infected with Leishmania braziliensis.

Authors:  Adnene Salhi; Virmondes Rodrigues; Ferrucio Santoro; Helia Dessein; Audrey Romano; Lucio Roberto Castellano; Mathieu Sertorio; Sima Rafati; Christophe Chevillard; Aluisio Prata; Alexandre Alcaïs; Laurent Argiro; Alain Dessein
Journal:  J Immunol       Date:  2008-05-01       Impact factor: 5.422

8.  Human genetic susceptibility and infection with Leishmania peruviana.

Authors:  M A Shaw; C R Davies; E A Llanos-Cuentas; A Collins
Journal:  Am J Hum Genet       Date:  1995-11       Impact factor: 11.025

9.  The NeST long ncRNA controls microbial susceptibility and epigenetic activation of the interferon-γ locus.

Authors:  J Antonio Gomez; Orly L Wapinski; Yul W Yang; Jean-François Bureau; Smita Gopinath; Denise M Monack; Howard Y Chang; Michel Brahic; Karla Kirkegaard
Journal:  Cell       Date:  2013-02-14       Impact factor: 41.582

10.  Functional mapping and annotation of genetic associations with FUMA.

Authors:  Kyoko Watanabe; Erdogan Taskesen; Arjen van Bochoven; Danielle Posthuma
Journal:  Nat Commun       Date:  2017-11-28       Impact factor: 14.919

View more
  4 in total

Review 1.  Genetics, Transcriptomics and Meta-Taxonomics in Visceral Leishmaniasis.

Authors:  Jenefer M Blackwell; Michaela Fakiola; Om Prakash Singh
Journal:  Front Cell Infect Microbiol       Date:  2020-11-25       Impact factor: 5.293

2.  Cytokine Receptor-Like Factor 3 (CRLF3) Contributes to Early Zebrafish Hematopoiesis.

Authors:  Tarannum Taznin; Kaushalya Perera; Yann Gibert; Alister C Ward; Clifford Liongue
Journal:  Front Immunol       Date:  2022-06-20       Impact factor: 8.786

Review 3.  LncRNA: A Potential Target for Host-Directed Therapy of Candida Infection.

Authors:  Ye Wang; Hongdan Xu; Na Chen; Jin Yang; Hongmei Zhou
Journal:  Pharmaceutics       Date:  2022-03-11       Impact factor: 6.321

4.  SERPINB10 contributes to asthma by inhibiting the apoptosis of allergenic Th2 cells.

Authors:  Yuqing Mo; Ling Ye; Hui Cai; Guiping Zhu; Jian Wang; Mengchan Zhu; Xixi Song; Chengyu Yang; Meiling Jin
Journal:  Respir Res       Date:  2021-06-14
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.