Literature DB >> 26616738

Conjunctival fibrosis and the innate barriers to Chlamydia trachomatis intracellular infection: a genome wide association study.

Chrissy h Roberts1, Christopher S Franklin2, Pateh Makalo3, Hassan Joof3, Isatou Sarr3, Olaimatu S Mahdi3, Ansumana Sillah4, Momodou Bah5, Felicity Payne2, Anna E Jeffreys6, William Bottomley2, Angels Natividad1, Sandra Molina-Gonzalez1, Sarah E Burr1,3, Mark Preston1, Dominic Kwiatkowski2,6, Kirk A Rockett6, Taane G Clark1, Matthew J Burton7, David C W Mabey1, Robin Bailey1, Inês Barroso2,8,9, Martin J Holland1.   

Abstract

Chlamydia trachomatis causes both trachoma and sexually transmitted infections. These diseases have similar pathology and potentially similar genetic predisposing factors. We aimed to identify polymorphisms and pathways associated with pathological sequelae of ocular Chlamydia trachomatis infections in The Gambia. We report a discovery phase genome-wide association study (GWAS) of scarring trachoma (1090 cases, 1531 controls) that identified 27 SNPs with strong, but not genome-wide significant, association with disease (5 × 10(-6) > P > 5 × 10(-8)). The most strongly associated SNP (rs111513399, P = 5.38 × 10(-7)) fell within a gene (PREX2) with homology to factors known to facilitate chlamydial entry to the host cell. Pathway analysis of GWAS data was significantly enriched for mitotic cell cycle processes (P = 0.001), the immune response (P = 0.00001) and for multiple cell surface receptor signalling pathways. New analyses of published transcriptome data sets from Gambia, Tanzania and Ethiopia also revealed that the same cell cycle and immune response pathways were enriched at the transcriptional level in various disease states. Although unconfirmed, the data suggest that genetic associations with chlamydial scarring disease may be focussed on processes relating to the immune response, the host cell cycle and cell surface receptor signalling.

Entities:  

Mesh:

Year:  2015        PMID: 26616738      PMCID: PMC4663496          DOI: 10.1038/srep17447

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Chlamydia trachomatis (Ct) infections have a significant impact on global health. As the cause of trachoma, Ct is the most common infectious cause of blindness1. The same bacterium is attributed to 106 million sexually transmitted infections (STIs) per annum2. Chronic immunopathological reactions have been implicated as the primary cause of pathology resulting from Ct infections3 and in both trachoma and STIs the primary pathological mechanism is the progressive formation of fibrotic scars at the site of infection456. In STIs, this can ultimately lead to infection related infertility and ectopic pregnancy7, whilst in the eye, trachomatous scarring (TS) causes the tarsal plate of the eyelid to become deformed, leading to entropion and trachomatous trichiasis (TT). When TT occurs, the eyelashes turn towards the globe of the eye and scratch the cornea, causing pain, opacity, visual impairment and blindness. Since the TS phenotype can be readily observed in the field using a magnifying loupe, trachoma represents an ideal platform for the study of genetic and biological factors that modulate the pathophysiological response to human Ct infections. The study of pathology in the reproductive tract meanwhile requires invasive techniques and it is difficult to obtain specimens from large numbers of confirmed Ct-STI-related scarring cases. Not all individuals who become infected with Ct (either through STI or in the eye) will suffer clinically significant levels of disease. Those who suffer repeated inflammatory episodes at the conjunctiva have been identified as being at high risk of progressing to TS8 and a similarly increased risk of genitourinary tract pathology has been linked to repeated Ct STIs6. The observation of TS correlates well with increased inflammatory cell activity in the epithelium as measured using in vivo confocal microscopy9. Substantial amounts of pathology can be attributed to environmental risk factors, but the familial clustering10 of TS, the transmission disequilibrium of TS risk alleles11 and evidence from studies focussed on immune response genes1213141516171819 each point to a significant role for host genetics in determining risk of developing TS. Infectious diseases have been relatively less studied by genome-wide association study (GWAS) than non-communicable diseases. Surprisingly few genome-wide associations were reported in several large-scale GWAS202122 and this could be attributed to the reduced power of the studies which resulted from the increased genetic diversity, complex population structures and the low degree of linkage disequilibrium (LD) in the studied populations23. Recent developments in pathways based analysis of GWAS data enable more extensive, multi-SNP analysis and the collapsing of information across networks of functionally interacting polymorphisms24252627. This may add substantially to the discovery power of GWAS in complex phenotypes such as infectious diseases28. The aim of this work was identify candidate human SNPs, loci and pathways that are associated with protection or predisposition of the host to pathological sequelae of ocular Ct infections. To do this, we tested the association of ~1.5 million directly genotyped and ~9 million imputed SNPs with disease status in 1090 TS cases and 1531 controls from The Gambia. Using the same GWAS genotyping data we went on to use Pathways of Distinction Analysis (PODA)25 and the Assignment List GO AnnotaTOR (ALIGATOR)26 to test the association between TS and 1345 multi-gene pathways. We finally sought to support the GWAS pathway level analysis features by performing a pathways level analysis on Gene Expression Omnibus (GEO) deposited trachoma data-sets from The Gambia, Tanzania and Ethiopia.

Results

Sample Demographics

The case and control groups were approximately equivalent with respect to gender and ethnicity. 70% of cases and 63% of controls were female. Self-described ethnic composition of the cases was 29% Jola, 27% Mandinka, 21% Wolof, 10% Fula and 13% other/no data. In controls the composition was 25% Jola, 24% Mandinka, 21% Wolof, 8% Fula and 22% other/no data. Median age was 49 (range 32–60) in cases and 37 (range 12–52) in controls (t = −3.6019, P = 0.0003). First pass tests of association in EMMAX led to genome-wide deflation of the test statistics (λ = 0.982 SE = 1.74e−05) (Fig 1A) that could be directly attributed to the differing age and gender distributions between cases and controls. Modelling the phenotype to adjust for age and gender successfully controlled for this and no genome-wide deviation from the null was subsequently observed (λ = 1.001, SE = 8.7 × 10−7) (Fig. 1B). Neither principal components analysis (PCA) (Supplementary Figures 3 & 4), nor tests of proportional identity by state (IBS) variance in PLINK identified significant levels of within or between group genetic variance.
Figure 1

Results of GWAS Analysis using EMMAX.

(A) QQ plot: There is genome-wide deflation (λ = 0.982 SE = 1.74e−05) of the test statistic when the phenotype is corrected for kinship only. (B) QQ plot: Phenotype correction for age, gender and pairwise kinship removed any deviation from the null expectation of the genome-wide test statistic (λ = 1.001, SE = 8.7e−7). (C) Manhattan plot showing index SNPs with PEMMAX < 5 × 10−6.

SNP variants

Twenty-seven genomic regions were identified by an index SNP with PEMMAX < 5 × 10−6 (Fig. 1B, Supplementary Table 1), although none achieved genome-wide significance (PEMMAX < 5 × 10−8). Twelve index SNPs (Table 1 and Supplementary Figure 6) had at least one supporting SNP (either directly genotyped or imputed) in the region that was in high LD (r2 > 0.6) with the index. Five of these were located within non-coding regions of genes including PREX2 (rs111513399), CTNND (rs28731189), PHYH (rs11258313), NSUN6 (rs201134023) and USP6 (rs9895748). The most significant SNP (rs111513399) was in high LD with a number of SNPs with PEMMAX < 1 × 10−5 and was located in close proximity to the site of a common splice variation of unknown biological relevance in PREX2 (Fig. 2).
Table 1

Index SNPs (Pemmax ≤ 5 × 10−6, at least one supporting SNP in LD with R2 > 0.6) for candidate associated regions.

RSIDCHRPOSITION Assembly 3GRCh37.p13ALLELES (effect allele bold)Freq EATypePOR*,**Within gene
rs116828186318854524A,G0.027Genotyped1.37E-060.824
rs28731189511212170C,T0.011Imputed1.42E-060.736CTNND2
rs74291850529577370T,C0.022Imputed4.63E-071.291
rs187984259573681906T,C0.016Imputed1.43E-060.758
rs803178416132398394G,C0.84Genotyped1.97E-061.092
rs116141145797249265A,G0.037Imputed1.24E-061.190
rs111513399869036056A,G0.113Imputed5.38E-071.114PREX2
rs112583131013339766G,A0.233Imputed1.35E-061.101PHYH
rs127745191018925493C,G0.191Imputed2.06E-061.086NSUN6
rs9895748175053598T,A0.068Imputed1.72E-060.872USP6
rs620799451810247566A,C0.048Imputed1.22E-060.842
rs6033064201175527T,C0.803Imputed9.94E-070.909

*Expected frequency of the effect allele.

**The OR indicates the estimated allele frequency odds ratio for the effect allele. Values less than one indicate that the effect allele is less common in cases than controls and vice versa.

Figure 2

Regional Plot of the most significant index SNP region (rs111513399, PREX2).

Window size 250 kb. LD with index SNP (R2 value) is indicated by colour. LD structure was generated from the GWAS data after imputation. The most significant PREX2 region coincides with a common splice variation. Known transcript variants (A: NP_079146.2 and B: NP_079446.3) are indicated by horizontal red lines and exons are indicated by crosshatching verticals.

Pathways analysis

One-hundred-and-three Reactome pathways had an ALIGATOR P value (PALIGATOR) ≦0.05 after a pre-screening round of 100 permutations. Eighty-four candidate pathways had P after 100,000 permutations (P100k) ≦ 0.05 (Table 2). Reactome is arranged in an event hierarchy, a structured relationship table where related pathways share parent terms and where all pathways belong to one of 23 root level event terms. The significant pathways in the ALIGATOR analysis all came under one of nine root level event terms (Cell cycle, Developmental Biology, Disease, Immune system, Metabolism, Metabolism of proteins, Programmed Cell Death, Signal Transduction, Trans-membrane transport of small molecules; Fig. 3). The most significant pathway (P = 0.00001) related to the biology of the innate and adaptive cellular immune response (Table 2), followed by the adaptive response (P = 0.00023) and a number of highly significant and closely related pathways relating to polymorphism in the Fibroblast Growth Factor receptors (FGFR) and their ligands (minimum P = 0.00028). Other significant pathways included mitotic cell cycle processes and several signal transduction pathways, including multiple pathways relating to G-protein coupled receptors (GPCR), Epidermal growth factor receptors (EGFR) and the insulin-like growth factor receptor (IGFR1). The “Disease” pathways appeared to be synonymous with the FGFR pathways.
Table 2

Reactome pathways with significant enrichment in scarring trachoma: ALIGATOR analysis.

Pathway NameP100k
Immunoregulatory interactions between a Lymphoid and a non Lymphoid cell0.00001
Adaptive Immune System0.00023
FGFR3b ligand binding and activation0.00028
FGFR2c ligand binding and activation0.00029
FGFR4 ligand binding and activation0.00048
FGFR3 ligand binding and activation0.00050
FGFR3c ligand binding and activation0.00050
Signaling by activated point mutants of FGFR30.00050
Signaling by FGFR3 mutants0.00050
FGFR ligand binding and activation0.00078
Activated point mutants of FGFR20.00117
FGFR2 ligand binding and activation0.00117
Signaling by FGFR2 mutants0.00117
Phospholipase C mediated cascade0.00139
Glucagon type ligand receptors0.00204
Class B2 Secretin family receptors0.00212
Mitotic G2 G2M phases0.00262
Immune System0.00270
Golgi Cisternae Pericentriolar Stack Reorganization0.00294
Androgen biosynthesis0.00376
activated TAK1 mediates p38 MAPK activation0.00408
SHC mediated cascade0.00450
Loss of Nlp from mitotic centrosomes0.00484
Loss of proteins required for interphase microtubule organization from the centrosome0.00484
G protein betagamma signalling0.00488
G2M Transition0.00515
Downstream signal transduction0.00546
Alpha defensins0.00576
Mitotic Prophase0.00586
G betagamma signalling through PI3Kgamma0.00599
Signaling by PDGF0.00642
Polo like kinase mediated events0.00649
Activation of NOXA and translocation to mitochondria0.00747
FGFR1c ligand binding and activation0.00967
Signaling by activated point mutants of FGFR10.00967
PD 1 signaling0.01000
Cell Cycle Mitotic0.01036
Centrosome maturation0.01103
Recruitment of mitotic centrosome proteins and complexes0.01103
Negative regulation of FGFR signaling0.01125
FRS2 mediated cascade0.01159
Opioid Signalling0.01291
DAP12 interactions0.01315
Insulin receptor signalling cascade0.01502
Effects of PIP2 hydrolysis0.01510
Constitutive PI3KAKT Signaling in Cancer0.01641
FGFR1 ligand binding and activation0.01674
Intrinsic Pathway for Apoptosis0.01743
DAP12 signaling0.01814
IRS related events0.01867
Amino acid and oligopeptide SLC transporters0.01903
G alpha q signalling events0.01928
Transport of inorganic cationsanions and amino acidsoligopeptides0.02043
Translocation of ZAP 70 to Immunological synapse0.02068
Amino acid transport across the plasma membrane0.02111
Signaling by Insulin receptor0.02145
Costimulation by the CD28 family0.02279
IRS related events triggered by IGF1R0.02305
Downstream signaling of activated FGFR0.02430
Chondroitin sulfatedermatan sulfate metabolism0.02463
PI3K Cascade0.02494
TAK1 activates NFkB by phosphorylation and activation of IKKs complex0.02825
IGF1R signaling cascade0.02859
Signaling by Type 1 Insulin like Growth Factor 1 Receptor IGF1R0.02859
Leukotriene receptors0.02905
Cell Cycle0.02944
Activation of BH3 only proteins0.03137
Synthesis Secretion and Inactivation of Glucagon like Peptide 1 GLP 10.03145
Cyclin B2 mediated events0.03186
IRS mediated signalling0.03214
CTLA4 inhibitory signaling0.03366
Acyl chain remodeling of DAG and TAG0.03374
Signaling by EGFR0.03471
A tetrasaccharide linker sequence is required for GAG synthesis0.03491
Signaling by NODAL0.03554
Signaling by EGFR in Cancer0.03722
Gastrin CREB signalling pathway via PKC and MAPK0.03909
HCN channels0.04039
GAB1 signalosome0.04156
Electric Transmission Across Gap Junctions0.04363
Transmission across Electrical Synapses0.04363
Signaling by ERBB20.04826
Signaling by Activin0.04877
Phosphorylation of CD3 and TCR zeta chains0.04903
Figure 3

Summary of pathways analysis with ALIGATOR and PODA.

Blue circle shows root level Reactome hierarchy event terms and stable identifiers with at least one significant pathway under ALIGATOR. Red circle shows same for PODA analysis. Six branches contained significant pathways under both analyses. Ten branches contained no significant pathways in either analysis.

Fifty-one pathways were significant under PODA with a Pathway Distinction Score P value (DSp) ≦ 0.05 after a pre-screening round of 100 random pathway simulations. Thirty-two pathways had a DSp ≦ 0.05 after 1000 simulations (Table 3). The three most significant pathways in PODA analysis all related to the mitotic cell cycle (DSp = 0.001). The single most significant pathway in PODA was “M phase”, which had a discrimination score of 7.8 and DSp of 0.001. Each unit increase in the S score for a sample was therefore estimated to impart a relative increase in risk of being a TS case of 1.84. Other highly significant pathways related to G protein signalling (DSp = 0.001), events surrounding golgi cisternae pericentriolar stack reorganization during mitosis (DSp = 0.001), GABA receptor activation (DSp = 0.002) and adherens junction organisation (DSp = 0.003). PODA also identified several pathways related to insulin signalling/glucose regulation and the T cell mediated immune response (CD3, ZAP70). All significant PODA pathways came under one of ten root-level Reactome event terms (Cell cycle, Cell-Cell communication, Disease, Extracellular matrix organisation, Haemostasis, Immune System, Metabolism, Metabolism of proteins, Neuronal system, Signalling transduction; Fig. 3).
Table 3

Reactome pathways with significant enrichment in scarring trachoma: PODA analysis.

NameDSDSpOR
M Phase7.7740.0011.842
Mitotic G2-G2/M phases6.9350.0011.649
G2/M Transition6.6630.0011.623
G alpha (z) signalling events4.8030.0011.525
Golgi Cisternae Pericentriolar Stack Reorganization4.1460.0011.239
Proteolytic cleavage of SNARE complex proteins3.9550.0011.230
GABA receptor activation5.6160.0021.595
Mitotic Prophase5.2460.0021.392
Adherens junctions interactions4.9900.0031.547
Cell-cell junction organization5.1000.0041.641
Synthesis of very long-chain fatty acyl-CoAs3.8330.0051.323
Phosphorylation of CD3 and TCR zeta chains3.6680.0051.268
G-protein beta:gamma signalling4.1590.0061.388
Calmodulin induced events4.0200.0071.441
Glucagon signaling in metabolic regulation4.2660.0071.425
Loss of Nlp from mitotic centrosomes5.1810.0111.551
Loss of proteins required for interphase microtubule organization from the centrosome5.1810.0111.551
G beta:gamma signalling through PI3Kgamma3.9480.0111.350
Botulinum neurotoxicity3.6080.0121.238
Inhibition of Insulin Secretion by Adrenaline/Noradrenaline3.6890.0131.332
Synthesis of PIPs at the late endosome membrane2.7980.0141.169
Insulin Processing3.8340.0151.346
Mitotic Metaphase and Anaphase7.0290.0181.760
PLC beta mediated events4.2050.0241.541
Mitotic M-M/G1 phases7.4610.0271.903
PKA activation in glucagon signalling3.2430.0311.322
GPVI-mediated activation cascade3.8870.0351.403
Translocation of ZAP-70 to Immunological synapse2.6310.0351.177
Assembly of collagen fibrils and other multimeric structures4.4030.0461.454
Inhibition of the proteolytic activity of APC/C required for the onset of anaphase by mitotic spindle checkpoint components3.0240.0481.261
Adenylate cyclase inhibitory pathway2.6370.0481.247
Inhibition of adenylate cyclase_pathway2.6370.0481.247

DS: Discrimination Score. DSp: P value for DS based on 1000 permutations, OR: Odds ratio of being a case, per unit increase in S score.

From a combined list of 111 unique significant pathways, 79 were significant in ALIGATOR only; 27 were significant in PODA only and 5 were significant in both ALIGATOR and PODA. The pathways that were significant in both analyses were “Golgi Cisternae Pericentriolar Stack Reorganization” (P100k = 0.0029, DSP = 0.001), “Mitotic Prophase” (P100k = 0.0059, DSP = 0.002), “Phosphorylation of CD3 and TCR zeta chains” (P100k = 0.049, DSP = 0.005), “Loss of Nlp from mitotic centrosomes” (P100k = 0.0048, DSP = 0.011) and “Loss of proteins required for interphase microtubule organization from the centrosome” (P100k = 0.0029, DSP = 0.011). There was substantial redundancy and gene overlap between the pathways that were significant in the pathways analysis and hierarchical clustering identified nine clusters of closely related pathways with supporting evidence for TS association in both PODA and ALIGATOR. Each cluster had an approximate UA value > 90 and at least one pathway that was significant in each of PODA and ALIGATOR. GO terms describing the gene content of the 9 clusters are indicated in Table 4.
Table 4

Gene Ontology terms associated with pathways of significance in PODA and ALIGATOR.

GO Terms: Biological Process#genes in listP value for GO term*UA value for cluster**
Microtubule-based Process4432.3 × 10−21100
G-protein coupled Receptor protein signalling pathway3531.3 × 10−11691
Cellular response to hormone stimulus532.5 × 10−24100
Cell surface receptor linked signal transduction1472.4 × 10−2497
Cell Cycle13804.8 × 10−189100
Regulation of T cell activation822.1 × 10−15100
Sodium Ion Transport1752.1 × 10−61100
Phosphorous metabolic process364.3 × 10−298
Regulation of phosphorylation1312.6 × 10−1397

*Modified Fisher exact P, EASE SCORE.

**Unbiased Alpha.

Supporting data from independent trachoma Transcriptome Analysis

Pathways level enrichment analysis was performed in four published transcriptome data sets (GSE23705, GSE24383, GSE20436, GSE20430) including two (GSE20436, GSE20430) that included specimens from Gambian individuals who were distinct from those sampled for the GWAS data set. Highly enriched Gene Ontology Biological Processes (GO:BP) and Reactome events were identified in each transcriptome and these are shown in Table 5. The most frequently identified GO:BPs were “Immune Response” and “Cell Cycle”. At pathway level we identified a total of 7 stable Reactome pathways among the 4 GSE mRNA transcriptome series. Overall the analysis of the event hierarchy within Reactome reinforced that the majority of pathways identified were related to either cell cycle (REACT_115566) or the immune system (REACT_6900), where signalling in immune system was the most frequently recognised pathway).
Table 5

Matrix of Gene Ontology: Biological Process for trachoma transcriptome co-expression modules (FDR P values).

Transcriptome series
GEO Accession NumberGSE23705GSE24383GSE20436GSE20430
PhenotypeTrichiasisScarringActive trachomaActive trachoma
Country of originEthiopiaTanzaniaGambiaGambia
Number of Specimens in Series42426029
Specimen TypeConjunctival swab
Disease States (see footer)abcd
PlatformIllumina WG-6 v3.0Illumina WG-6 v3.0Affymetrix U133A plus 2.0Affymetrix HG-focused
Pathway Enrichment (FDR P values)
Cell cycle1.4 × 10−321.7 × 10−64
Cell cycle - M Phase1.9 × 10−19
Regulation of Apoptosis1.9 × 10−9
Immune Response5.6 × 10−43.1 × 10−73.4 × 10−172.3 × 10−3
Lymphocyte activation4 × 10−269.7 × 10−10
Cell activation8.9 × 10−5
Defense Response1 × 10−106 × 10−6
Response to virus9.3 × 10−262.5 × 10−11
T cell activation2 × 10−12
Chemotaxis3 × 10−5
Ectoderm development2.1 × 10−124.9 × 10−8
Transcription1.7 × 10−8
Translation2.1 × 10−21
Translational elongation4 × 10−148.7 × 10−34

aTrachomatous Trichiasis with and without inflammation (no current C. trachomatis infection).

bTrachomatous conjunctival scarring with and without inflammation (no current C. trachomatis infection).

cActive follicular disease with and without C. trachomatis infection.

dActive follicular disease with and without C. trachomatis infection.

Discussion

This is the first GWAS study of chlamydial disease. We identified twelve regions of association with PEMMAX < 5 × 10−6 for which there was at least one supporting SNP in LD with R2 > 0.6. Five of these SNPs were in the regions of genes and some of these may have biological relevance to chlamydial infection and disease (Supplementary Table 2). This study was only modestly powered to detect main effects at the genome-wide level of significance and whilst they are intriguing, the associations that we have observed are at present unconfirmed and will require validation in replication studies; followed by fine mapping in order to identify the underlying causal variants or genes. The leading SNP-identified candidate gene from this study was PREX2 (Phosphatidylinositol-3,4,5-trisphosphate-dependent Rac exchange factor 2), a Guanine Nucleotide Exchange Factor (GNEF) and G-protein coupled receptor (G-PCR). PREX2 is known to interact with both Rac and the PI3K inhibitor PTEN29 (Fig. 4). Other GNEFs acting upstream of PI3K and Rac have been shown to directly interact with chlamydial TARP30, a key molecule that transduces the earliest signals between the chlamydial body and the host cell31. PREX2 variants may therefore play a key role in protecting the host cell from Ct entry.
Figure 4

PREX2 is closely involved in processes surrounding TARP mediated Chlamydial entry.

Downstream signalling via RAC leads to changes in cell cycle control and actin skeleton rearrangements that facilitate infection. PREX2 can indirectly mediate downstream changes to cell cycle control and glucose homeostasis via RAC and Akt/p53.

In this study we found no association with rs4149310 (Chr9:107589134) and rs7648467 (Chr3:45936322), two SNPs that were recently predicted to be under selection by trachoma32. We note however that the classification of exposed and non-exposed populations in that study better reflects current exposure to ocular infection than historical endemicity for scarring disease. We also did not confirm the findings of a number pre-GWAS era candidate studies1213153334 carried out by our group. No SNP (PEMMAX < 0.01) was detected in any of the genes IL8, IL10, CSF2, IFNG, HP, CCL8 or MMP9; all of which had been previously reported to associate with trachoma. The previous candidate gene studies had small sample sizes as well as a high burden of adjusted testing. They also were unable to correct for cryptic relatedness between participants, which might have inflated the test statistics. Whilst there are many possible reasons for this failure to verify the antecedent studies, we believe that the most probable explanation is that they reported false positive associations. It is however possible that the previous studies reported true positive findings, but that the GWAS SNPs that were genotyped or imputed in the region of these genes were ineffective markers for causative SNPs in the region, as has been demonstrated in another study22. The ALIGATOR/PODA analyses identified a number of highly significantly enriched pathways (Tables 2 & 3) and a joint analysis (Table 4) identified a set of highly enriched GO:BP terms that were prominent among the findings of both ALIGATOR and PODA. The GO:BP terms were compatible with the findings of earlier research in to the biology of chlamydial disease (Supplementary Table 3). The results of these analyses particularly highlighted roles for the immune system, the cell cycle and surface receptor signalling; with metabolism related pathways being a less prominent but still significant feature of the results. As trachoma is a disease primarily characterised by immune mediated pathology, it is perhaps unsurprising that immune response pathways featured prominently in both the PODA and ALIGATOR analyses. The most significant pathway from ALIGATOR analysis referred to cellular immunity and included both adaptive and innate immune response genes. T cell mediated immunity was also highlighted directly by both methods. The role of complex immune response genes may still have been underestimated by this study as GWAS has limitations with regards to studying immune response genetics; most particularly because the highly polymorphic gene systems that encode the primary innate and adaptive cellular immunoreceptors, including the Human Leucocyte Antigens (HLA) and Killer-cell Immunoglobulin-like Receptors (KIR) are not well covered by genome-wide SNP arrays. Immune response genes are often inconsistently annotated in ENSEMBL and ENTREZ, with the consequence that they may be poorly represented in the gene lists used in the pathways analysis. The imputation and QC strategies are also likely to reduce the information that is available from complex regions. Previous studies have pointed towards an important, but functionally complex role for both the HLA1116171935 and KIR11 systems in TS and to fully appreciate the extent of immunogenetic associations with TS, future studies will be required to perform full sequence resolution genotyping of immunoreceptor genes in large and well powered studies. G-PCR signalling pathways were significantly enriched in our pathways analyses. In a recent report from a GWAS study of Chlamydia muridarum infection in the BXD advanced recombinant inbred mouse36, Su and colleagues reported eleven candidate associations with murine oviduct or uterine disease severity. Of these, four were G-PCR signalling molecules36. Should the findings from GWAS studies of chlamydial STIs in mice and on-going GWAS in human STI contexts continue to overlap substantially with our own findings, then these are important results providing parallels between tissue tropisms and species. Many of the significantly enriched pathways (including FGF, hormone receptor and GPCR signalling pathways) converge on events surrounding PI3K and the downstream Akt/mdm2/Caspase9/p53 axis of cell cycle control. A number of papers add support for the p53 tumour suppressor gene being a key player in mediating responses to Ct infection3738394041424344. This protein may be linked to both G2/M arrest and up-regulation of the pro-fibrotic molecules during Ct infection45. The GWAS and transcriptome data sets of active trachoma (GSE20436, GSE20430) and scarring disease (GSE23705, GSE24383) were obtained using distinct approaches and from separate population samples of trachoma endemic communities in Ethiopia, Tanzania and The Gambia. Both “cell cycle” and “immune system” modules were consistently detected in association with disease in each (Table 5) of the four studies. These findings are complementary to the main findings of the GWAS pathways analysis. Many of the prioritized SNPs, genes and pathways that we identified in this study are known to be functionally linked to one another, as well as to systems that are known to be involved in trachomatous scar formation. A simplified summary of these interactions is presented in Fig. 5, which is derived from our own data and from multiple publicly available open databases (e.g. Genecards, NCBI, Reactome and others). The figure, which shows the convergence of the systems on pathways of cell cycle control, is a model building interpretation of our results that requires experimental validation.
Figure 5

Trachoma associated genes and pathways.

Potential roles for candidate genes (red) that were identified through this GWAS are indicated. Various significant cell surface receptors pathways including FGFR, GPCR, ILGFR1 and GLPR1 are linked to cell cycle control by PI3K/Akt/p53 signalling. Chlamydial elementary bodies are known to interact with this system via sos1 and vav2. Downstream signalling from these pathways can lead to actin remodelling (facilitating cell entry), cell cycle arrest and inhibition of apoptosis; all factors that facilitate parasitism. Glucose and sodium ion homeostasis resulting from p53/cell-cycle control may increase nutrient availability to the growing inclusion. Up-regulation of NFKB, CTGF, MMP9 and TGFB are potential routes to fibrosis.

In the context of our data, we propose that innate barriers to the intracellular lifestyle, centring on cell cycle control, may be as important as a well-regulated and proportionate cellular immune response in controlling the pathological sequelae of chlamydial infections.

Methods

Ethics statement

Specimens included in this study were obtained from archival stocks of DNA and were used anonymously. All participants had previously consented to the use of their DNA in a study of genetic associations with trachoma (MRC Gambia study codes SCC598, SCC721, SCC729, SCC804, SCC857 and SCC1177) or Chlamydia related tubal infertility (SCC786 and SCC804). Written informed consent was obtained from all adult participants and from a parent/guardian on behalf of those subjects aged under 18 years who wished to take part in the studies. The Ethics Committee of the Gambian Government/Medical Research Council Unit and of the London School of Hygiene & Tropical Medicine approved the antecedent studies for which initial consent was taken. Project approvals for SCC729 and SCC857 were both updated in L 2003.46. All studies were conducted in accordance with the tenets of the Declaration of Helsinki.

Study population, sampling and ascertainment

The mixed-ethnicity case-control sample was ascertained in multiple rural regions of The Gambia, West Africa. Community screening for trachoma identified cases and each case was asked to identify an unrelated, same-sex member of their community who was also a member of the same ‘kafo’ as the case. A kafo (Mandinka) is a social network of similarly aged individuals of the same gender who are born into the same community. Samples for DNA analysis were collected from buccal mucosae using sterile cyto-brushes (Part Number F-440151, SLS, Nottingham, UK). DNA extraction was performed using either a salting out procedure or the QIAamp Blood DNA mini kit (Part Number 51106, Qiagen, Manchester, UK). Genomic DNA underwent whole genome amplification by multiple displacement amplification using the Repli-g Midi-Kit (Qiagen, Manchester, UK). Amplified DNA was quantified using PicoGreen (Life Technologies, Paisley, UK), normalised to a standard concentration and analysed by Agilent 2100 Bioanalyzer (Agilent Technologies, Stockport, UK) to verify DNA quality and integrity.

Trachoma phenotypes

Trachoma was graded in the field using the WHO simplified grading system. The field graders were regularly checked for quality and accuracy of grading as indicated in the manual of operations for the PRET clinical trial46. A subject was considered to be a case if they could be defined according to the WHO simplified system as having TS in either eye.

GWAS genotyping and SNP Quality Control

Specimens (n = 2956) were genotyped at 2,379,855 SNPs using the HumanOmni2.5-8v1_A (Illumina Inc, San Diego, CA. USA). Three genotype-calling algorithms (Illuminus47, GenCall48 and GenoSNP49) were used on the initial set of SNPs (n = 2,379,855). Data from each algorithm was filtered to retain only SNPs with a call rate ≥ 0.98. The number of SNPs retained was 1,403,253 in Illuminus, 219,259 in GenCall and 929,088 in GenoSNP (Supplementary Figure 1). To obtain a merged set of SNPs, all genotypes that matched across call-sets were retained, whilst those that mismatched between call-sets were set to missing (Supplementary Figure 2). Genotypes that were present in one call set and missing in others were also retained (Supplementary Figure 2). The merged SNP set contained 1,467,876 SNPs. Merged SNPs were finally retained for analysis if (a) the call rate ≥ 0.99 and (b) the Hardy-Weinberg equilibrium P value < 5 × 10−8. 1,457,295 directly genotyped SNPs were retained after quality control (QC) (Supplementary Figure 1).

Specimen Quality Control and statistical power

Individuals were removed if they were identified as being outliers because they had: (a) > 5% missing genotype data. (b) Genome-wide heterozygosity > 1.96 × standard deviation of the sample wide average genome-wide heterozygosity. (c) Average identity by state with all other individuals > 0.05. (d) Identity by descent sharing with another individual of two alleles at all loci. (e) Identity by state with the fifth nearest neighbour with Z < −4 compared to the mean IBS of all possible pairs. (f) Unresolved gender mismatch between sex chromosome genotypes and clinical record. After these QC steps, 2621 specimens were retained (Supplementary Figure 1). Tests of proportional IBS variance were performed in PLINK. Analysis of population stratification by supervised PCA was implemented in Eigenstrat smartPCA5051 (Supplementary Methods and Supplementary Figures 3 & 4). Familial relationships within the sample were identified using an analysis of pairwise identity by state/allele sharing (Supplementary Figure 5) in PLINK and R. The STATA “power twoproportions” command was used to estimate the power of the study to detect genome-wide significant (α < 1 × 10−8) associations. At this level of significance, a study of 1090 cases and 1531 controls has 80% power to detect allele frequency odds ratios (OR) of 2.61, 2.11, 1.81 and 1.72 for minor allele frequencies (MAF) in the control group of 0.05, 0.1, 0.2 and 0.3 respectively.

Imputation

Imputation was carried out as described by Howie et al.52. Shapeit5354 was used to pre-phase using data from HapMap Phase II, build 37. Imputation was performed with IMPUTE2 5255 and utilized data from 1092 reference samples included in the worldwide 1000 Genomes phase I data set56. Post imputation filtering was based on Southam et al.57. SNPs with an IMPUTE2 info score <0.8 and/or MAF <0.01 were discarded. In total, 28,755,674 SNPs were imputed. After filtering for quality 11,851,747 SNPs were retained (Supplementary Figure 1).

Tests of association

Tests of association of SNPs with age and sex corrected trachoma phenotypes were performed using EMMAX58 and were informed by an IBS ‘kinship’ matrix (*.hIBS.kinf format) generated from the directly genotyped SNP data in PLINK. In order to compensate for the relatively lower age distribution of the control set, we modeled the phenotype with age and gender under logistic regression and then used the residuals of this analysis as an age and gender corrected TS phenotype in the EMMAX test. Highlighted SNPs were annotated using SNPNexus (http://snp-nexus.org).

Pathways analyses

ALIGATOR26 was performed using the SNPath R package (linchen.fhcrc.org/grass.html) and 1345 Reactome pathways59 (www.reactome.org, accessed April 2013), also known as “events”. The input data were the P values from EMMAX tests of association. These were thinned to remove SNPs that were >20 kb away from any gene in a list of ~17,000 genes that were consistently cross-referenced between Entrez and Ensembl and for which a HUGO gene name had been assigned (http://www.gettinggeneticsdone.com/2011_06_01_archive.html). Genes in this list are likely to be included in pathway lists (such as Reactome), whilst genes with inconsistent cross-referencing are unlikely to be included. ALIGATOR counts the number of genes in a pathway that contain a SNP with an EMMAX P value (PEMMAX) value more extreme than a nominally significant threshold value. We set this threshold at PEMMAX < 0.001 (Supplementary Methods and Supplementary Table 4). Pathways of Distinction Analysis (PODA)25 was performed (Supplementary Methods) on directly genotyped GWAS data using 1345 Reactome59 pathways and the PODA script for R (http://braun.tx0.org/PODA/). A proportional gene content intersection between the members of a combined list of significant pathways from ALIGATOR and PODA was used to detect functional redundancy and gene content overlap in significant pathways. For each possible pair of pathways, the number of intersecting genes between pathways was divided by the number of genes in the union of the two pathways. This generated a distance matrix that was subjected to hierarchical clustering with multi-scale bootstrap sampling using the pvclust R package (Suzuki & Shimodaira 2014, http://CRAN.R-project.org/package = pvclust). Clusters of interest were identified as having an approximate unbiased alpha (UA) value greater than 90 and containing at least one pathway that was significant in each of the PODA and ALIGATOR results. UA is a probability measure where UA = 90 indicates that there is 90% confidence that the pathways form a cluster. The combined gene content from all pathways in each cluster were then functionally annotated with Gene Ontology: Biological Process terms using DAVID Bioinformatics Resources 6.7 (accessed 05/2015)60.

Supporting data from independent trachoma Transcriptome Analysis

We sought to support the GWAS pathway level analysis features by performing a pathways level analysis on GEO deposited trachoma data-sets from the Gambia, Tanzania and Ethiopia (Table 5). Each transcriptome data set was reanalysed in a standardised way in which differential expression was calculated using GEO2R (Limma)61. Pearson correlation network graphs were then generated and partitioned into co-expression clusters by Markov-chain clustering using Biolayout express 3D62. The top level features of complete networks and co-expression clusters were then extracted by interrogation using NCBI DAVID v6.7 and where 5% FDR significant pathways were identified in Reactome via DAVID, these gene sets were directly queried within the Reactome database (accessed April 2013) to describe pathway hierarchical structure and fine level features from the same pathway database used in the GWAS analysis. For each data set we obtained an FDR adjusted p-value queried against the 1345 Reactome pathways. For each of the top 12 Reactome pathways identified by ALIGATOR/PoDA (Tables 2 and 3) we identified the event hierarchy and the associated p-value of the sub-pathway from transcriptome gene-set enrichment analysis.

Data Dissemination

Managed access to the individual-level genotypes, TS phenotypes, age and gender data will available to all appropriately qualified researchers from academia, charitable organizations and private companies in the UK or abroad under the terms of the Wellcome Trust Community Access Policy and via the European Genome-phenome Archive (EGA at EMBL-EBI). Details of how to access the data will be available on the project information page at the EGA website. The EGA study accession number for this project is EGAS00001001516.

Additional Information

Accession codes: The EGA study accession number for this project is EGAS00001001516. How to cite this article: Roberts, C. et al. Conjunctival fibrosis and the innate barriers to Chlamydia trachomatis intracellular infection: a genome wide association study. Sci. Rep. 5, 17447; doi: 10.1038/srep17447 (2015).
  59 in total

1.  Polymorphisms in candidate genes and risk of scarring trachoma in a Chlamydia trachomatis--endemic population.

Authors:  N Mozzato-Chamay; O S Mahdi; O Jallow; D C Mabey; R L Bailey; D J Conway
Journal:  J Infect Dis       Date:  2000-10-09       Impact factor: 5.226

2.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

3.  GenoSNP: a variational Bayes within-sample SNP genotyping algorithm that does not require a reference population.

Authors:  Eleni Giannoulatou; Christopher Yau; Stefano Colella; Jiannis Ragoussis; Christopher C Holmes
Journal:  Bioinformatics       Date:  2008-07-24       Impact factor: 6.937

4.  Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder.

Authors:  Peter Holmans; Elaine K Green; Jaspreet Singh Pahwa; Manuel A R Ferreira; Shaun M Purcell; Pamela Sklar; Michael J Owen; Michael C O'Donovan; Nick Craddock
Journal:  Am J Hum Genet       Date:  2009-06-18       Impact factor: 11.025

5.  Scarring trachoma is associated with polymorphism in the tumor necrosis factor alpha (TNF-alpha) gene promoter and with elevated TNF-alpha levels in tear fluid.

Authors:  D J Conway; M J Holland; R L Bailey; A E Campbell; O S Mahdi; R Jennings; E Mbena; D C Mabey
Journal:  Infect Immun       Date:  1997-03       Impact factor: 3.441

6.  Pathways of distinction analysis: a new technique for multi-SNP analysis of GWAS data.

Authors:  Rosemary Braun; Kenneth Buetow
Journal:  PLoS Genet       Date:  2011-06-09       Impact factor: 5.917

7.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

8.  Reactome knowledgebase of human biological pathways and processes.

Authors:  Lisa Matthews; Gopal Gopinath; Marc Gillespie; Michael Caudy; David Croft; Bernard de Bono; Phani Garapati; Jill Hemish; Henning Hermjakob; Bijay Jassal; Alex Kanapin; Suzanna Lewis; Shahana Mahajan; Bruce May; Esther Schmidt; Imre Vastrik; Guanming Wu; Ewan Birney; Lincoln Stein; Peter D'Eustachio
Journal:  Nucleic Acids Res       Date:  2008-11-03       Impact factor: 16.971

9.  Identification of novel single nucleotide polymorphisms in inflammatory genes as risk factors associated with trachomatous trichiasis.

Authors:  Berna Atik; Troy A Skwor; Ram Prasad Kandel; Bassant Sharma; Him Kant Adhikari; Lori Steiner; Henry Erlich; Deborah Dean
Journal:  PLoS One       Date:  2008-10-31       Impact factor: 3.240

10.  Genome-wide and fine-resolution association analysis of malaria in West Africa.

Authors:  Muminatou Jallow; Yik Ying Teo; Kerrin S Small; Kirk A Rockett; Panos Deloukas; Taane G Clark; Katja Kivinen; Kalifa A Bojang; David J Conway; Margaret Pinder; Giorgio Sirugo; Fatou Sisay-Joof; Stanley Usen; Sarah Auburn; Suzannah J Bumpstead; Susana Campino; Alison Coffey; Andrew Dunham; Andrew E Fry; Angela Green; Rhian Gwilliam; Sarah E Hunt; Michael Inouye; Anna E Jeffreys; Alieu Mendy; Aarno Palotie; Simon Potter; Jiannis Ragoussis; Jane Rogers; Kate Rowlands; Elilan Somaskantharajah; Pamela Whittaker; Claire Widden; Peter Donnelly; Bryan Howie; Jonathan Marchini; Andrew Morris; Miguel SanJoaquin; Eric Akum Achidi; Tsiri Agbenyega; Angela Allen; Olukemi Amodu; Patrick Corran; Abdoulaye Djimde; Amagana Dolo; Ogobara K Doumbo; Chris Drakeley; Sarah Dunstan; Jennifer Evans; Jeremy Farrar; Deepika Fernando; Tran Tinh Hien; Rolf D Horstmann; Muntaser Ibrahim; Nadira Karunaweera; Gilbert Kokwaro; Kwadwo A Koram; Martha Lemnge; Julie Makani; Kevin Marsh; Pascal Michon; David Modiano; Malcolm E Molyneux; Ivo Mueller; Michael Parker; Norbert Peshu; Christopher V Plowe; Odile Puijalon; John Reeder; Hugh Reyburn; Eleanor M Riley; Anavaj Sakuntabhai; Pratap Singhasivanon; Sodiomon Sirima; Adama Tall; Terrie E Taylor; Mahamadou Thera; Marita Troye-Blomberg; Thomas N Williams; Michael Wilson; Dominic P Kwiatkowski
Journal:  Nat Genet       Date:  2009-05-24       Impact factor: 38.330

View more
  9 in total

1.  Genotype-Phenotype Associations of IL6 and PRG4 With Conjunctival Fibrosis After Glaucoma Surgery.

Authors:  Cynthia Yu-Wai-Man; Aristides D Tagalakis; Jinhong Meng; Yann Bouremel; Richard M H Lee; Alex Virasami; Stephen L Hart; Peng T Khaw
Journal:  JAMA Ophthalmol       Date:  2017-11-01       Impact factor: 7.389

2.  Fibroblasts profiling in scarring trachoma identifies IL-6 as a functional component of a fibroblast-macrophage pro-fibrotic and pro-inflammatory feedback loop.

Authors:  Jenny Z Kechagia; Daniel G Ezra; Matthew J Burton; Maryse Bailly
Journal:  Sci Rep       Date:  2016-06-20       Impact factor: 4.379

3.  Differential frequency of NKG2C/KLRC2 deletion in distinct African populations and susceptibility to Trachoma: a new method for imputation of KLRC2 genotypes from SNP genotyping data.

Authors:  Adriana Goncalves; Pateh Makalo; Hassan Joof; Sarah Burr; Athumani Ramadhani; Patrick Massae; Aiweda Malisa; Tara Mtuy; Tamsyn Derrick; Anna R Last; Meno Nabicassa; Eunice Cassama; Joanna Houghton; Christine D Palmer; Harry Pickering; Matthew J Burton; David C W Mabey; Robin L Bailey; Martin R Goodier; Martin J Holland; Chrissy H Roberts
Journal:  Hum Genet       Date:  2016-06-16       Impact factor: 4.132

4.  Progression of scarring trachoma in Tanzanian children: A four-year cohort study.

Authors:  Athumani M Ramadhani; Tamsyn Derrick; David Macleod; Patrick Massae; Elias Mafuru; Aiweda Malisa; Kelvin Mbuya; Chrissy H Roberts; William Makupa; Tara Mtuy; Robin L Bailey; David C W Mabey; Martin J Holland; Matthew J Burton
Journal:  PLoS Negl Trop Dis       Date:  2019-08-14

5.  Low Prevalence of Conjunctival Infection with Chlamydia trachomatis in a Treatment-Naïve Trachoma-Endemic Region of the Solomon Islands.

Authors:  Robert M R Butcher; Oliver Sokana; Kelvin Jack; Colin K Macleod; Michael E Marks; Eric Kalae; Leslie Sui; Charles Russell; Helena J Tutill; Rachel J Williams; Judith Breuer; Rebecca Willis; Richard T Le Mesurier; David C W Mabey; Anthony W Solomon; Chrissy H Roberts
Journal:  PLoS Negl Trop Dis       Date:  2016-09-07

6.  Immunofibrogenic Gene Expression Patterns in Tanzanian Children with Ocular Chlamydia trachomatis Infection, Active Trachoma and Scarring: Baseline Results of a 4-Year Longitudinal Study.

Authors:  Athumani M Ramadhani; Tamsyn Derrick; David Macleod; Patrick Massae; Tara Mtuy; David Jeffries; Chrissy H Roberts; Robin L Bailey; David C W Mabey; Martin J Holland; Matthew J Burton
Journal:  Front Cell Infect Microbiol       Date:  2017-09-15       Impact factor: 5.293

7.  Pathway-Wide Genetic Risks in Chlamydial Infections Overlap between Tissue Tropisms: A Genome-Wide Association Scan.

Authors:  Chrissy H Roberts; Sander Ouburg; Mark D Preston; Henry J C de Vries; Martin J Holland; Servaas A Morré
Journal:  Mediators Inflamm       Date:  2018-06-03       Impact factor: 4.711

8.  Immunopathogenesis of Progressive Scarring Trachoma: Results of a 4-Year Longitudinal Study in Tanzanian Children.

Authors:  Tamsyn Derrick; Athumani M Ramadhani; David Macleod; Patrick Massae; Elias Mafuru; Malisa Aiweda; Kelvin Mbuya; William Makupa; Tara Mtuy; Robin L Bailey; David C W Mabey; Martin J Holland; Matthew J Burton
Journal:  Infect Immun       Date:  2020-03-23       Impact factor: 3.441

9.  Human genetic diversity regulating the TLR10/TLR1/TLR6 locus confers increased cytokines in response to Chlamydia trachomatis.

Authors:  Alyson B Barnes; Rachel M Keener; Benjamin H Schott; Liuyang Wang; Raphael H Valdivia; Dennis C Ko
Journal:  HGG Adv       Date:  2021-11-25
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.