Literature DB >> 31197946

Comprehensive bioinformatics analysis of trabecular meshwork gene expression data to unravel the molecular pathogenesis of primary open-angle glaucoma.

Ilona Liesenborghs1,2, Lars M T Eijssen3,4, Martina Kutmon2,3, Theo G M F Gorgels1,5, Chris T Evelo2,3, Henny J M Beckers1, Carroll A B Webers1, Johannes S A G Schouten1,6.   

Abstract

PURPOSE: Performing bioinformatics analyses using trabecular meshwork (TM) gene expression data in order to further elucidate the molecular pathogenesis of primary open-angle glaucoma (POAG), and to identify candidate target genes.
METHODS: A systematic search in Gene Expression Omnibus and ArrayExpress was conducted, and quality control and preprocessing of the data was performed with ArrayAnalysis.org. Molecular pathway overrepresentation analysis was performed with PathVisio using pathway content from three pathway databases: WikiPathways, KEGG and Reactome. In addition, Gene Ontology (GO) analysis was performed on the gene expression data. The significantly changed pathways were clustered into functional categories which were combined into a network of connected genes.
RESULTS: Ninety-two significantly changed pathways were clustered into five functional categories: extracellular matrix (ECM), inflammation, complement activation, senescence and Rho GTPase signalling. ECM included pathways involved in collagen, actin and cell-matrix interactions. Inflammation included pathways entailing NF-κB and arachidonic acid. The network analysis showed that several genes overlap between the inflammation cluster on the one hand, and the ECM, complement activation and senescence clusters on the other hand. GO analysis, identified additional clusters, related to development and corticosteroids.
CONCLUSION: This study provides an overview of the processes involved in the molecular pathogenesis of POAG in the TM. The results show good face validity and confirm findings from histological, biochemical, genome-wide association and transcriptomics studies. The identification of known points of action for drugs, such as Rho GTPase, arachidonic acid, NF-κB, prostaglandins and corticosteroid clusters, supports the value of this approach to identify potential drug targets.
© 2019 The Authors. Acta Ophthalmologica published by John Wiley & Sons Ltd on behalf of Acta Ophthalmologica Scandinavica Foundation.

Entities:  

Keywords:  bioinformatics; gene ontology; glaucoma; molecular pathogenesis; network analysis; pathway analysis; primary open-angle glaucoma; trabecular meshwork

Mesh:

Substances:

Year:  2019        PMID: 31197946      PMCID: PMC7004120          DOI: 10.1111/aos.14154

Source DB:  PubMed          Journal:  Acta Ophthalmol        ISSN: 1755-375X            Impact factor:   3.761


Introduction

Primary open‐angle glaucoma (POAG) is a potentially sight threatening and complex disease in which the trabecular meshwork (TM) has been proven to play a crucial role (Tektas & Lutjen‐Drecoll 2009). It is well known that an increase in intra‐ocular pressure is the greatest and most common risk factor for POAG (Sommer et al. 1991). Despite the numerous data that has been collected in research, the molecular pathogenesis of POAG remains largely obscure. Consequently, an effective treatment option addressing these molecular changes is also still missing. Re‐using, re‐evaluating and integrating the existing data using state‐of‐the‐art bioinformatics approaches may provide an effective and efficient way to improve our understanding of the molecular pathogenesis of primary open‐angle glaucoma. During the past decades, biochemical and histological in vitro studies have been performed in order to identify the pathogenesis of POAG. However, the number of molecules that can be investigated within these studies is limited. Using omics technologies, we are currently able to measure the expression of several thousands of molecules from one sample of affected tissue, leading to an exponential increase in data. One example of omics studies is transcriptomics studies which investigate the differences in gene expression by comparing the levels of RNA for each gene in the healthy and POAG‐diseased TM tissue (Diskin et al. 2006; Liton et al. 2006; Wang et al. 2008; Liu et al. 2013). Genes that are found to be differentially expressed are likely to be associated with POAG. However, a gene is only a small fraction of the molecular processes it is involved in. Pathways represent such molecular processes and show the genes that are involved within these processes and how they are related to other molecules. By identifying the pathways in which the differentially expressed genes are involved, we can identify disease processes that are likely to play a role in the pathogenesis of POAG. The aim of this study was to expand the understanding of the molecular pathogenesis of POAG by applying integrative bioinformatics analysis on the available human gene expression data of the TM in patients with POAG and controls. The step‐by‐step approach applied in this study allows a systematic quality check, investigation and interpretation of the available gene expression data from TM tissue. The obtained results enable us to identify possible drug targets to modulate the disease outcome.

Methods

Systematic search

We conducted a systematic search in two commonly used public repositories, Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) (Edgar et al. 2002; Barrett et al. 2013) and ArrayExpress (http://www.ebi.ac.uk/arrayexpress) (Kolesnikov et al. 2015), to select and download publicly available data of microarray studies. In each of these studies, the gene expression profiles between human trabecular meshwork cells of participants with and without POAG were compared. The keywords used in this search were primary open‐angle glaucoma, POAG, glaucoma, gene expression profiling, microarray and trabecular meshwork (cells). The first search was performed on 2016‐09‐20 and last updated on 2018‐11‐19.

Quality control and preprocessing of the data

After the systematic search, the data of each identified study were subjected to a thorough quality control and preprocessed to use as input for further analysis. If necessary, the authors of the datasets were approached to obtain additional information. ArrayAnalysis.org (Eijssen et al. 2013), an online platform with standardized scripts for quality control and preprocessing of gene expression data was used to do this. Where possible, we used the raw data (non‐processed data) and evaluated data quality before and after normalization. The results of the quality control of the original data were visualized in multiple plots to allow assessment of the homogeneity of the dataset, the signal strength of the different samples in the dataset, the correlation of expression and the way in which the samples cluster or contain outliers. Studies of low quality were excluded for further analysis. Within the remaining studies, samples that appeared divergent throughout the quality control were also excluded for further analysis. The quality assessment of the studies and samples was performed by two researchers, independently from each other. Thereafter, the results were compared, and in case of disagreement, consensus was achieved by discussion. After data preprocessing, statistical analysis to compare patient and control groups was performed using the limma package for R (linear regression models) as available from Bioconductor (R Core Team 2013; Ritchie et al. 2015). Systematic application of the above‐mentioned steps led to high‐quality data on differential gene expression, which was used as input for pathway and Gene Ontology analysis.

Pathway analysis

After the quality control and preprocessing of the data, we aimed to identify in which molecular pathways the up‐ and downregulated (differentially expressed) genes are present. To do this, the differential gene expression data that resulted from the statistical analysis was mapped onto known (pre‐existing) pathways. As mentioned before, pathways represent molecular processes and the genes that are involved in these processes. Therefore, if all the genes within a pathway are differentially expressed (that means up‐ or downregulated) between patients and controls, we can assume that this pathway, and the process it represents, is relevant in the disease process. Similarly, if no genes within a pathway are changed, we can assume that this pathway is not relevant. However, the reality is more complicated, since mostly not all genes within a pathway are changed and the degree of change may vary as well. Consequently, we performed a pathway overrepresentation analysis using a common cut‐off criterion for the degree of change in gene expression based on common practice and agreement. We chose relatively strict cut‐off points and considered genes with an absolute 2log fold change (logFC) in gene expression higher than 0.58 (representing a 50% change in expression) and a p‐value smaller than 0.05, as significantly changed. By performing pathway analysis, we can determine which pathways have the highest overrepresentation of significantly changed genes and are therefore likely to be involved in the pathogenesis of POAG. The overrepresentation analysis was performed in PathVisio 3.2.4 (van Iersel et al. 2008; Kutmon et al. 2015), an open‐access analysis tool. In order to recognize all the elements within the pathways, an identifier mapping database was downloaded from http://www.pathvisio.org (last updated on the 14th of April 2016, version Hs_Derby_Ensembl_85.bridge). In the results of the overrepresentation analysis, pathways with a significance score (Z‐score) ≥ 1.96, a permuted p‐value < 0.05 and ≥3 changed genes in the pathway were considered significantly changed. As mentioned before, the pathways we studied are available from free access pathway databases. However, as none of the current pathway databases contain all the existing pathways, we combined the three most commonly used pathway databases into one large collection in order to obtain a larger pathway coverage: WikiPathways (Kelder et al. 2012; Kutmon et al. 2016), KEGG (Kanehisa & Goto 2000; Kanehisa et al. 2016, 2017) and Reactome (Milacic et al. 2012; Fabregat et al. 2016). The pathways from each database were downloaded on 10th August 2017. The number of pathways and unique genes in each of these pathway databases are shown in Table 1. For WikiPathways only the curated collection was included. For KEGG, the disease pathways were excluded and only the general processes were added to the collection. Of the nearly 12 000 unique genes in the combined pathway collection, 42.2% are covered in one of the three databases only (Fig. 1). This confirms the importance of integrating the available collections. Using PathVisio, we calculated the overrepresentation score (Z‐score) in one run for all included pathways of all three databases.
Table 1

The number of pathways and unique human genes per used pathway database

Number of pathwaysNumber of unique genes
WikiPathways3935,492
Reactome47310,088
KEGG2346,595
Figure 1

Venn diagram showing the overlap in unique genes between pathway databases.

The number of pathways and unique human genes per used pathway database Venn diagram showing the overlap in unique genes between pathway databases.

Clustering of the pathway results

After the aforementioned pathway analysis, the significantly changed pathways were clustered into functional categories. These categories were based on the molecular mechanisms that were captured in the pathways. The clustering was performed by human curation after careful investigation of the results.

Network analysis

Cytoscape (http://www.cytoscape.org), an open‐access tool, was used to combine the genes involved in each of the identified functional categories into a network of connected genes (Shannon et al. 2003). It allows to vizualize how the functional categories are related to each other, to visualize how many and which mutual genes are shared between the multiple categories, to investigate these genes, and to retrieve the most relevant target genes for new drugs.

Gene ontology analysis

In addition to the pathway analysis, in which we looked at the process level, we also performed Gene Ontology (GO) analysis on the gene expression data. Gene Ontology (GO) analysis allows investigating the functions of the genes by using their structured individual annotations (Ashburner et al. 2000). We used Gorilla (http://cbl-gorilla.cs.technion.ac.il/), a freely available online tool (Eden et al. 2009). As for pathway analysis, genes with an absolute 2log fold change (logFC) in gene expression higher than 0.58 and a p‐value smaller than 0.05 were considered to be significantly changed in the GO analysis.

Clustering of the GO results

After performing GO analysis, the significant functional terms were clustered into functional categories. This was also performed by human curation after careful investigation of the results. The results obtained after performing the pathway, and GO analysis were compared with each other for agreement in functional categories.

Results

After the systematic search, the datasets of four different studies were selected for further analyses: GSE27057 and GSE27058 (Diskin et al. 2006), GSE4316 (Liton et al. 2006), and GSE27276 (Liu et al. 2013). For three out of four datasets, we contacted the authors to obtain additional information. The characteristics of the datasets are shown in Table 2. The most important features for each of the included studies are described below.
Table 2

Characteristics of the identified gene expression datasets

Diskin et al. GSE27057 and GSE27058 (2006)Liton et al. GSE4316 (2006)Liu et al. GSE27276 (2013)
Study designHTM specimens of post‐mortem donors with and without POAG

HTM specimens of post‐mortem donors with and without POAG and cultured HTM cells of post‐

mortem donors without POAG
HTM specimens of POAG cases obtained from trabeculectomy and from post‐mortem donor eyes using the same approach as a standard trabeculectomy
Passage cellsNot applicable3rd (cultured cells)Not applicable
Number of included patients with POAG5 (GSE27057 and GSE27058)2 pairs of eyes15 (from one patient both eyes were included)
Number of included control donors

5 (GSE27057)

4 (GSE27058)

5 pairs of eyes13 (from six patients both eyes were included)
Age (years)POAG and control: 66–87

POAG: 59 and 77

Control: 70, 85 and 87

POAG: 40–86

Control: 48–94

MicroarrayCustom Affymetrix Glyco v2 GeneChipsAffymetrix Human Genome U133 Plus 2.0Sentrix Human‐6 Expression BeadChip
Characteristics of the identified gene expression datasets HTM specimens of post‐mortem donors with and without POAG and cultured HTM cells of post‐mortem donors without POAG 5 (GSE27057) 4 (GSE27058) POAG: 59 and 77 Control: 70, 85 and 87 POAG: 40–86 Control: 48–94 Datasets GSE27057 and GSE27058 of Diskin et al. investigate the differences in gene expression profiles between five glaucoma cases and five and four control cases, respectively. All samples were derived from post‐mortem human donor eyes. The diagnosis and medical information were supplied by the treating ophthalmologist of the donor, using a detailed questionnaire. Dataset GSE4316 shows the differences between the gene expression profiles of three cultured TM cells and the TM tissues of three control and two POAG pairs of eyes. All samples were derived from post‐mortem human donor eyes. The POAG eyes were from donors with a documented history of POAG and use of glaucoma medication. Dataset GSE27276 shows the differences between the gene expression profiling of fifteen POAG and thirteen control cases. One of the POAG cases had a MYOC mutation. The samples of the POAG cases were obtained from routine trabeculectomy surgery. The control TM samples were obtained from post‐mortem human donor eyes. To collect these control samples, the same method was used during the trabeculectomy. In one of the POAG cases and six of the control cases, samples of both eyes were used. A technical replicate was performed for one of the cases. The detailed reports of the quality control of the datasets are added in File S1, containing Fig. S1A–D, and File S2, containing Fig. S2A–G. The most important observations concerning the quality control for each of the datasets are described below.

Excluded datasets

Diskin et al. (GSE27057 and GSE27058): The provided data were generated using custom Affymetrix Glyco v2 GeneChips. This chip contained both human and mouse probes for a specific chosen subset of genes. We were able to link the provided identifiers to the more generic Ensembl identifiers, resulting in the identification of 752 unique human genes. This is a relatively low number of genes to use as input for pathway analysis. In addition, the associated article stated that the provided samples comprised not only of POAG cases, but also normal tension glaucoma and glaucoma suspect donors. The provided metadata did not allow us to trace this back to the samples in the dataset, thereby not allowing us to identify the glaucoma samples that were diagnosed with POAG. This also could not be deduced from the results of the quality control (not shown). Based on above findings, both datasets were excluded from further analysis. Liton et al. (GSE4316): For this study, only processed data was available. We first run quality control on the raw data, not observing any deviations. However, based on the results of the quality control, we concluded that the processed data, provided by the authors of this study, were not normalized yet. Therefore, we performed quantile normalization and made additional plots for the normalized data. The plots resulting from the quality control (File S1) demonstrated that the cultured cells behave differently from the other samples. Therefore, we removed them from the dataset, resulting in the remainder of two POAG and three control cases. When investigating these tissue samples, the quality control shows a mixed clustering of POAG and control cases (File S1). Due to the remaining small sample size and their divergent behaviour, the quality of the dataset was found to be too low, despite normalization techniques. Therefore, this dataset was also excluded for further analysis.

Included dataset

Liu et al. (GSE27276): we performed preprocessing and quality control on the provided raw and normalized data. Here, we only present the results of the quality control on the normalized data. The quality control (File S2) showed that one control sample (c9.2) behaved as an outlier. This sample was therefore removed from the data, and the quality control was repeated with the remaining samples. No other divergent samples were detected, and they were therefore included for further analyses. However, to ensure homogeneity of the dataset, we removed the samples of the MYOC mutation carrier (p15.1 and p15.1r). Therefore, fourteen POAG cases and thirteen control cases remained in the dataset. Based on the results of the quality control, one dataset containing high‐quality gene expression data of fourteen POAG cases and thirteen control cases was used for pathway and GO analysis. Pathway analysis, performed with the combined pathway collection from WikiPathways, KEGG and Reactome showed 92 significantly altered pathways (Z‐score ≥ 1.96, a permuted p‐value < 0.05, and ≥3 changed genes in the pathway). The complete results of the pathway analysis are shown in Table S1. The significant pathways could be clustered into the following functional categories: extracellular matrix (ECM), inflammation, complement activation, senescence and Rho GTPase. The inflammation category could be further subcategorized in interleukin signalling, NF‐κB, arachidonic acid, and general inflammation pathways, such as the TNF signalling pathway. Subcategories for the ECM category are collagen, actin and cell–matrix, cell–cell interactions and general ECM pathways, such as the ECM organization pathway. The results of the clustering are presented in Table 3.
Table 3

Clusters resulted from the pathway analysis

Pathway namePathway databaseZ‐scorePermuted p‐valuePositive* Measured*
Extracellular matrix
ECMExtracellular matrix organizationReactome5.58<0.00011441
Degradation of the extracellular matrixReactome4.94<0.00011237
ECM‐receptor interactionKEGG4.29<0.00011243
miRNA targets in ECM and membrane receptorsWikiPathways4.210.002719
miR‐509‐3p alteration of YAP1/ECM axisWikiPathways2.720.026413
Activation of matrix metalloproteinasesReactome2.530.014414
CollagenCollagen biosynthesis and modifying enzymesReactome4.72<0.00011134
Collagen chain trimerizationReactome3.860.001721
Assembly of collagen fibrils and other multimeric structuresReactome3.840.002826
ActinRegulation of actin cytoskeletonWikiPathways2.630.0121265
Regulation of actin cytoskeletonKEGG2.360.01816101
Cell–matrix and cell–cell interactionsCell adhesion molecules (CAMs)KEGG5.45<0.00011757
Tight JunctionKEGG3.40.0011787
Focal Adhesion‐PI3K‐Akt‐mTOR‐signalling pathwayWikiPathways3.290.00324141
Focal adhesionWikiPathways3.060.00218101
Inflammation
Interleukin signallingInterleukin‐4 and 13 signallingReactome9.28<0.000137106
IL‐17 signalling pathwayKEGG4.53<0.00011346
Interleukin‐10 signallingReactome3.230.0051043
IL‐17 signalling pathwayWikiPathways2.070.038417
NF‐κBQuercetin and NF‐κB/AP‐1 Induced Cell ApoptosisWikiPathways2.920.006412
TAK1 activates NF‐κB by phosphorylation and activation of IKKs complexReactome2.760.014518
Photodynamic therapy‐induced NF‐κB survival signallingWikiPathways3.950.004616
Arachidonic acidArachidonic acid metabolismKEGG2.480.014626
Arachidonic acid metabolismReactome2.270.022628
GeneralInflammatory response pathwayWikiPathways5.06<0.0001715
Allograft rejectionWikiPathways4.050.0011140
Intestinal immune network for IgA productionKEGG3.950.0001616
Th17 cell differentiationKEGG2.880.0061154
Antigen processing and presentationKEGG2.780.015942
Th1 and Th2 cell differentiationKEGG2.470.023946
Neutrophil degranulationReactome2.310.01738294
TNF signalling pathwayKEGG2.130.027951
Complement activation
Complement activationWikiPathways4.93<0.0001612
Complement and coagulation cascadesWikiPathways4.76<0.00011029
Complement and coagulation cascadesKEGG4.48<0.00011136
Complement cascadeReactome3.860.001721
Senescence
Senescence‐Associated Secretory Phenotype (SASP)Reactome3.510.0031252
DNA damage/telomere stress induced senescenceReactome2.830.004623
Senescence and autophagy in cancerWikiPathways1.980.0331061
RHO GTPase
RHO GTPases activate ROCKsReactome2.720.021413
RHO GTPases activate PKNsReactome2.340.024521
RHO GTPases activate CITReactome2.370.019415
RHO GTPases activate PAKsReactome2.210.023416

*Positive: the number of genes on the pathway that pass the statistical criteria (absolute logFC > 0.58 and p‐value < 0.05); Measured: the number of genes on the pathway that were measured in the dataset.

Clusters resulted from the pathway analysis *Positive: the number of genes on the pathway that pass the statistical criteria (absolute logFC > 0.58 and p‐value < 0.05); Measured: the number of genes on the pathway that were measured in the dataset. We created an integrated network showing the genes involved in each of the five main categories to study the interplay between the processes. The network analysis shows that the clusters of ECM, complement activation and senescence have multiple mutual genes with the inflammation cluster, showing the central role of inflammation in the connection between clusters. In addition, 12 genes are present in three or more clusters, as displayed in Figure 2.
Figure 2

Network of the five identified pathway categories. Orange rectangles indicate the categories; blue nodes indicate the genes present in at least one pathway of the category; darker shades of blue indicate participation of the gene in more than one category; gene labels are shown for genes present in three or more categories.

Network of the five identified pathway categories. Orange rectangles indicate the categories; blue nodes indicate the genes present in at least one pathway of the category; darker shades of blue indicate participation of the gene in more than one category; gene labels are shown for genes present in three or more categories. Gene Ontology (GO) overrepresentation analysis of the differentially expressed genes resulted in fifty‐one significant GO functional terms (p‐value < 10−6). The complete tabular results of the GO analysis are shown in Table S2. The GO terms were clustered into five categories: ECM, inflammation, cell adhesion, corticosteroids and development (Table 4).
Table 4

Clusters resulting from the GO analysis

GO termp‐value

FDR

q‐value

*
Enrichment (B, b)*
Extracellular matrix
Extracellular matrix organization1.75E‐101.1E‐63.08 (155, 38)
Extracellular structure organization1.75E‐107.31E‐73.08 (155, 38)
Extracellular matrix disassembly3.75E‐61.27E‐34.30 (38, 13)
Inflammation
Defence response3.11E‐83.24E‐51.97 (427, 67)
Response to wounding4.76E‐84.25E‐53.43 (88, 24)
Response to inorganic substance7.67E‐86.39E‐52.30 (246, 45)
Response to stimulus9.65E‐87.54E‐51.33 (2236, 236)
Immune response7.25E‐73.94E‐41.99 (335, 53)
Regulation of immune system process7.59E‐73.95E‐41.66 (667, 88)
Response to oxygen‐containing compound1.38E‐66.41E‐41.69 (597, 80)
Regulation of response to stimulus1.65E‐67.35E‐41.35 (1767, 189)
Humoral immune response1.83E‐67.89E‐43.23 (78, 20)
Regulation of immune response2.54E‐69.91E‐41.81 (432, 62)
Acute inflammatory response2.94E‐61.05E‐35.13 (27, 11)
Immune system process3.42E‐61.19E‐31.49 (981, 116)
Regulation of cytokine production4.13E‐61.36E‐31.99 (291, 46)
Response to molecule of bacterial origin5.84E‐61.62E‐32.52 (135, 27)
Positive regulation of response to stimulus6.42E‐61.74E‐31.47 (983, 115)
Inflammatory response8.81E‐62.25E‐32.26 (178, 32)
Cell adhesion
Cell adhesion6.53E‐99.07E‐62.14 (359, 61)
Biological adhesion1.01E‐81.26E‐52.11 (363, 61)
Cell–cell adhesion2.98E‐71.69E‐42.58 (161, 33)
Regulation of cell adhesion8.63E‐62.25E‐31.92 (308, 47)
Corticosteroids
Response to steroid hormone4.29E‐61.34E‐32.88 (96, 22)
Response to glucocorticoid5.13E‐61.46E‐33.39 (63, 17)
Response to corticosteroid7.13E‐61.9E‐33.19 (71, 18)
Development
Developmental process4.24E‐115.3E‐71.43 (2081, 237)
Epidermis development1.91E‐105.96E‐76.11 (35, 17)
Anatomical structure development2.98E‐83.38E‐51.48 (1380, 162)
Cellular developmental process1.04E‐77.66E‐51.53 (1105, 134)
Cell differentiation1.31E‐66.29E‐41.60 (756, 96)
Tissue development2.24E‐69.02E‐42.11 (250, 42)
Regulation of developmental process9.62E‐62.4E‐31.43 (1088, 124)

*FDR q‐value: p‐value after correction for multiple testing; B: the total number of genes associated with the GO term; b: the number of genes associated with the GO term that pass the statistical criteria (absolute logFC > 0.58 and p‐value < 0.05). In total, there are 7901 number of genes in the dataset (N = 7901); 628 genes are differentially expressed (n = 628); Enrichment is defined as (b/n)/(B/N). Results with a p‐value < 10−6 were considered to be significant.

Clusters resulting from the GO analysis FDR q‐value *FDR q‐value: p‐value after correction for multiple testing; B: the total number of genes associated with the GO term; b: the number of genes associated with the GO term that pass the statistical criteria (absolute logFC > 0.58 and p‐value < 0.05). In total, there are 7901 number of genes in the dataset (N = 7901); 628 genes are differentially expressed (n = 628); Enrichment is defined as (b/n)/(B/N). Results with a p‐value < 10−6 were considered to be significant. Figure 3 shows an integrated flowchart of the methods and results.
Figure 3

Flow chart of the methods and results of this paper. The grey text represents the methods, black text represents the results.

Flow chart of the methods and results of this paper. The grey text represents the methods, black text represents the results.

Discussion

Faced with the enormous amount of data obtained by omics studies, research has been directed to determine how this data can be interpreted and used in a meaningful way. In this study, we have shown how developments in bioinformatics enable the needed integration of omics data to improve interpretation and understanding of biological processes and their contribution to human disease. Specifically, we demonstrated how pathway and gene ontology analysis generate new insights into the molecular pathways involved in the pathogenesis of POAG. We applied a systematic bioinformatics approach to perform quality control and preprocessing steps on gene expression profiling datasets in which differences in gene expression of TM cells in patients with and without POAG were compared. Although only one dataset passed our quality control, pathway overrepresentation analysis identified 92 pathways that were significantly changed in patients with POAG, as compared to patients without POAG. These pathways were clustered into five functional categories: ECM, inflammation, complement activation, senescence and Rho GTPase. These categories were combined into a network of connecting genes, showing overlap between the categories and the central position of the inflammation cluster. Also, the genes present in at least three categories were visualized within the network. The additional GO analysis on our preprocessed and quality controlled dataset showed that the significantly changed genes were involved in ECM, inflammation and cell adhesion, which we already found in the pathway analysis. Additionally, development and corticosteroid‐related clusters were found. As far as we know, this study is the first to perform a systematic, state‐of‐the‐art step‐by‐step approach on the available gene expression datasets of TM tissue from patients with POAG and controls. For each of the five identified molecular process categories, other studies provide supportive evidence of their role in the TM and their involvement in the pathogenesis of POAG. Multiple changes have been described in the ECM of TM derived from patients with POAG. For example, the stiffness of glaucomatous TM, especially the juxtacanalicular tissue, is higher and an increase in plaque‐like depositions that consist, among others, of a core of elastic fibres is seen (Rohen et al. 1981; Last et al. 2011). In addition, gene expression studies have shown differences in the expression of genes encoding ECM proteins between patients with POAG and healthy controls. These differences correlate with morphological changes that occur in the ECM of the TM, for example genes that are involved in the formation of collagen, fibronectin and integrins (Liton et al. 2006). Furthermore, TGF‐β signalling regulates the ECM turnover (Fuchshofer & Tamm 2009). TGF‐β2 was identified in our study as part of the Extracellular Matrix Organization pathway and is known to regulate the ECM metabolism in TM cells and tissues. It induces elastin and collagen cross‐linking enzymes (LOX‐genes), which are associated with the pathological ECM changes observed in glaucomatous TM (Fuchshofer & Tamm 2009; Sethi et al. 2011). Elastin, collagen and LOX‐genes are also present in the Extracellular Matrix Organization pathway we identified. Moreover, TGF‐β1 is present in thirteen of our significantly changed pathways. It is known to induce the expression of smooth muscle actin in cultured TM cells and therefore influences the TM actin cytoskeleton (Tamm et al. 1996). Interestingly, corticosteroids, which we identified as a functional category in our GO analysis, also induce changes in the cytoskeleton, better known as the formation of cross‐linked actin networks (CLANs) (Clark et al. 1994). Concerning the inflammation category, inflammatory mediators, such as TGF‐β1, TNF‐α and interleukins such as IL‐6, can induce changes in the ECM and the cytoskeleton of the TM. This implies that an inflammatory response is also involved in the pathogeneses of POAG, which was also confirmed by other studies (Taurone et al. 2015). Moreover, interleukins, for example IL‐10, IL‐8 and TGF‐β1, are upregulated in the aqueous humour of patients with POAG (Micera et al. 2016). In addition, arachidonic acid, which we also found in our pathway analysis, is a precursor of prostaglandins. Prostaglandin analogues are currently used for the treatment of POAG and affect the outflow in the TM (Toris et al. 2008). Furthermore, NF‐κB, another inflammatory component, is known to be activated in the TM cells in glaucoma where it protects against oxidative stress (Wang et al. 2001). The category Rho GTPase is of special interest. Downstream targets of Rho GTPase are Rho/Rho‐associated kinases (ROCKs), including RHOA. Rho/Rho‐associated kinases (ROCKs) plays an important role in focal adhesions, actin stress fibres and cell adhesion (Wang et al. 2013). Therefore, they are likely to play an important role in the function of the TM and in the pathogenesis of glaucoma (Wang et al. 2013; Rao et al. 2017). Moreover, ROCK inhibitors are currently being tested to lower the intra‐ocular pressure in glaucoma (Inoue & Tanihara 2013; Rao et al. 2017). Finally, the category senescence is shown to be involved in POAG. Ageing is a known risk factor (de Voogd et al. 2005). With ageing, multiple changes in the TM have been described, for example an increase in TM stiffness (Morgan et al. 2015; Liu et al. 2017). Additionally, Senescence‐Associated Secretory Phenotype (SASP) genes play a key role as their expression activates multiple inflammation factors such as TNF‐α, and, among others, IL‐1/NF‐κB and TGF‐β/SMAD signalling pathways (Hubackova et al. 2012). There is evidence that ageing tissue is chronically inflamed, which suggests there is an interplay between senescence and inflammation (Young & Narita 2009). We created a network of the five molecular process categories that we have identified to be involved in the pathogenesis of POAG. This network does not only show an interplay between the senescence and inflammation cluster, as suggested above, but also shows that ECM and complement activation clusters share multiple genes with the inflammation cluster. This points out the central role of inflammation in the connection between clusters (Fig. 2). Furthermore, within the network, the gene symbols of the genes that are present in at least three of the categories are shown. More than 50% of these genes have been associated with POAG. TGF‐β1 is highly expressed in glaucomatous TM cells and can be induced by mechanical stress (Liton et al. 2005; Fatma et al. 2009). Furthermore, the exposure of the TM to TGF‐β1 or TGF‐β2 increases the expression of fibronectin (FN1) (Fatma et al. 2009). CD44 is increased in the aqueous humour of patients with POAG (Mokbel et al. 2010). COL3A1 and COL1A1 are differentially expressed after exposure to oxidative stress, which is known to affect the TM in patients with POAG (Izzotti et al. 2006; Luna et al. 2009). Further, the allele frequencies of SERPINE1 are significantly different in patients with POAG compared to a control group (Zhou & Liu 2010). Lastly, PLAU is differentially expressed after exposure to dexamethasone or triamcinolone and it is known that patients with POAG are more susceptible to develop corticosteroid‐induced ocular hypertension (Tripathi et al. 1999; Fan et al. 2008). The association of the above‐mentioned genes with POAG suggests a role for the other identified genes as well. In addition to the pathway analysis, we performed GO analysis. Gene Ontology (GO) annotations are based on the function of the differentially expressed genes. They do not have the level of detail of a known pathway diagram, but may add information where such detailed information is still lacking. ECM, inflammation and cell adhesion were found in this analysis as well. Complementary, we identified development and corticosteroid‐related clusters in our analysis. Studies have also shown that corticosteroids cause molecular changes in TM that increase the outflow resistance (Jones & Rhee 2006; Kersey & Broadway 2006). The findings described in the publication of Liu et al. are of special interest as we used the dataset they made publically available. They investigated the functional distribution of the DEGs, using the Database for Annotation, Visualization and Integrated Discovery (DAVID) and showed that among others, cell adhesion, ECM, extracellular region, signal peptide, glycoprotein and secretion are involved in the pathogenesis of POAG (Liu et al. 2013). The GO analysis we performed in our study showed similar results. However, there are some discrepancies which might partially be explained by differences in defining the cut‐off for DEGs and the significance level of the GO terms. The recently published study of Qiu et al. (2017) used the same publicly available dataset as we used for pathway and GO analysis. However, they did not describe quality checks and did not remove the patient with a MYOC mutation or the deviating control sample, from the database. In addition, they only used the KEGG database to perform pathway analysis. Our results add additional processes that have been confirmed by previous studies: ECM, inflammation, Rho GTPase and senescence, including each of the subcategories for the ECM and inflammation categories. Information on these processes might not be completely present in the KEGG database or they may be embedded in larger less specific pathways. The result of our study emphasizes the importance of performing a systematic step‐by‐step approach using multiple bioinformatics techniques and the benefits of combining knowledge from multiple pathway databases, in order to include the greatest possible coverage of knowledge and to obtain the most comprehensive results. The final study to discuss is the study of Danford et al. (2017) which identified all genes that have a confirmed link with POAG through transcriptomics, genetic association, functional and genome‐wide association studies. After identification, the genes were classified according to their associated ocular tissue, including the TM. The study of Fan et al. investigated the differences in gene expression between TM cells treated with and without steroid and was excluded in the analysis in order to avoid study bias. The pathway analysis Danford et al. performed on their 64 identified TM tissue‐associated genes revealed the TGF‐β signalling pathway, ECM organization, and senescence and autophagy in cancer as the most overrepresented pathways in the TM. As discussed before, the ECM organization and senescence pathways were also identified in our study and TGF‐β is involved in thirteen of our significantly changed pathways. One of the strengths of our study is that we integrated the pathway databases of WikiPathways, KEGG and Reactome into one collection. None of these databases captures the complete gene and pathway knowledge: 42.2% of the genes in this combined pathway collection are only covered in one of the three databases. This shows the importance of integrating the available databases. We chose WikiPathways, Reactome and KEGG as their pathways are of high quality, they all belong to the most extensive and commonly used pathway databases, and they provide their data in a machine‐readable format. Furthermore, we performed an extensive quality check on the retrieved publicly available datasets to avoid that divergent samples or an overall insufficient quality would affect the results of our following analyses. The fact that only one dataset could be maintained, stresses the importance of and the need for extensive and systematic quality control, which is often missing when performing analysis on publicly available data. In short, the purpose of this study was to systematically analyse available gene expression data of the TM in patients with POAG and controls. The analyses performed in this study provide a comprehensive overview of the processes involved in the molecular pathogenesis of POAG. These results show good face validity and confirm findings from histological, biochemical, genome‐wide association and transcriptomics studies. The appearance of already known points of action for drugs such as ROCK inhibitors, prostaglandins and corticosteroids in our results supports the validity of this approach and stresses its potential in the identification of new treatment options. This study shows how findings from decades of biochemical and histological studies of POAG can be efficiently retrieved by applying state‐of‐the‐art bioinformatics to genome‐wide data, clearly indicating the power and promise of this approach. Table S1. Complete results of the pathway analysis. Click here for additional data file. Table S2. Complete results of the GO analysis. Click here for additional data file. File S1. Quality control of dataset GSE4316 (Liton et al.). Figure S1. Results of the quality control on the normalized data of dataset GSE4316, visualized in multiple plots. Click here for additional data file. File S2. Quality control of dataset GSE27276 (Liu et al.). Figure S2. Results of the quality control on the normalized data of dataset GSE27276, visualized in multiple plots. Figure S3. Histogram of the distribution of the average log‐scaled expression of the probes. Click here for additional data file.
  53 in total

1.  Erythropoietin and soluble CD44 levels in patients with primary open-angle glaucoma.

Authors:  Tharwart H Mokbel; Asaad A Ghanem; Hanem Kishk; Lamiaa F Arafa; Azza A El-Baiomy
Journal:  Clin Exp Ophthalmol       Date:  2010-04-29       Impact factor: 4.207

Review 2.  The role of oxidative stress in glaucoma.

Authors:  Alberto Izzotti; Alessandro Bagnis; Sergio C Saccà
Journal:  Mutat Res       Date:  2006-01-18       Impact factor: 2.433

Review 3.  Corticosteroids and glaucoma risk.

Authors:  R C Tripathi; S K Parapuram; B J Tripathi; Y Zhong; K V Chalam
Journal:  Drugs Aging       Date:  1999-12       Impact factor: 3.923

Review 4.  Characterizing the "POAGome": A bioinformatics-driven approach to primary open-angle glaucoma.

Authors:  Ian D Danford; Lana D Verkuil; Daniel J Choi; David W Collins; Harini V Gudiseva; Katherine E Uyhazi; Marisa K Lau; Levi N Kanu; Gregory R Grant; Venkata R M Chavali; Joan M O'Brien
Journal:  Prog Retin Eye Res       Date:  2017-02-20       Impact factor: 21.198

5.  Induction of TGF-beta1 in the trabecular meshwork under cyclic mechanical stress.

Authors:  P B Liton; X Liu; P Challa; D L Epstein; P Gonzalez
Journal:  J Cell Physiol       Date:  2005-12       Impact factor: 6.384

Review 6.  Modulation of extracellular matrix turnover in the trabecular meshwork.

Authors:  Rudolf Fuchshofer; Ernst R Tamm
Journal:  Exp Eye Res       Date:  2009-04       Impact factor: 3.467

7.  Gene expression profiles of human trabecular meshwork cells induced by triamcinolone and dexamethasone.

Authors:  Bao Jian Fan; Dan Yi Wang; Clement Chee Yung Tham; Dennis Shun Chiu Lam; Chi Pui Pang
Journal:  Invest Ophthalmol Vis Sci       Date:  2008-05       Impact factor: 4.799

8.  Relationship between intraocular pressure and primary open angle glaucoma among white and black Americans. The Baltimore Eye Survey.

Authors:  A Sommer; J M Tielsch; J Katz; H A Quigley; J D Gottsch; J Javitt; K Singh
Journal:  Arch Ophthalmol       Date:  1991-08

9.  Increased expression of serum amyloid A in glaucoma and its effect on intraocular pressure.

Authors:  Wan-Heng Wang; Loretta G McNatt; Iok-Hou Pang; Peggy E Hellberg; John H Fingert; Mitchell D McCartney; Abbot F Clark
Journal:  Invest Ophthalmol Vis Sci       Date:  2008-01-25       Impact factor: 4.799

10.  GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists.

Authors:  Eran Eden; Roy Navon; Israel Steinfeld; Doron Lipson; Zohar Yakhini
Journal:  BMC Bioinformatics       Date:  2009-02-03       Impact factor: 3.169

View more
  4 in total

1.  Stem cell transplantation rescued a primary open-angle glaucoma mouse model.

Authors:  Siqi Xiong; Ajay Kumar; Shenghe Tian; Eman E Taher; Enzhi Yang; Paul R Kinchington; Xiaobo Xia; Yiqin Du
Journal:  Elife       Date:  2021-01-28       Impact factor: 8.140

2.  Investigation of Key Signaling Pathways Associating miR-204 and Common Retinopathies.

Authors:  Ahmad Bereimipour; Leila Satarian; Sara Taleahmad
Journal:  Biomed Res Int       Date:  2021-10-04       Impact factor: 3.411

3.  ANGPTL7 is transcriptionally regulated by SP1 and modulates glucocorticoid-induced cross-linked actin networks in trabecular meshwork cells via the RhoA/ROCK pathway.

Authors:  Mengsha Sun; Wenjia Liu; Minwen Zhou
Journal:  Cell Death Discov       Date:  2022-02-08

4.  Screening of primary open-angle glaucoma diagnostic markers based on immune-related genes and immune infiltration.

Authors:  Lingge Suo; Wanwei Dai; Xuejiao Qin; Guanlin Li; Di Zhang; Tian Cheng; Taikang Yao; Chun Zhang
Journal:  BMC Genom Data       Date:  2022-08-24
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.