| Literature DB >> 28330499 |
Jaclyn N Taroni1, Casey S Greene2, Viktor Martyanov1, Tammara A Wood1, Romy B Christmann3, Harrison W Farber4, Robert A Lafyatis3,5, Christopher P Denton6, Monique E Hinchcliff7, Patricia A Pioli8, J Matthew Mahoney9, Michael L Whitfield10.
Abstract
BACKGROUND: Systemic sclerosis (SSc) is a multi-organ autoimmune disease characterized by skin fibrosis. Internal organ involvement is heterogeneous. It is unknown whether disease mechanisms are common across all involved affected tissues or if each manifestation has a distinct underlying pathology.Entities:
Keywords: Functional genomics; Lung disease; Macrophage; Scleroderma; Systemic sclerosis
Mesh:
Year: 2017 PMID: 28330499 PMCID: PMC5363043 DOI: 10.1186/s13073-017-0417-1
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Datasets included in this study
| Dataset label | Tissue | Phenotypes of interest | References | GEO accession |
|---|---|---|---|---|
| Milano | Diffuse skin | Inflammatory subset, proliferative subset | Milano et al. [ | GSE9285 |
| Pendergrass | Diffuse skin | Inflammatory subset, proliferative subset | Pendergrass et al. [ | GSE32413 |
| Hinchcliff | Diffuse skin | Inflammatory subset, proliferative subset | Hinchcliff et al. [ | GSE45485, GSE59785 |
| LSSc | Limited skin | NA | Present study | GSE76806 |
| UCL | Limited skin | NA | Present study | GSE76807 |
| Christmann | Lung | SSc-PF | Christmann et al. [ | GSE76808 |
| Bostwick | Lung | SSc-PF, IPF, IPAH, SSc-PAH | Hsu et al. [ | GSE48149 |
| ESO | Esophagus | Inflammatory subset, proliferative subset, SSc-PAH | Taroni et al. [ | GSE68698 |
| PBMC | PBMC | SSc-PAH | Pendergrass et al. [ | GSE19617 |
| Risbano | PBMC | IPAH, SSc-PAH | Risbano et al. [ | GSE22356 |
Abbreviations: ESO Esophagus, GEO Gene Expression Omnibus, IPAH idiopathic pulmonary arterial hypertension, IPF idiopathic pulmonary fibrosis, PAH pulmonary arterial hypertension, PBMC peripheral blood mononuclear cells, PF pulmonary fibrosis, NA not available
Bonferroni-corrected p values, Fisher’s exact test pathophenotype-associated modules in top-level communities in the module overlap graph
| Top-level community | “In SSc” | “In inflammatory” | “In proliferative” | “In PAH” | “In PF” |
|---|---|---|---|---|---|
| 1 | 1 | 0.02 | 1 | 1 | 1 |
| 2 | 0.71 | 0.07 | 1 | 1 | 1 |
| 3 | 0.09 | 0.27 | 1 | 0.77 | 0.29 |
| 4 | 8.56E-07 | 6.30E-12 | 1 | 0.30 | 1 |
| 5 | 1 | 1 | 0.03 | 1 | 1 |
| 6 | 1 | 1 | 1 | 1 | 1 |
| 7 | 1 | 0.64 | 1 | 0.03 | 1 |
| 8 | 1 | 1 | 1 | 1 | 1 |
Fig. 1Schematic overview of the analysis pipeline. Four datasets are shown for simplicity. Each gene expression dataset was partitioned using WGCNA independently to obtain coexpression modules. Module eigengenes were tested for their differential expression in pathophenotypes of interest. Modules were compared across datasets using MICC to form the “module overlap graph” and community detection algorithms were used to identify communities and sub-communities in the graph. These communities correspond to molecular processes that are conserved across datasets. Each community was examined for enrichment of pathophenotype-associated modules and edge overlap with canonical biological pathways. Gene sets derived from these communities were used to query GIANT functional genomic networks. The resulting networks allow for tissue-specific interrogations of the gene sets. Differential network analysis was performed to compare the lung and skin networks
Number of microarrays and WGCNA coexpression modules in each of the datasets included in this study
| Dataset | Number of arrays | Number of coexpression modules |
|---|---|---|
| Milano | 75 | 39 |
| Pendergrass | 89 | 38 |
| Hinchcliff | 165 | 62 |
| LSSc | 24 | 39 |
| UCL | 15 | 98 |
| Christmann | 18 | 56 |
| Bostwick | 62 | 54 |
| ESO | 33 | 71 |
| PBMC | 54 | 38 |
| Risbano | 38 | 54 |
Fig. 2The multi-tissue module overlap graph demonstrates that severe pathophenotypes have similar underlying expression patterns. a The full adjacency matrix of the module overlap graph sorted to reveal hierarchical community structure. A darker cell color is indicative of a higher W score or larger edge weight. Communities (numbered) and sub-communities (lettered) are indicated by the annotation tracks above and on the right side of the matrix, respectively. Coexpression modules with expression that is increased in a phenotype of interest are marked by the annotation bar on the left side of the matrix. If a module was up in SSc as well as another pathophenotype of interest, the other pathophenotype color is displayed. b The adjacency matrix of sub-communities 4A and 4B indicates that these clusters contain modules that are up in all pathophenotypes of interest and show that there are many edges between the two sub-communities. Sub-community 4A contains modules from all tissues whereas 4B contains mostly solid tissue modules as indicated by the tissue annotation track to the left of the matrix
Selected pathways that are similar to overlapping coexpression patterns in consensus clusters in the information graph
| Consensus cluster | Summary of selected pathways |
|---|---|
| 1A | DNA repair |
| 2 | Cell–cell junction organization |
| 3A | Endocytosis |
| 4A | T cytotoxic and helper pathway |
| 4B | ECM receptor interaction |
| 5 | G2 M checkpoint |
| 6 | Notch signaling |
| 7 | Steroid biosynthesis |
| 8 | Keratin metabolism |
We calculated the Jaccard similarity index between edges in the information graph and canonical pathways and used a Mann–Whitney U test to assess whether a particular pathway was more similar to edges within a consensus cluster than outside the consensus cluster
Fig. 3Genes that are overexpressed in late and early SSc-PF are distributed throughout the lung network. a The lung network shows functional connections between inflammatory and fibrotic processes. Genes in the largest connected component were clustered into functional modules using community detection. Biological processes associated with the functional modules are in boxes next to the modules. Genes are colored by whether they are overexpressed in late SSc-PF/UIP (red), early SSc-PF/NSIP (blue), both (SSc-PF, purple), or neither (grey). NSIP non-specific interstitial pneumonia, UIP usual interstitial pneumonia. Gene symbols in bold have putative SSc risk polymorphisms. Node (gene) size is determined by degree (number of functional interactions) and edge width is determined by the weight (probability of interaction between pairs of genes). The layout is determined by community membership, the strength of connections between communities, and finally the interactions between individual genes in the network. A fully labeled network is supplied as Additional file 30: Figure S3 and is intended to be viewed digitally. b Quantification of differentially expressed genes in each of the five largest functional modules. c–e Hubs of the consensus lung network; only the first neighbors of the hub that are in the same functional module are shown. c LAMC1 is a hub of the response to TGF-beta module. d NPC2 is a hub of the ECM disassembly, wound healing module. e TNFAIP3 is a hub of the innate immune response, NF-κB signaling, and apoptotic processes module. f Bridges of the consensus lung network. First neighbors of PLAUR, CD44, TNFSF10, and TGFBI are shown
Selected genes in the consensus lung network
| Functional module | Gene symbol | Description | Network position | Up in | Function/potential role in disease |
|---|---|---|---|---|---|
| Cell cycle |
| BUB3 mitotic checkpoint protein | - | Early SSc-PF/NSIP | Encodes a mitotic cell cycle checkpoint protein that regulates the onset of anaphase |
|
| Cell division cycle 7 | - | - | Regulates MCM complex | |
|
| Minichromosome maintenance complex component 3 | - | Early SSc-PF/NSIP | Subunit of minichromosome maintenance (MCM) complex | |
|
| MutS homolog 6 | - | Early SSc-PF/NSIP | Participates in DNA mismatch repair. | |
| ECM disassembly/wound healing |
| CD44 molecule (Indian blood group) | Bridge | - | A hyaluronic acid receptor that can interact with many other ligands found in the ECM. Primary idiopathic PF fibroblasts exhibit an invasive phenotype that was abrogated with treatment with anti-CD44 [ |
|
| CD63 molecule | - | - | Has been observed to interact with TIMP1 [ | |
|
| Cathepsin B | - | - | Regulates NPC2 secretion, TNF-alpha production, and cholesterol trafficking genes in an animal model of obesity [ | |
|
| Cathepsin L | - | - | Regulates NPC2 secretion, TNF-alpha production, and cholesterol trafficking genes in an animal model of obesity [ | |
|
| Galactosidase, beta 1 | - | Early SSc-PF/NSIP | Mutations in this gene can lead to GM1-gangliosidosis, a manifestation of which includes foam cell accumulation in the lungs [ | |
|
| Niemann-Pick disease, type C2 | Hub | Early SSc-PF/NSIP | Mutations in this gene result in a lipid storage disorder. Functions in the regulation of cholesterol trafficking through the lysosome by binding to cholesterol released from low density lipoproteins taken up by cells | |
|
| Transforming growth factor, beta-induced | Bridge | Late SSc-PF/UIP | Induced by phagocytosis of apoptotic debris in monocyte-derived MØs and regulates collagen turnover [ | |
|
| TIMP metallopeptidase inhibitor 1 | - | Early SSc-PF/NSIP | Has been observed to interact with CD63 and overexpression has been noted to inhibit apoptosis in a CD63-dependent manner [ | |
| Innate immune response/NFkB signaling/apoptotic process |
| Baculoviral IAP repeat-containing protein 3 | - | Late SSc-PF/UIP | Has antiapoptotic activity through interactions with caspases as well as the TNF superfamily members TRAF1 and TRAF2 [ |
|
| Cysteine-rich, angiogenic inducer, 61 | Late SSc-PF/UIP | Also known as CCN1. Implicated in apoptosis in fibroblasts [ | ||
|
| Dual specificity phosphatase 6 | - | Late SSc-PF/UIP | Plays a role in the positive regulation of apoptosis [ | |
|
| Fas cell surface death receptor | - | Early SSc-PF/NSIP | Cell surface death receptor | |
|
| Nuclear factor of kappa light polypeptide gene enhancer in B-cells inhibitor, epsilon | - | - | Negative regulator of NFkB signaling | |
|
| Plasminogen activator, urokinase receptor | Bridge | Late SSc-PF/UIP | Also known as uPAR. Contains an SSc risk SNP. Pulmonary fibroblasts from patients with idiopathic PF over express uPAR and that uPAR ligation results in a hypermotile phenotype [ | |
|
| Phospholipid scramblase 1 | - | - | Regulates phospholipid membrane asymmetry | |
|
| Tumor necrosis factor, alpha-induced protein 3 | Hub | Also known as A20. Contains an SSc risk SNP (also associated with other autoimmune conditions). Negative regulator of NFkB signaling | ||
|
| Tumor necrosis factor (ligand) superfamily, member 10 | Bridge | - | Also known as TRAIL. Elevated in serum of SSc patients [ | |
|
| Tumor necrosis factor receptor superfamily, member 10b | - | Late SSc-PF/UIP | Also known as TRAILR2 | |
| IFN/antigen presentation |
| Major histocompatibility complex, class I, E | - | - | Class I MHC molecule |
|
| Major histocompatibility complex, class I, F | - | - | Class I MHC molecule | |
|
| IFN induced transmembrane protein 1 | - | SSc-PF (UIP and NSIP) | IFN signaling | |
|
| IFN induced transmembrane protein 2 | - | Early SSc-PF/NSIP | IFN signaling | |
|
| IFN induced transmembrane protein 3 | - | Early SSc-PF/NSIP | IFN signaling | |
|
| IFN regulatory factor 1 | - | Late SSc-PF/UIP | Activator of type I IFN signaling | |
|
| 2′-5′-Oligoadenylate synthetase 1, 40/46 kDa | - | Early SSc-PF/NSIP | Involved in innate immune response to viral infection | |
| Response to TGF-beta |
| Caveolin 1 | - | - | Contains an SSc risk SNP |
|
| Connective tissue growth factor | - | - | Also known as CCN2. Has been shown to play a role in Fas-mediated and TRAIL-induced apoptosis [ | |
|
| Dab, mitogen-responsive phosphoprotein, homolog 2 (Drosophila) | - | SSc-PF (UIP and NSIP) | Required for the epithelial to mesenchymal transition induced by TGF-beta in mouse and for type II TGFbR recycling [ | |
|
| Fibronectin 1 | - | - | Extracellular matrix protein. | |
|
| Laminin gamma1 chain | Hub | Early SSc-PF/NSIP | Expression of this gene is essential for the development of basement membranes [ | |
|
| Thrombospondin 1 | - | - | Mediates cell-to-cell and cell-to-matrix interactions. Putative biomarker of modified Rodnan skin score [ |
Fig. 4The lung and skin network structures indicate distinct tissue microenvironments influence fibrosis. The skin and lung networks were compared by first finding the giant component of the lung network and then collapsing to nodes only found in both the skin and lung networks (which are termed the common skin and common lung networks). a A scatterplot of high probability edges (>0.5 in both networks) illustrates that pairs of genes with a higher probability of interacting in skin than lung exist and vice versa. Edges are colored red if the weight (probability) is 1.25 times higher in lung or blue if it is 1.25 times higher in skin. b The differential adjacency matrix where a cell is colored if the edge weight in a given tissue is over and above the weight in the global average and tissue comparator networks. For instance, a cell is red if the edge weight was positive following the successive subtraction of the global average weight and skin weight. Community detection was performed on the common lung network to identify functional modules; common functional modules largely recapitulate modules from the full lung network. Representative processes that modules are annotated to are above the adjacency matrix. The annotation track indicates a gene’s functional module membership. Nodes (genes) are ordered within their community by common lung within community degree. A fully labeled heatmap is supplied as Additional file 30: Figure S4 and is intended to be viewed digitally. c Quantification of tissue-specific interactions in each of the five largest functional modules. d The lung-resident MØ module found in the differential lung network (consists only of edges in red in b)
Fig. 5Evidence for alternative activation of MØs in SSc-PF lung that is distinct from SSc skin. a Genes identified by differential network analysis and inferred to be indicative of lung-resident MØs are correlated with canonical markers of alternatively activated MØs such as CCL18 and CD163 in the Christmann dataset. b Summarized expression values (mean standardized expression value) of gene sets (coexpression modules) upregulated in various MØ states from the Christmann and Hinchcliff datasets: module CL1, classic activation (IFN-γ); modules ALT 1 and 2, alternative activation (IL-4, IL-13); modules FFA 1, 2, and 3, treatment with free fatty acids. FFA free fatty acid. Modules from [34]. Asterisks (*) indicate significant differences (p < 0.05)
Fig. 6Overview of SSc-PF disease processes. a Network-centric overview. b Cell type-centric overview