| Literature DB >> 30112021 |
Wei Liu1, Wei Tu2, Li Li3, Yingfu Liu4, Shaobo Wang1, Ling Li1, Huan Tao1, Huaqin He1.
Abstract
The Connectivity Map (CMap) is a tool that has been extensively utilized to study drug repositioning and side-effect prediction. However, most of these analyses rely on signature genes, ignoring the pathways by which those genes are regulated, as well as the functional overlap of redundant genes. The present study utilized a systems biology approach referred to as Weighted Gene Co-expression Network Analysis (WGCNA) to dissect the transcriptional profiles of CMap and reveal these hidden factors. Seven common modules associated with protein binding, extracellular matrix organization and translation were identified. Furthermore, drugs were clustered based on module expression to infer their mechanism of action (MoA) based on common activity profiles. As an extension of this, an example of disease-based module projection to identify novel drugs was provided. The analysis developed in the present study may provide a novel framework for drug repositioning or discovering MoAs.Keywords: Connectivity Map; drug repositioning; mechanism of action; weighted gene co-expression network analysis
Year: 2018 PMID: 30112021 PMCID: PMC6090433 DOI: 10.3892/etm.2018.6275
Source DB: PubMed Journal: Exp Ther Med ISSN: 1792-0981 Impact factor: 2.447
Figure 1.Seven gene co-expression modules were identified in a co-expression network for Connectivity Map. (A) Scale free topology fit (R2, y-axis) as a function of different powers (powers are numbers in red). The power of six was selected to construct the scale-free network using 0.8 as the cutoff. (B) Gene dendrograms displaying the co-expression modules were identified by Weighted Gene Co-expression Network Analysis and labeled by different colors. The seven module colors are blue, black, brown, white, green, grey60 and turquoise.
Figure 2.The identified modules were tested for their stability and significance. (A) Module stability was tested by sampling a random half of all samples 1,000 times and connectivity was correlated and expressed as the mean ± standard deviation. (B) Module significance was also calculated by the Module significance function in WGCNA. Gene significance is the correlation of a gene expression profile with a sample trait. Module significance is determined as the average absolute gene significance measured for all genes in a given module. *P<0.05 denotes the significant association of the modules with the drug concentration.
GO annotation of the 7 gene co-expression modules identified in Connectivity Map.
| GO term (Benjamini-adjusted P-value) | ||||
|---|---|---|---|---|
| Module (number of probes) | Biological process | Cellular component | Molecular function | Chromosome (Benjamini-adjusted P-value) |
| Blue (4338) | Cell-cell adhesion (6.29×10−7) | Nucleoplasm (4.29×10−28) | Protein binding (3.09×10−29) | 16 (2.9×10−44) |
| Viral process (6.19×10−6) | Cytosol (3.89×10−17) | Cadherin binding involved in cell-cell adhesion (3.09×10−29) | ||
| Black (1407) | Cell-cell adhesion (3.59×10−5) | Cytosol (4.09×10−20) | Protein binding (1.69×10−14) | 20 (9.8×10−10) |
| Brown (3583) | Extracellular matrix organization (1.69×10−8) | Cytosol (3.19×10−19) | Protein binding (1.29×10−20) | 20 (1.1×10−3) |
| Angiogenesis (1.99×10−7) | Extracellular exosome (7.29×10−17) | Integrin binding (7.59×10−8) | ||
| White (575) | mRNA splicing, via spliceosome (4.19×10−6) | Nucleoplasm (2.89×10−20) | Protein binding (2.39×10−15) | |
| Termination of RNA polymerase | Nucleus (1.19×10−11) | Poly(A) RNA binding (6.29×10−11) | 16 (8.4×10−4) | |
| II transcription (3.39×10−4) | ||||
| Green (3324) | Membrane (4.59×10−10) | Protein binding (8.29×10−15) | 1 (3.3×10−5) | |
| Cytosol (2.89×10−8) | ATP binding (5.09×10−6) | |||
| Grey60 (124) | SRP-dependent cotranslational protein | Ribosome (2.29×10−51) | 3 (2.9×10−5) | |
| targeting to membrane (3.59×10−62) | Cytosolic large ribosomal | |||
| Translational initiation (1.5×10−60) | subunit (9.19×10−35) | |||
| Turquoise (8864) | Signal transduction (5.59×10−19) | Integral component of plasma | Protein binding (4.59×10−20) | 19 (1.5×10−5) |
| Immune response (7.29×10−16) | membrane (1.79×10−33) | Receptor activity (3.19×10−10) | ||
| Extracellular space (4.39×10−15) | ||||
GO, Gene Ontology; SRP, signal recognition particle.
Figure 3.Connectivity Map data were hierarchically clustered based on module eigengene values. Distinct clusters corresponding to three cell lines are displayed. Two clusters, C1 (red bar) from MCF7 and C2 (black bar) from PC3, were further analyzed with the Drug-Set Enrichment Analysis tool.
The top-ranking Gene Ontology Biological Processes for clusters C1 and C2 determined with the DSEA tool.
| Cluster | Rank | Pathway name | Escore | P-value |
|---|---|---|---|---|
| C1 | 1 | Programmed cell death | 0.39 | 3.42×10−6 |
| 2 | Positive regulation of transcription elongation from RNA polymerase II promoter | 0.39 | 4.77×10−6 | |
| 3 | Viral life cycle | −0.38 | 5.65×10−6 | |
| C2 | 1 | Proton transport | 0.33 | 6.56×10−5 |
| 2 | Pathogen-associated molecular pattern dependent induction by symbiont of host innate immune response | 0.32 | 1.21×10−4 | |
| 3 | Negative regulation of mesenchymal cell apoptotic process | 0.32 | 1.45×10−4 |
DSEA, Drug-Set Enrichment Analysis; EScore, enrichment score.
Figure 4.The rationale for disease-targeted drug repositioning was proposed. The first step is to construct gene co-expression networks for diseases of interest and to identify co-expressed modules. The second step is to project the Connectivity Map data onto these modules and calculate eigengene modules. Finally, these drugs are ranked and drug repositioning is inferred. WGCNA, Weighted Gene Co-expression Network Analysis; NASH, nonalcoholic steatohepatitis.