| Literature DB >> 22646858 |
Suleiman A Khan1, Ali Faisal, John Patrick Mpindi, Juuso A Parkkinen, Tuomo Kalliokoski, Antti Poso, Olli P Kallioniemi, Krister Wennerberg, Samuel Kaski.
Abstract
BACKGROUND: Detailed and systematic understanding of the biological effects of millions of available compounds on living cells is a significant challenge. As most compounds impact multiple targets and pathways, traditional methods for analyzing structure-function relationships are not comprehensive enough. Therefore more advanced integrative models are needed for predicting biological effects elicited by specific chemical features. As a step towards creating such computational links we developed a data-driven chemical systems biology approach to comprehensively study the relationship of 76 structural 3D-descriptors (VolSurf, chemical space) of 1159 drugs with the microarray gene expression responses (biological space) they elicited in three cancer cell lines. The analysis covering 11350 genes was based on data from the Connectivity Map. We decomposed the biological response profiles into components, each linked to a characteristic chemical descriptor profile.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22646858 PMCID: PMC3532323 DOI: 10.1186/1471-2105-13-112
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Data-driven search for statistical relationships between Chemical space (formed of VolSurf features) and Drug response space (gene expression).
Figure 2Quantitative validation of functional similarity of drug components. The figure shows the mean average precision for retrieving functionally similar chemicals as a function of the number of top chemicals considered. Results are shown for three representations: CCA (red), Chemical space (green), and Biological space (GSEA: blue, Gene expression: grey). Error bars show one standard error of the mean precision.
Figure 3Relationships decomposed into components. “Eye diagram” showing the top 10 significant CCA components ordered by correlation from top to bottom (middle), VolSurf descriptors (left), and top gene sets (right). The CCA components are shown as circles, with numbers indicating the decreasing order of canonical correlation and letters A and B indicating subcomponents (A: positive canonical score, B: negative canonical score). The widths of the curves from the components to VolSurf descriptors and gene sets indicate the strength of the corresponding associations. For VolSurf descriptors the subcomponent-specific activity is shown, whereas for the gene sets we show the overall activity in the component. For an example compound, VolSurf fields are illustrated in the top-left corner while three gene sets are listed along with their five most significant genes in the top right corner.
Summarized interpretation of top 10 components. Group A and B are the subcomponents of Figure 3
| 1 | Classic growth factor signaling: (MAP and protein kinase signaling) | Sulfonamides, antibiotics, carbonic anhydrase inhibitors | Antipsychotic and antihistaminic compounds | High lipophilicity |
| 2 | DNA damage | Contrast agents, antibiotics, | DNA damaging agents, antimetabolites | Strong lipophilic areas emphasized |
| 3 | Stress response, mitochondrial and anabolic metabolism | DNA damaging agents | GPCR antagonists, ion channel blockers | Polar interactions enriched |
| 4 | Cytoskeleton, cell adhesion and migration | GPCR liganda, macrocyclic cmpds and contrast agents | Beta adrenergic agonists, other GPCR ligands | N/A |
| 5 | Differentiation, EMT, stemness | NSAIDS, cAMP signaling promoting compounds | HDAC Inhibitors, HDAC-like | Significantly enriched with pharmacophoric features* |
| 6 | Inflammatory and differentiation signaling | N/A | Protein synthesis inhibitors, anti-diabetics, cardiac glycosides | Pharmacophoric features* |
| 7 | GPCR and cytokine signaling | N/A | Cardiac glycosides, cephalosporins | Pharmacophoric features* |
| 8 | Growth factor and cell adhesion signaling | Cardiac glycosides | β-adrenergic agonists, Ca2+ channel blockers | Integy-moment and significant pharmacophoric enriched* |
| 9 | Amino acid and nitrogen metabolism | Protein synthesis inhibitors | Anti-diabetics | Integy-moment and significant pharmacophoric enriched* |
| 10 | Cancer signaling | DNA damaging agents | Corticosteroids, ionophores | Size shape type descriptors |
The pharmacophoric enrichment analysis (marked with “*”) was carried out over VolSurf features (Additional file 5: VolSurf_Classification.xls) considered as a gold standard, and measuring enrichment of the list in a component by a hypergeometric test.
Figure 4Compounds high in hydrogen Bonding. Azacitidine (left) and Idarubicin (right) showing H-bonding areas with blue (hydrogen-bond donor) and red (hydrogen-bond acceptor).
Figure 53A drug transitions. NeRV visualization showing Drug Treatment Transitions. Lines indicate the transition from Pretreated MCF7 to treated MCF7 cells.
Figure 6Finding interesting components. Heatmap across the 10 highest scoring significant CCA components: X-axis lists the top 30 significant genes in each subcomponent, while y-axis represents the top 20 scoring compounds in each. Two unique components 2B and 10A are zoomed in to show the detailed expression pattern along with 3D VolSurf descriptors (green areas are the lipophilic fields and the purple water fields). Only a subset (5 compounds and 10 genes) is shown in the zoomed version due to space constraints.
Growth Inhibition verification of 2B/10A Compounds
| 2B | MCF7 | |||
| 2B, 10A | MCF7 | |||
| 2B | MCF7 | |||
| 2B | PC3 | |||
| 10A | HL60 | |||
| 10A | HL60 | |||
| 10A | PC3 | |||
| 10A | MCF7 | |||
| 10A | MCF7 | |||
| 10A | MCF7 | |||
| 10A | MCF7 | |||
| esculetin | 25.1 | >100** | 10A | HL60 |
| fulvestrant | 1.0 | >100** | 10A | PC3 |
GI50 values (drug concentration causing a 50% growth inhibition) from NCI/DTP are shown along with the corresponding concentrations used in the Connectivity Map (CMap) data. By comparing the GI50 and CMap values we can get an idea of expected cell killing effect of the drug in the CMap data. Drugs that are expected to eventually kill the cells are shown in bold. GI50 and CMap concentration values are in μM scales.
* GI50 value at the end of the tested range.
** Mean of GI50 values from HL60 and MCF7 cell lines.
*** Value from HL60 cell line.
Figure 7Heatmap for subcomponent 7B. Y-axis lists the top 10 active compounds in the component, replicated over the three cell lines, while the X-axis lists the most significantly active genes in the component. The genes are clearly activated systematically and exclusively in the HL60 cell line, hence indicating an HL60 specific response.