| Literature DB >> 28245581 |
Yang Liu1, Xiaoyao Yin2, Jing Zhong3, Naiyang Guan4, Zhigang Luo5, Lishan Min6, Xing Yao7, Xiaochen Bo8, Licheng Dai9, Hui Bai10,11.
Abstract
With accumulating public omics data, great efforts have been made to characterize the genetic heterogeneity of breast cancer. However, identifying novel targets and selecting the best from the sizeable lists of candidate targets is still a key challenge for targeted therapy, largely owing to the lack of economical, efficient and systematic discovery and assessment to prioritize potential therapeutic targets. Here, we describe an approach that combines the computational evaluation and objective, multifaceted assessment to systematically identify and prioritize targets for biological validation and therapeutic exploration. We first establish the reference gene expression profiles from breast cancer cell line MCF7 upon genome-wide RNA interference (RNAi) of a total of 3689 genes, and the breast cancer query signatures using RNA-seq data generated from tissue samples of clinical breast cancer patients in the Cancer Genome Atlas (TCGA). Based on gene set enrichment analysis, we identified a set of 510 genes that when knocked down could significantly reverse the transcriptome of breast cancer state. We then perform multifaceted assessment to analyze the gene set to prioritize potential targets for gene therapy. We also propose drug repurposing opportunities and identify potentially druggable proteins that have been poorly explored with regard to the discovery of small-molecule modulators. Finally, we obtained a small list of candidate therapeutic targets for four major breast cancer subtypes, i.e., luminal A, luminal B, HER2+ and triple negative breast cancer. This RNAi transcriptome-based approach can be a helpful paradigm for relevant researches to identify and prioritize candidate targets for experimental validation.Entities:
Keywords: Cancer Gene Census; DNA methylation; breast cancer; drug target; gene set enrichment analysis; library of integrated network-based cellular signatures
Year: 2017 PMID: 28245581 PMCID: PMC5368690 DOI: 10.3390/genes8030086
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Workflow of candidate therapeutic target identification and multifaceted assessment. Luminal A phenotype specific signatures were calculated using TCGA gene expression data of luminal A breast cancer and corresponding normal samples. The signatures were then queried against Library of Integrated Network-Based Cellular Signatures (LINCS) MCF7 RNAi gene expression profile of 3689 genes using gene set enrichment analysis. Genes negatively connected to the phenotype were considered as candidate targets for luminal A. Then the targets were inferred to other three subtypes (i.e., luminal B, HER2+, TNBC) based on gene expression pattern analysis and further validated with transcriptome analysis, methylation analysis, cancer gene analysis and drug target analysis. Detailed information is provided in Materials and Methods.
Figure 2(A) Hierarchical clustering dendrogram of 919 breast samples (434 luminal A, 194 luminal B, 67 HER2+, 105 TNBC and 119 normal tissue) using 1000 most-variable genes as determined by variation; (B) Overlapping number of top ranking differentially expressed genes between any two cancer subtypes; (C) Venn diagram of differentially expressed genes for four subtypes.
Figure 3(A) Functional classes of protein products from the 510 candidate genes. Enzymes (predominantly kinases) and transcription factors constitute over half the candidate genes. A total of 14% of proteins, here labelled “Other”, fall into classes including transporters, cytokines, splicing factors, kinase activator, etc.; (B) Absolute connectivity score of candidate genes in each functional group. Genes with absolute connectivity score above 0.85 are labeled with gene symbols.
Final candidate gene targets for four breast cancer subtypes.
| Gene Symbol | Gene Name | Luminal A | Luminal B | HER2+ | TNBC | Protein Name | No. of Targeted Drugs | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| CS | logFC | CS | logFC | CS | logFC | CS | logFC | ||||
| MUC1 | mucin 1, cell surface associated | −0.75 | 3.11 | −0.75 | 1.97 | −0.75 | 2.33 | −0.75 | 0.43 | Mucin-1 | 0 |
| HLA-DRA | major histocompatibility complex, class II, DR alpha | −0.73 | 0.31 | - | - | −0.73 | 0.8 | - | - | HLA class II histocompatibility antigen, DR alpha chain | 0 |
| WNT7B | Wnt family member 7B | −0.73 | 2.14 | −0.73 | 1.96 | - | - | - | - | Protein Wnt-7b | 0 |
| XBP1 | X-box binding protein 1 | −0.72 | 1.58 | −0.72 | 1.68 | −0.72 | 0.94 | - | - | X-box-binding protein 1 | 0 |
| EFCAB2 | EF-hand calcium binding domain 2 | −0.72 | 0.08 | - | - | - | - | - | - | EF-hand calcium-binding domain-containing protein 2 | 0 |
| ATG16L2 | autophagy related 16 like 2 | - | - | −0.77 | 0.19 | - | - | - | - | Autophagy-related protein 16-2 | 0 |
| C1QTNF6 | C1q and tumor necrosis factor related protein 6 | - | - | - | - | −0.77 | 2.23 | - | - | Complement C1q tumor necrosis factor-related protein 6 | 0 |
| NDUFS6 | NADH: ubiquinone oxidoreductase subunit S6 | - | - | - | - | −0.77 | 1.34 | - | - | NADH dehydrogenase [ubiquinone] iron-sulfur protein 6, mitochondrial | 1 |
| CHERP | calcium homeostasis endoplasmic reticulum protein | - | - | - | - | - | - | −0.77 | 0.46 | Calcium homeostasis endoplasmic reticulum protein | 0 |
| TIAM1 | T-cell lymphoma invasion and metastasis 1 | - | - | - | - | - | - | −0.71 | 0.68 | T-lymphoma invasion and metastasis-inducing protein 1 | 0 |
| GABRP | gamma-aminobutyric acid type A receptor pi subunit | - | - | - | - | - | - | −0.7 | 1.5 | Gamma-aminobutyric acid receptor subunit pi | 41 |
Note: CS means connectivity score; FC means fold change; “-” means the corresponding values do not reach the thresholds that define statistical significance.
Figure 4Statistical summarization of final candidate targets for four breast cancer subtypes from multifaceted assessment.