| Literature DB >> 29303984 |
Dan Li1, William Yang2, Jialing Zhang3, Jack Y Yang4, Renchu Guan5, Mary Qu Yang6.
Abstract
Lung cancer is the second most commonly diagnosed carcinoma and is the leading cause of cancer death. Although significant progress has been made towards its understanding and treatment, unraveling the complexities of lung cancer is still hampered by a lack of comprehensive knowledge on the mechanisms underlying the disease. High-throughput and multidimensional genomic data have shed new light on cancer biology. In this study, we developed a network-based approach integrating somatic mutations, the transcriptome, DNA methylation, and protein-DNA interactions to reveal the key regulators in lung adenocarcinoma (LUAD). By combining Bayesian network analysis with tissue-specific transcription factor (TF) and targeted gene interactions, we inferred 15 disease-related core regulatory networks in co-expression gene modules associated with LUAD. Through target gene set enrichment analysis, we identified a set of key TFs, including known cancer genes that potentially regulate the disease networks. These TFs were significantly enriched in multiple cancer-related pathways. Specifically, our results suggest that hepatitis viruses may contribute to lung carcinogenesis, highlighting the need for further investigations into the roles that viruses play in treating lung cancer. Additionally, 13 putative regulatory long non-coding RNAs (lncRNAs), including three that are known to be associated with lung cancer, and nine novel lncRNAs were revealed by our study. These lncRNAs and their target genes exhibited high interaction potentials and demonstrated significant expression correlations between normal lung and LUAD tissues. We further extended our study to include 16 solid-tissue tumor types and determined that the majority of these lncRNAs have putative regulatory roles in multiple cancers, with a few showing lung-cancer specific regulations. Our study provides a comprehensive investigation of transcription factor and lncRNA regulation in the context of LUAD regulatory networks and yields new insights into the regulatory mechanisms underlying LUAD. The novel key regulatory elements discovered by our research offer new targets for rational drug design and accompanying therapeutic strategies.Entities:
Keywords: cancer; gene regulation; long non-coding RNA; network; systems biology; transcription factor
Year: 2018 PMID: 29303984 PMCID: PMC5793165 DOI: 10.3390/genes9010012
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Gene regulatory network identification procedure. The procedure includes co-expression identification, edge determination by transcription factor (TF) targets, Bayesian analysis, and long non-coding RNAs (lncRNA)–protein binding potential. The DNA methylation analysis was used to infer potential epigenetic regulations. DEGs: Differential Expressed Genes.
Figure 2Key transcription factors in the network. (a) A total of 95 TFs demonstrated significant expression level alterations in lung adenocarcinomas and had at least one target gene in the same network. (b) The pathways that were significantly enriched for the 46 key TFs. (c) Nine common TFs were revealed by overlapping 46 key TFs with 61 TFs carrying at least one somatic mutation. (d) The pathways that were significantly enriched for 61 TFs harboring significant somatic mutation(s) in lung adenocarcinoma (LUAD). HTLV-1: human T cell lymphotropic virus type 1.
Figure 3Key transcription factors outside the network. (a) The target genes of nine transcription factors that carried driver mutations were abundant in the regulatory networks. The color of the edge represents the p-value of the target gene enrichment analysis. The red-colored networks were over-expressed, whereas the green-colored networks were under-expressed in the disease. The yellow-colored networks were mixed with over-expressed and under-expressed genes. (b) The hierarchy clusters for the nine TFs and their regulated networks.
Figure 4Key regulatory lncRNAs. (a) Network 10 consists of several lung cancer-related lncRNAs, such as MALAT1 and NEA1. The blue-colored nodes represent lncRNA transcripts. (b) The expression alterations (log2(FC)) of 13 key lncRNAs in the 16 solid tissue cancer types. Red denotes over-expression while green represents under-expression of the lncRNAs in cancers. Grey refers to an insignificant gene expression change.
Key regulatory long non-coding RNAs (lncRNAs), their corresponding downstream target protein-coding genes and lncRNAs, and the cancer types in which the key lncRNAs are differentially expressed.
| Key lncRNA | Target Protein-Coding Gene(s) 1 | Target lncRNA(s) 1 | Differentially Expressed in Cancer Types 2 |
|---|---|---|---|
| LINC00261 | NA | BICA, Bladder, Esophagus, GM, HN, KRCC, Liver, LSCC, LUAD, Prostate, Thyroid, UCEC | |
| MALAT1 | NEAT1 (0.77; 0.84) | BICA, COAD, KRCC, LUAD, Prostate, READ, UCEC | |
| LINC01614 | NA | BICA, Bladder, COAD, Esophagus, GM, HN, KRCC, LSCC, LUAD, READ, STAD, Thyroid | |
| AC007405.6 | NA | COAD, Esophagus, KRCC, LSCC, LUAD, READ | |
| AC109642.1 | NA | Bladder, GM, KRCC, KRPC, LSCC, LUAD, UCEC | |
| RP11-672A2.4 | NA | BICA, Bladder, KRPC, LSCC, LUAD, UCEC | |
| LINC01355 | RP11-159D12.2 (0.68; 0.78), AC008746.12 (0.77; 0.85), RP11-332H14.2 (0.64; 0.65) | Bladder, COAD, HN, KRCC, KRPC, Liver, LSCC, LUAD, Prostate, READ, STAD | |
| CTD-2547G23.4 | NEAT1 (0.47; 0.70), LL0XNC01-7P3.1 (0.81; 0.82), LINC01355 (0.77; 0.85), RP4-563E14.1 (0.70; 0.82), RP11-112J3.16 (0.73; 0.72) | Bladder, COAD, HN, KRCC, KRPC, Liver, LSCC, LUAD, Prostate, READ | |
| CTD-2349P21.9 | NA | COAD, KRPC, Liver, LUAD | |
| RP11-468E2.10 | NA | Liver, LUAD | |
| LINC00926 | NA | Bladder, KRCC, LUAD, Thyroid | |
| RP11-290F5.1 | NA | COAD, KRCC, KRPC, Liver, LSCC, LUAD, READ, STAD, UCEC | |
| RP11-291B21.2 | AC002331.1 (0.74; 0.71) | BICA, Bladder, COAD, HN, KRCC, KRPC, LUAD, UCEC |
1 The first number in parentheses represents the Pearson correlation coefficient of the key regulator and the corresponding target gene in the normal lung and lung adenocarcinoma (LUAD) tissues, while the second number represents their expression correlation in the 53 human tissues. 2 BICA: Breast Invasive Carcinoma; COAD: Colon Adenocarcinoma; HN: Head and Neck; GM: Glioblastoma Multiforme; KRCC: Kidney Renal Clear Cell Carcinoma; KRPC: Kidney Renal Papillary Cell Carcinoma; LSCC: Lung Squamous Cell Carcinoma; READ: Rectum Adenocarcinoma; STAD: Stomach Adenocarcinoma; UCEC: Uterine Corpus Endometrial Carcinoma.