| Literature DB >> 29113413 |
Jonathan Castillo1,2,3, Theresa R Stueve2, Crystal N Marconett1,2,3.
Abstract
Previously thought of as junk transcripts and pseudogene remnants, long non-coding RNAs (lncRNAs) have come into their own over the last decade as an essential component of cellular activity, regulating a plethora of functions within multicellular organisms. lncRNAs are now known to participate in development, cellular homeostasis, immunological processes, and the development of disease. With the advent of next generation sequencing technology, hundreds of thousands of lncRNAs have been identified. However, movement beyond mere discovery to the understanding of molecular processes has been stymied by the complicated genomic structure, tissue-restricted expression, and diverse regulatory roles lncRNAs play. In this review, we will focus on lncRNAs involved in lung cancer, the most common cause of cancer-related death in the United States and worldwide. We will summarize their various methods of discovery, provide consensus rankings of deregulated lncRNAs in lung cancer, and describe in detail the limited functional analysis that has been undertaken so far.Entities:
Keywords: LncRNA biology; RNA biology; cancer; lung adenocarcinoma; transcriptomic analysis
Year: 2017 PMID: 29113413 PMCID: PMC5655308 DOI: 10.18632/oncotarget.18432
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Exponential discovery of lncRNAs with the adoption of transcriptome-wide gene expression technologies
Graph indicates the total number of publications per year for select lncRNAs with known involvement in LUAD. With the advent of transcriptomic profiling, the pace of lncRNA discovery and papers characterizing their function has increased exponentially over the last decade. lncRNAs included are those described in Table 1.
Known lncRNAs involved in LUAD
| GENE | Locus | Nearby factor implicated in cancer | Mechanism(s) of action | Additional cancer association(s) |
|---|---|---|---|---|
| 9p21 | CDKN2B (INK4-ARF) tumor suppressor | Oncogene. Antagonizes the CDKN2A and CDKN2B tumor suppressors via recruitment of PRC2 and PRC1 [ | basal cell, breast, cervical, esophageal, gallbladder, gastric, liver, melanoma, ovarian. | |
| 11p15 | IGF2 growth factor | Oncogene. Targets multiple tumor suppressive miRNAs [ | adrenal, bladder, cervical, colorectal, gallbladder, gastric, esophageal, laryngeal, nasopharyngeal, ovarian, pancreatic, thyroid. | |
| 12q13 | HOXC transcription factors | Oncogene. Long-range epigenetic action via recruitment of PRC2 and LSD1 [ | bladder, colorectal, ER (+) breast, liver, nasopharyngeal, oral, ovarian, pancreatic, pituitary, small cell lung. | |
| 7p15 | HOXA transcription factors | Oncogene. Regulates chromatin structure at the HOXA transcription factor locus [ | colorectal, pancreatic, osteosarcoma, tongue. | |
| 11q13 | Oncogene. Suppresses E-cadherin via suz12 recruitment, leading to metastasis [ | glioma, multiple myeloma, pituitary, renal clear cell, tongue. | ||
| 14q32 | DLK1 growth factor receptor | Tumor Suppressor. Long-range epigenetic action, leading to suppression of TFG β [ | AML, cervical, colorectal, gastric, meningioma, ovarian, pancreatic, pituitary, prostate, thyroid. | |
| 11q13 | Oncogene. Promotes survival when DNA damage present via paraspeckle formation [ | colorectal, esophageal, gastric, glioma, leukemia, ovarian, prostate. |
The RefSeq gene name annotation, alongside the hg19 chromosomal location are listed. In addition, the established mechanism of action is listed, as well as other cancers where the lncRNA has demonstrated effects on tumor initiation, promotion, progression, and/or patient survival outcomes.
Figure 2Molecular origins of LUAD
Lung adenocarcinoma (LUAD) arises in the distal alveolar epithelium from progenitor alveolar epithelial cells. LUAD develops from these precursor cells though oncogenic activation (and deactivation of tumor suppressors) by induced mutations to the DNA, amplification and fusion events, as well as epigenomic alterations. Genes listed were taken from TCGA analysis of LUAD (15). Added to this is the newly-emergent appreciation for altered lncRNA regulation of cellular processes as an oncogenic event.
Figure 3Overlap of deregulated lncRNAs in LUAD between multiple large-scale bioinformatic studies
Results from microarray analysis, de novo RNA-seq transcriptome assembly of TCGA LUAD datasets from the Maher study on LCALs (supplementary data file 5 from their study which includes LUAD-specific lncRNAs), and robust statistical analysis of multiple lung cancer datasets were overlapped in the hg19 UCSC genome browser to determine a unifying set of lncRNAs deregulated in all three studies. These forty deregulated lncRNAs fall broadly into two categories, 15 were antisense transcripts and the remaining 25 were intergenic genes.
Differentially expressed intergenic lncRNAs
| Level of Validation | Genecode ID | Alternate ID | Position (GRCh37/hg19) |
|---|---|---|---|
| Known Genes | NR_034007 | chr3: 156799455-156840793 | |
| TCONS_00008359 | chr4: 185748077-185776905 | ||
| TCONS_00010402 | chr5: 90597873-90621000 | ||
| ENST00000504719 | chr8: 128806778-129113503 | ||
| TCONS_00018434 | chr10: 10825866-10837007 | ||
| ENST00000439790 | chr13: 107028910-107030941 | ||
| TCONS_00023799 | chr15: 95752084-96051089 | ||
| TCONS_00016990 | |||
| Validated Genes | TCONS_l2_00015916 | chr2: 112186885-112268567 | |
| ENST00000523242 | chr5: 173069521-173085197 | ||
| TCONS_l2_00024132 | chr6: 28953517-28959134 | ||
| ENST00000522511 | chr8: 57401656-57472382 | ||
| ENST00000566787 | chr16: 10608698-10622059 | ||
| ENST00000579631 | chr17: 70319263-70599647 | ||
| TCONS_l2_00011006 | chr17: 77796264-77801616 | ||
| ENST00000586345 | chr19: 36795481-36822667 | ||
| ENST00000428669 | chr21: 17553910-18013444 | ||
| Predicted Genes | TCONS_00000738 | chr1: 222054322-222158306 | |
| ENST00000331944 | chr2: 87754947-87821037 | ||
| ENST00000510621 | chr5: 66995256-67198428 | ||
| ENST00000548359 | chr12: 79933982-79944315 | ||
| TCONS_00023630 | chr15: 38360990-38365188 | ||
| Predicted Genes | TCONS_00007953 | chr4: 1546984-1555291 | |
| TCONS_00025436 | chr17: 53690342-53725799 | ||
| TCONS_00029745 | chr22: 50981205-50983413 |
LncRNAs that were deregulated in three studies (from Figure 3) that also occupy chromosomal regions in between mRNAs. LncRNAs were segregated by their Gencode transcript confidence level (tsl1 = highest quality, full transcript is validated; tsl2-3, one or many spliced ESTs are validated; tsl4-5 = one or none ESTs support the validity of the transcript, and those ESTs are suspect.) Those without a tsl ranking do not have a representative transcript in Gencode. All coordinates span the entire transcript length and are hg19 genome-based. The MSN transcript overlaps in the same orientation the MSN mRNA, therefore there is a high probability that this is not truly a lncRNA (and thus is greyed out in figure).
Differentially expressed anti-sense lncRNAs
| Alternate IDs | Anti-Sense Gene | Multi-exonic | Position (GRCh37/hg19) | |
|---|---|---|---|---|
| RP4-575N6.1 | YES | chr1: 101701238-101702084 | ||
| DUBR | YES | chr3: 106959538-107045811 | ||
| LINC00312 | NO | chr3: 8613467-8634810 | ||
| RP11-325F22.5 | YES | chr7: 104558006-104567077 | ||
| FEZF1-AS1 | FEZF1-AS1 | YES | chr7: 121945003-121945871 | |
| RP11-251M1.1 | YES | chr9: 139541826-139554873 | ||
| RP11-783K16.5 | YES | chr11: 64014525-64015649 | ||
| RP11-677M14.3 | YES | chr11: 124632326-124635257 | ||
| SBK1-AS1 | RP11-57A19.2 | YES | chr16: 28270020-28303385 | |
| FENDRR | FENDRR | YES | chr16: 86508050-86542705 | |
| RP11-332H18.5 | YES | chr17: 59470732-59477096 | ||
| RP11-720L2.4 | YES | chr18: 314886-319165 | ||
| GATA6-AS1 | GATA6-AS1 | YES | chr18: 19746858-19748929 | |
| RP11-290F20.2 | YES | chr20: 48909256-48937879 | ||
| LINC00649 | YES | chr21: 35295736-35351160 |
The Gencode annotation for each lncRNA is indicated, along with the genomic coordinates of the lncRNA and the mRNA that is transcribed in the antisense orientation. All but LINC00312 are multi-exonic, indicative of splicing. The hg19-based lncRNA coordinates are listed.
miTranscriptome-defined LUAD lineage-specific lncRNAs
| miTranscriptome | Alternate IDs | Position (GRCh37/hg19) | Genomic Context | Patient Outcome |
|---|---|---|---|---|
| chr8:128996355-129130070 (−) | ||||
| chr1:97720954-97751573 (+) | ||||
| chr16:47936832-47961855 (+) | ||||
| chr12:131471968-131478539 (−) |
miTranscriptome-calcuated lncRNAs with lineage specificity to LUAD.