| Literature DB >> 30287838 |
Subbroto Kumar Saha1, Yeojin Jeong1, Sungha Cho1, Ssang-Goo Cho2.
Abstract
OCT4 is a master transcription factor that regulates the pluripotency of pluripotent stem cells and cancer stem cells along with other factors, including SOX2, KLF4, and C-MYC. Three different transcripts, OCT4A, OCT4B, and OCT4B1, are known to be generated by alternative splicing and eight OCT4 pseudogenes have been found in the human genome. Among them, we examined OCT4 and three pseudogenes (POU5F1P1, POU5F1P3, and POU5F1P4) because of their high expression possibility in cancer. In addition, previous studies indicated that OCT4 expression is augmented in cervical cancer and associated with poor prognosis, whereas OCT4 is down-regulated and correlated with good clinical outcomes in breast cancer. Because of these conflicting reports, we systematically evaluated whether expression of OCT4 and its pseudogenes can serve as oncogenic markers in various human cancers using the Oncomine database. Moreover, copy number alterations and mutations in OCT4 gene and its pseudogenes were analyzed using cBioPortal and the relationship between expression of OCT4 and pseudogenes and survival probability of cancer patients were explored using Kaplan-Meier plotter, OncoLnc, PROGgeneV2, and PrognoScan databases. Multivariate survival analysis was further conducted to determine the risk of the expression of the occurrence of OCT4 and its pseudogenes on certain cancer types using data from the Kaplan-Meier plotter. Overall, an association between expression of OCT4 and pseudogenes and cancer prognosis were established, which may serve as a therapeutic target for various human cancers.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30287838 PMCID: PMC6172215 DOI: 10.1038/s41598-018-33094-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1mRNA expression pattern of OCT4 and its pseudogenes in various cancer types: (a) The comparison indicated the number of datasets with OCT4 and its pseudogene mRNA over-expression (right column, red) and under-expression (left column, blue) in cancer versus normal tissues. The threshold was designed with the following parameters: p-value of 1E-4, fold-change of 2, and gene ranking of 10%. (b) Schematic view of OCT4 and its pseudogene transcription. POU5F1 and its three pseudogenes (POU5F1P1, POU5F1P3, and POU5F1P4) showed high homology in their mRNA sequences, but each gene was in a different chromosome.
Figure 2OCT4 (POU5F1) expression pattern and patient survival analysis in different cancer types, compared to OCT4 expression in normal tissue and each cancer tissue. (a) The fold-change of OCT4 in various types of cancers was identified by our analyses as shown in Supplementary Table S1 and expressed as a forest plot. (b–d) The box plot comparing specific OCT4 expression in normal (left plot) and cancer tissue (right plot) was derived from the Oncomine database. The analysis was shown in breast carcinoma relative to in normal breast (b), in ovarian adenocarcinoma relative to in normal ovarian tissue (c), in renal carcinoma relative to normal renal (d). (e) Significant hazard ratios in various types of cancers were identified from our analyses shown in Supplementary Table S2 and expressed as a forest plot. (f–h) The survival curve comparing patients with high (red) and low (black, blue, and green) expression in breast (f), ovarian (g), kidney (h) cancer was plotted from the Kaplan Meier-plotter, PROGgeneV2, and OncoLnc database. Survival curve analysis was conducted using a threshold Cox p-value < 0.05.
Figure 3Mutation and alteration frequency patterns of OCT4 (POU5F1) and its associated genes in various cancers: (a) Functional protein partner of OCT4 was predicted by STRING. Line indicates the predicted mode of molecular action. (b) Mutation diagram of POU5F1 in different cancer types across protein domains. POU5F1 mutation frequencies are the highest in melanoma and POU5F1 mutation mere more frequent in N-domain than in the C-domain. (c) The alteration frequency of a five-gene signature (POU5F1, SOX2, NANOG, LIN2BA, and KLF4) was determined using cBioPortal and is shown on the top. The alteration frequency of a seven-gene signature (POU5F1, SOX2, NANOG, LIN2BA, KLF4, SALL4, and FGF2) was determined using cBioPortal and is shown on the bottom. Only cancer types containing >100 samples and an alteration frequency of >20% are shown. The alteration frequency included deletions (blue), amplification (red), multiple alterations (grey), or mutation (green). The total number of samples for each cancer type is indicated by the numbers at the top of each column. Prostate cancer types frequently amplify POU5F1. We used the Oncoprint feature of cBioPortal to determine the copy number alteration frequency of each gene in POU5F1 within selected cancer subtypes. (d) The percentages of alterations in five genes and seven genes in the prostate cancer. Grey bars along a vertical line represent the same sample evaluated for amplification (red), deep deletion (blue), missense mutation (green), truncating mutation (black), or in-frame mutation (brown). (e) The interactions between POU5F1 and its associated gene alterations were searched in cBio Cancer Genomics Portal. Network view of the POU5F1 neighborhood in prostate cancer. Darker red indicates increased frequency of alteration (defined by mutation, copy number amplification, or homozygous deletion) in prostate cancer.
Figure 4POU5F1P1 expression and mutation pattern compared to POU5F1P1 expression in normal tissue and each cancer tissue: (a) The fold-change of POU5F1P1 in various types of cancers was identified from our analyses shown in Supplementary Table S7 and expressed as the forest plot. (b–e) The box plot comparing specific POU5F1P1 expression in normal (left plot) and cancer tissue (right plot) was derived from the Oncomine database. The analysis was shown in seminoma relative to normal testicle (b), in renal carcinoma relative to normal renal (c), in melanoma relative to normal skin (d), and in breast relative to normal breast (e). (f) Significant hazard ratios in various types of cancers was identified from our analyses shown in Table S8 and expressed as a forest plot. (g-h) Survival curve comparing patients with high (red) and low (black, blue) expression in melanoma (g), breast (h) was plotted from OncoLnc and Kaplan Meier-plotter database. Survival curve was analyzed using a threshold Cox p-value < 0.05.
Figure 5Mutation and alteration frequency patterns of POU5F1P1 and its associated genes in various cancers. (a) Functional protein partner of POU5F1P1 was predicted by STRING web. Line indicates the type of interaction evidence. (b) Mutation diagram of POU5F1P1 in different cancer types across protein domains was expressed. POU5F1P1 mutation frequencies are the highest in lung and one hot spots (P255S) representing the common founder mutations in POU5F1P1 homeobox site. The alteration frequency of a three-gene signature (POU5F1P1, PRDM14, and FAM84B) was determined using cBioPortal and is shown on the top. (c) The alteration frequency of a five-gene signature (POU5F1P1, PRDM14, FAM84B, TCFL2, and HLA-C) was determined using cBioPortal and is shown on the bottom. Only cancer types containing >100 samples and an alteration frequency of >20% are shown. The alteration frequency included deletions (blue), amplification (red), multiple alterations (grey), or mutation (green). The total number of samples for each cancer type is indicated by the numbers at the top of each column. Prostate cancer types frequently amplify POU5F1P1. We used the Oncoprint feature of cBioPortal to determine the copy number alteration frequency of each gene in POU5F1 within selected cancer subtypes. (d) The percentages of alterations in three genes and five genes in the prostate cancer. Grey bars along a vertical line represent the same sample evaluated for amplification (red), deep deletion (blue), missense mutation (green), truncating mutation (black), or in-frame mutation (brown).
Figure 6POU5F1P3 expression pattern and patient survival analysis in different cancer types compared to POU5F1P3 expression in normal tissue and each cancer tissue. (a) The fold-change of POU5F1P3 in various types of cancers was identified from our analyses shown in Supplementary Table S13 and expressed as the forest plot. (b–e) The box plot comparing specific POU5F1P3 expression in normal (left plot) and cancer tissue (right plot) was derived from the Oncomine database. The analysis was shown in colorectal carcinoma relative to normal colorectal (b), in renal carcinoma relative to normal renal (c), in seminoma relative to normal testicle (d), in breast carcinoma relative to normal breast (e). (f) Significant hazard ratios in various types of cancers was identified from our analyses shown in Table S14 and expressed as a forest plot. (g–i) The survival curve comparing patients with high (red) and low (black, blue) expression in breast (g), blood (h), and kidney (i) cancers was plotted from the Kaplan Meier-plotter, PrognoScan, and GAPIA database. The survival curve was analyzed using a threshold Cox p-value < 0.05.
Figure 7POU5F1P4 expression pattern and patient survival analysis in different cancer types compared to POU5F1P4 expression in normal tissue and each cancer tissue: (a) The fold-change of POU5F1P4 in various types of cancers was identified from our analyses shown in Supplementary Table S15 and expressed as the forest plot. (b–d) The box plot comparing specific POU5F1P4 expression in normal (left plot) and cancer tissue (right plot) was derived from the Oncomine database. The analysis was shown in breast carcinoma relative to normal breast (b), in rectal adenocarcinoma relative to normal rectal tissue (c), in gastric adenocarcinoma relative to normal gastric tissue (d). (e) Significant hazard ratios in various types of cancers were identified from our analyses shown in Supplementary Table S16 and expressed as the forest plot. (f–h) The survival curve comparing patients with high (red) and low (black, blue) expression in breast (f), colorectal (g), and gastric (h) tissue was plotted from the PrognoScan database, R2: Genomics analysis and visualization platform, and Kaplan-Meier plotter. The survival curve was analyzed with a threshold Cox p-value < 0.05. (i) The summary of predictive role of OCT4 (POU5F1) and its three pseudogenes in different cancers is based on the consistent results of gene expression and outcome.
Figure 8Expression co-occurrence of OCT4 and its pseudogenes in relation to the clinical prognosis of cancer patients. The multivariate survival curves compare the clinical prognosis in patients with high/high (red), high/low (green), low/high (blue), and low/low (orange) expression co-occurrence of POU5F1/POU5F1P1, POU5F1/POU5F1P3, and POU5F1/POU5F1P4 in breast (a), ovarian (b), lung (c), and gastric (d) cancers. The clinical outcome data were retrieved from the Kaplan-Meier plotter database. Information indicating statistical significance represents p < 0.05, and a non-significant p-value is expressed as ‘ns’ in the graph.
Main characteristic of the selected oncogenomic portals.
| Database | Data source | Sites of analyzed cancer* | Oncogenomic data | link |
|---|---|---|---|---|
| Oncomine | TCGA, Cancer data from literature | Bd; Br; Bra; Cer; Clr; Eso; HN; Kd; Lng; Lvr; Lymph; Ov; Pnc; also: cancer cell lines | Drug sensitivity, cancer histology, clinical outcome, tissue, pathology, subtype, molecular subtype, patient treatment response | https://www.oncomine.org[ |
| PrognoScan | Cancer data from literature | Bd; Bld; Br; Bra; Clr; EA; Eso; HN; Kd; Lng; Lymph; Ov; Prst; Sk; ST; | Survival analyses | http://www.abren.net/PrognoScan[ |
| STRING | Protein, gene from literature | Gene, gene from literature | Structure | http://stringdb.org[ |
| cBioPortal | AMC, BCCRC, BGI, British Columbia, Broad, Broad/Cornell, CCLE, CLCGP, Genentech, ICGC, JHU, Michigan, MKSCC, MKSCC/Broad, NCCS, NUS, PCGP, Pfizer UHK, Riken, Sanger, Singapore, TCGA, TSP, UTokyo, Yale | ACC; Bd; Bld; Br; Bra; Chl; Clr; Eso; HN; Kd; Lng; Lvr; Lymph; MM` Npx; Ov; Pnc; Prst; Sk; ST; Stc; Thr; Utr; also: cancer cell lines | Mutations, putative copy number alterations; mRNA expression, protein/phosphoprotein level; survival analyses | http://www.cbioportal.org/[ |
| DAVID functional annotation | — | Signal pathway | GO terms, annotation terms, BioCarta & KEGG pathway, interacting proteins, gene-disease associations, protein functional domains and motifs | https://david.ncifcrf.gov/home.jsp[ |
| OncoLnc | TCGA | Bld: Br: Cer: Col: Eso: Gil: Head; Kd; Leuk; Bra; Lvr; Lng; Ov; Panc; Reect; Src; Stm; MM; | Survival analyses | http://www.oncoLnc.org/[ |
| PROGgeneV2 | TCGA, Cancer data from literature | Br; Kd; Bld; Bon; Bra; Col; Heme; Hnc; Liv; Lng; Ov; Panc; Prs; Rect; Skn; Stm; Uter; Cerv; Eso; Eye; Gst; Mstl; Nure; Src; Tym; Tyrd; also: cancer cell lines | Survival analyses | http://watson.compbio.iupui.edu/chirayu/proggene/database/index.php[ |
| Kaplan-Meier plotter | GEO (Affymetrix microarrays only), EGA and TCGA | Br; Gst; Ov; Lng; Liv; | Survival analyses | http://kmplot.com/analysis/[ |
| GEPIA | RNA sequencing expression data from TCGA and the GTEx project | Acc; Blca; Brca; Cesc; Chol; Coad; Dlbc; Esca; Gbm; Hnsc; Kich; Kirc; Kirp; Laml; Lgg; Lihc; Luad; Lusc; Meso; Ov; Paad; Pcpg; Prad; Read; Sarc; Skcm; Stad; Tgct; Thca; Thym; Ucec; Uvm | Survival analysis, Methylation, Annotation, WGS, SNP, Chip, CGHt | http://gepia.cancer-pku.cn/index.html[ |
| R2: Genomics analysis and visualization platform | GEO, TCGA, and GTEx projects | Gli; Kicc; Lug; Lym; Mlym; Mal; Myel; Neur; Ova; Pan; Wil | Survival analysis | https://hgserver1.amc.nl/cgi-bin/r2/main.cgi?&species=hs[ |
*Abbreviations: ACC– adenoid cystic carcinoma; Bd – bladder; Bld – blood; Bo – bone; Br – breast; Bra – brain; Chl – cholangiocarcinoma; Clr – colorectal; Col – colon; EA – eye and adnexa; EG - endocrine glands; Eso – esophagus; GIST – gastrointestinal; HN– head and neck; Htp – hematopoietic; Kd – kidney; Lng – lung; Lvr – liver and biliary tract; Lymph – Lymphoma; Msh –mesothelioma; Mth – mouth; Nb – neuroblastoma; Npx – nasopharynx; Ov – ovary; Pan – pancancer; Pnc – pancreas; Pnx– pharynx; Prc/Prn - pheochromocytoma and paraganglioma; Prst – prostate; Rc – rectum; Sk – skin; ST – soft tissues; Stc– stomach; Swn – schwannoma; Thm – thymus; Thr – thyroid; Tst – testis; Utr – uterine; Blca – bladder urothelial carcinoma; Brca – breast invasive carcinoma; Cesc – cervical squamous cell carcinoma and endocervical adenocarcinoma; Chol – cholangio carcinoma; Coad – colon adenocarcinoma; Dlbc – lymphoid neoplasm diffuse large B-cell lymphoma; Esca –esophageal carcinoma; Gbm – glioblastoma multiforme; Hnsc – head and neck squamous cell carcinoma; Kich – kidney chromophobe; Kirc – kidney renal clear cell carcinoma; Kirp – kidney renal papillary cell carcinoma; Laml – acute myeloid leukemia; Lgg – brain lower grade glioma; Lihc – liver hepatocellular carcinoma; Luad – lung adenocarcinoma; Lusc – lung squamous cell carcinoma; Meso – mesothelioma; Ov – ovarian serous cystadenocarcinoma; Paad – pancreatic adenocarcinoma; Pcpg – pheochromocytoma and paraganglioma; Prad – prostate adenocarcinoma; Read – rectum adenocarcinoma; Sarc – sarcoma; Skcm – skin cutaneous melanoma; Stad – stomach adenocarcinoma; Tgct – testicular germ cell tumor; Thca – thyroid carcinoma; Thym – thymoma; Ucec – uterine carcinoma; Uvm – uveal melanoma; Myel – myeloma; Neur – neuroblastoma; Wil – Wilms.