| Literature DB >> 26813108 |
Cheng Wang1,2, Yayun Gu1,2, Kai Zhang1,2, Kaipeng Xie1,2, Meng Zhu1,2, Ningbin Dai1,2, Yue Jiang1,2, Xuejiang Guo1, Mingxi Liu1, Juncheng Dai1,2, Linxiang Wu3, Guangfu Jin1,2, Hongxia Ma1,2, Tao Jiang1,2, Rong Yin4, Yankai Xia5, Li Liu6, Shouyu Wang5, Bin Shen1, Ran Huo1, Qianghu Wang3, Lin Xu4, Liuqing Yang7, Xingxu Huang8, Hongbing Shen1,2, Jiahao Sha1, Zhibin Hu1,2.
Abstract
Cancer-testis (CT) genes represent the similarity between the processes of spermatogenesis and tumorigenesis. It is possible that their selective expression pattern can help identify driver genes in cancer. In this study, we integrate transcriptomics data from multiple databases and systematically identify 876 new CT genes in 19 cancer types. We explore their relationship with testis-specific regulatory elements. We propose that extremely highly expressed CT genes (EECTGs) are potential drivers activated through epigenetic mechanisms. We find mutually exclusive associations between EECTGs and somatic mutations in mutated genes, such as PIK3CA in breast cancer. We also provide evidence that promoter demethylation and close non-coding RNAs (namely, CT-ncRNAs) may be two mechanisms to reactivate EECTG gene expression. We show that the meiosis-related EECTG (MEIOB) and its nearby CT-ncRNA have a role in tumorigenesis in lung adenocarcinoma. Our findings provide methods for identifying epigenetic-driver genes of cancer, which could serve as targets of future cancer therapies.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26813108 PMCID: PMC4737856 DOI: 10.1038/ncomms10499
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Classification of all genes and known CT genes.
(a) All 50,016 genes were classified into six groups according to their mRNA expression patterns and classified into three groups according to their protein expression patterns. The pie chart displays the mRNA expression classification in the outer circle and the protein expression classification in the inner circle. (b) Pie chart summary of the classification of known CT genes. mRNA expression and protein abundance of known CT genes. The genes in the y axis were ranked by the SPM values from the GTEx database in descending order. The expression of each gene was rescaled to the interval [0, 1]. The depth of colour indicates the expression level. Black indicates testis and red indicates other normal tissues. The colour of each category corresponds with the colours in a,b. Genes were classified into the following six categories based on specificity measure (SPM) values: (C1) high-confidence testis-specific coding genes: GENCODE-annotated protein-coding genes (v19) with (a) SPMGTEx >0.9, SPMHBM >0.9 and SPMNJMU >0.9; or (b) Known CT and SPMGTEx=0, SPMHBM >0.9, SPMNJMU >0.9 and gene copies with identical sequences; (C2) high-confidence testis-specific non-coding genes: GENCODE-annotated non-coding genes (v19) with SPMGTEx >0.9, SPMHBM >0.9 and SPMNJMU >0.9; (C3) moderate-confidence testis-specific coding genes: GENCODE-annotated protein-coding genes (v19) with SPMGTEx >0.9 and either SPMHBM >0.9 or SPMNJMU >0.9; (C4) moderate-confidence testis-specific non-coding genes: GENCODE-annotated non-coding genes (v19) with SPMGTEx>0.9 and either SPMHBM >0.9 or SPMNJMU >0.9; (C5) low-confidence testis-specific genes: genes with SPMGTEx >0.9 but SPMHBM ≤0.9 and SPMNJMU ≤0.9; (C6) non-gene-level testis-specific gene: genes with SPMGTEx ≤0.9. G6 genes were then classified into the following two sub-groups using transcript abundance data from GTEx; (C6a) genes with testis-specific transcripts: C6 with SPMGTEx transcript>0.9; (C6b) genes without testis-specific transcripts: C6 with SPMGTEx transcript ≤0.9. (c) Complete expression patterns of known CT genes in the GTEx, HBM and NJMU studies.
Figure 2Enrichment analysis of testis-specific regulatory elements (TSREs).
Enrichment analysis was used to evaluate the relationship between TSREs and CT genes. Four types of regulatory elements (promoters, methylation sites, non-coding RNAs and enhancers) were included in the analysis. The error bar represents the ER confidence interval.
Figure 3General description of EECTPs in 19 cancer types.
(a) Enrichment analysis suggests that EE patterns were more likely to emerge in TSGs and TSPs, Fisher's exact test was applied to evaluate the enrichment ratio. (b) Number of activated EECTPs in 19 cancer types. The box plot displays the first and third quartiles (top and bottom of the boxes), the median (band inside the boxes), and the lowest and highest point within 1.5 times the interquartile range of the lower and higher quartile (whiskers).
Figure 4The association between the activation pattern of EECTPs and the alterations of the driver genes in patients with BRCA and papillary thyroid carcinoma.
(a) Mutually exclusive pattern of EECTP activation and PIK3CA mutation in breast cancer. (b) Mutually exclusive pattern of EECTP activation and PIK3CA mutation in different PAM50 subtypes. (c) The total of number of activated EECTPs is significantly higher in papillary thyroid carcinoma patients without clear driver alterations (mutations and fusions). The box plot displays the first and third quartiles (top and bottom of the boxes), the median (band inside the boxes), and the lowest and highest point within 1.5 times the interquartile range of the lower and higher quartile (whiskers). (d) The activation of MEIOB is restricted in papillary thyroid carcinoma patients without driver mutations or fusions but co-occur with arm copy number drivers.
Figure 5Negative correlation between the average promoter methylation level of EECTPs and the number of activated EECTPs.
The depth of colour indicates the genome-wide promoter methylation level. The blue line is the best-fit linear regression and the shaded area indicates the confidence interval.
Figure 6MEIOB and SPATA22 co-expression patterns in testis and tumour samples and the association between MEIOB activation and SAi.
(a) MEIOB and SPATA22 are co-expressed in testis. (b) MEIOB and SPATA22 are exclusively activated in LUAD samples. (c) Activation of MEIOB is associated with higher SAi. The box plot displays the first and third quartiles (top and bottom of the boxes), the median (band inside the boxes), and the lowest and highest point within 1.5 times the interquartile range of the lower and higher quartile (whiskers).
Figure 7MEIOB/LINC00254 drive lung cancer cell growth, migration and invasion in vitro.
(a) Expression of MEIOB/LINC00254 in the GTEx data sets. (b) Expression of MEIOB/LINC00254 in our 24 tumour/normal paired samples. (c) Overexpression of MEIOB effects A549 vitality. (d) Knockout of MEIOB effects A549 vitality. (e) Overexpression of LINC00254 effects A549 vitality. Relative expression of LINC00254 and MEIOB (left) and a growth curve (middle) of different treated A549 cells. The fold change is relative to the control of colony formation; EdU staining, migration and invasion are shown together (right). (f) Cell cycle distribution in the A549 control and MEIOB knockout cells. (g) Cell cycle distribution in the A549 control and MEIOB overexpressing cells. Error bars represent s.e.m, n=5. *P<0.05 compared with the vector control. **P<0.001 compared with the vector control. All of the experiments were repeated three times.