| Literature DB >> 29234122 |
Stephane Wenric1,2, Sonia ElGuendi1, Jean-Hubert Caberg3, Warda Bezzaou1, Corinne Fasquelle1, Benoit Charloteaux4, Latifa Karim4, Benoit Hennuy4, Pierre Frères2, Joëlle Collignon2, Meriem Boukerroucha2, Hélène Schroeder2, Fabrice Olivier2, Véronique Jossa5, Guy Jerusalem2, Claire Josse6,7, Vincent Bours1,3.
Abstract
Non-coding RNAs (ncRNA) represent 1/5 of the mammalian transcript number, and 90% of the genome length is transcribed. Many ncRNAs play a role in cancer. Among them, non-coding natural antisense transcripts (ncNAT) are RNA sequences that are complementary and overlapping to those of either protein-coding (PCT) or non-coding transcripts. Several ncNATs were described as regulating protein coding gene expression on the same loci, and they are expected to act more frequently in cis compared to other ncRNAs that commonly function in trans. In this work, 22 breast cancers expressing estrogen receptors and their paired adjacent non-malignant tissues were analyzed by strand-specific RNA sequencing. To highlight ncNATs potentially playing a role in protein coding gene regulations that occur in breast cancer, three different data analysis methods were used: differential expression analysis of ncNATs between tumor and non-malignant tissues, differential correlation analysis of paired ncNAT/PCT between tumor and non-malignant tissues, and ncNAT/PCT read count ratio variation between tumor and non-malignant tissues. Each of these methods yielded lists of ncNAT/PCT pairs that were enriched in survival-associated genes. This work highlights ncNAT lists that display potential to affect the expression of protein-coding genes involved in breast cancer pathology.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29234122 PMCID: PMC5727077 DOI: 10.1038/s41598-017-17811-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Study workflow. RNA and DNA were simultaneously extracted from 23 breast cancer ER+/HER2− tumors and their paired adjacent non-malignant tissues. Strand-specific paired-end RNA sequencing and comparative genomic hybridization (CGH) were performed. Quality control steps and RNA-Seq validation were performed and led to the elimination of one patient due to poor strand specificity in this sample. This strategy allowed the study of the differential expression of ncNATs and PCTs between tumors and non-malignant tissues and performance of differential correlation analysis of ncNAT/PCT pairs. Three lists of genes with deregulated ncNAT expression in tumors that could potentially affect the corresponding PC expression were extracted, and their coding genes were subjected to survival analysis with an external cohort (TCGA).
Patient clinicopathological characteristics.
|
|
|
|
|---|---|---|
| Age (years) | Median | 62.9 years |
| Range | 43–83 years | |
| Tumor size (mm) | >20 | N = 22 |
| <20 | N = 0 | |
| Ki 67 (%) | <19 | N = 11 |
| ≥19 | N = 10 | |
| Unknown | N = 1 | |
| Histology | IDC + DCIS | N = 7 |
| IDC | N = 15 | |
| Bloom grade | I | N = 4 |
| II | N = 9 | |
| III | N = 9 | |
| T (x to 4) | 1c | N = 10 |
| 2 | N = 11 | |
| 3 | N = 1 | |
| N (x to 3) | 0 | N = 13 |
| 1a | N = 5 | |
| 1c | N = 1 | |
| 2a | N = 2 | |
| 3a | N = 1 | |
| M (0 or 1) | 0 | N = 22 |
| 1 | N = 0 | |
| Molecular subtype | ER + /HER2- | N = 22 |
| Luminal A | N = 11 | |
| Luminal B | N = 11 | |
| Meantime follow-up | Month | 43.36 |
Distribution of the relative expression intensities of ncNATs and their corresponding PCTs among the 9632 ncNAT/PCT pairs. This study was focused on ncNAT/PCT pairs where both the PCTs and the NATs were expressed in at least 7 out of the 22 patients, both in the tumor and the non-malignant tissue. This group of 4884 gene pairs contains 60% of the total reads counts, and the ncNAT/PCT ratio expression is increased in tumors.
| Nber NAT/PC pairs | transcript non-null expression in more than 7/22 patients | sum of read counts | ||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| 4884 | yes | yes | yes | yes | 5.56E + 07 | 4.73E + 07 | 1.73E + 07 | 2.26E + 07 |
| 3282 | yes | yes | no | no | 3.51E + 07 | 2.89E + 07 | 1.62E + 04 | 2.49E + 04 |
| 944 | yes | yes | no | yes | 1.08E + 07 | 1.02E + 07 | 1.03E + 04 | 2.12E + 04 |
| 149 | yes | yes | yes | no | 2.06E + 06 | 1.17E + 06 | 2.61E + 03 | 2.06E + 03 |
| 149 | no | no | yes | yes | 4.81E + 02 | 9.21E + 02 | 7.11E + 04 | 6.73E + 04 |
| 89 | no | no | no | no | 2.51E + 02 | 3.85E + 02 | 5.65E + 02 | 8.20E + 02 |
| 50 | no | yes | yes | yes | 9.32E + 02 | 1.77E + 03 | 2.27E + 04 | 4.29E + 04 |
| 29 | no | yes | no | no | 5.16E + 02 | 9.14E + 02 | 1.35E + 02 | 2.42E + 02 |
| 28 | no | no | no | yes | 9.85E + 01 | 1.64E + 02 | 3.23E + 02 | 5.69E + 02 |
| 11 | yes | no | yes | yes | 2.42E + 02 | 2.38E + 02 | 2.02E + 03 | 2.32E + 03 |
| 7 | no | yes | no | yes | 9.24E + 01 | 1.93E + 02 | 9.25E + 01 | 1.88E + 02 |
| 5 | yes | no | no | no | 1.50E + 02 | 1.43E + 02 | 1.61E + 01 | 3.80E + 01 |
| 3 | no | no | yes | no | 3.26E + 01 | 2.27E + 01 | 4.59E + 01 | 8.58E + 01 |
| 1 | yes | no | yes | no | 5.67E + 01 | 1.53E + 01 | 1.00E + 01 | 1.48E + 01 |
| 1 | no | yes | yes | no | 4.84E + 00 | 8.32E + 00 | 1.53E + 01 | 2.86E + 01 |
| 0 | yes | no | no | yes | 0.00E + 00 | 0.00E + 00 | 0.00E + 00 | 0.00E + 00 |
| Total 9632 | NAT/PCT pairs where at least one patient has a non-null expression in one of those four transcript types | 1.04E + 08 | 8.75E + 07 | 1.74E + 07 | 2.28E + 07 | |||
| 2.31E + 08 | ||||||||
Figure 2(A). ncNAT and PCT transcript distributions in adjacent non-malignant healthy tissue and tumors. Density plot of log10 (mean counts) of the four transcript types (ncNAT tumor; ncNAT non-malignant, PCT tumor, PCT non-malignant). The upper panel contains the complete list of 9632 ncNAT/PCT pairs; the second panel contains ncNAT/PCT pairs in amplified genomic regions; the third panel contains ncNAT/PCT pairs in deleted genomic regions; and the last panel contains ncNAT/PCT pairs in copy-number-neutral genomic regions. The ncNAT distributions are clearly different in non-malignant and tumor tissues, with a higher number of ncNATs displaying reduced expression in the tumors than the number of ncNATs that are increased in the tumors. (B) Presents the same information as panel A, but in the highly expressed ncNAT subgroup of the 4882 ncNAT/PCT pairs. (C) Schematic representation of proportion of different classes of differential correlations between tumors and non-malignant tissues. Primarily positive correlations of expression between ncNAT and their corresponding PCT are created or lost in tumorous tissues. The numbers indicated in the graph are the number of ncNAT/PCT pairs in this category; +/+= significant positive correlation between ncNAT and PCT in adjacent non-malignant tissue and conserved in the tumor; +/−= significant positive correlation between ncNAT and PCT in the adjacent non-malignant tissue that becomes negative in the tumor; +/0= significant positive correlation between ncNAT and PCT in the adjacent non-malignant tissue that is lost in the tumor; −/−= significant negative correlation between ncNAT and PCT in adjacent non-malignant tissue that is conserved in the tumor; −/+= significant negative correlation between ncNAT and PCT exists in adjacent non-malignant tissue that becomes positive in the tumor; −/0= significant negative correlation between ncNAT and PCT in the adjacent non-malignant tissue that is lost in the tumor; 0/0 = no significant correlation between ncNAT and PCT exists in the adjacent non-malignant tissue or the tumor.
Examples of spearman correlations between ncNAT and cancer-associated coding genes that are altered in tumors when compared to adjacent non-malignant tissues.
| Coding Gene Name | ncNAT_ID_Ensembl | ncNAT-Name | ncNAT/PCT spearman correlation adj non-malignant tissue | ncNAT/PCT cor. p-val adj non-malignant tissue | ncNAT/PCT spearman correlation tumor tissue | ncNAT/PCT cor. p-val tumor tissue | zScore_DiffCor_NAT/PCT | pVal_DiffCor_NAT/PCT | Classes_DiffCor |
|---|---|---|---|---|---|---|---|---|---|
| ACSL6 | ENSG00000234758 | AC034228.4 | −0.353 | 1.07E-01 | 0.441 | 4.01E-02 | 2.59 | 9.48E-03 | 0/+ |
| AFF1 | ENSG00000235043 | TECRP1 | 0.762 | 3.74E-05 | 0.247 | 2.69E-01 | −2.31 | 2.08E-02 | +/0 |
| BRCA1 | ENSG00000240828 | RPL21P4 | 0.625 | 1.87E-03 | −0.041 | 8.57E-01 | −2.39 | 1.70E-02 | + /0 |
| CAMTA1 | ENSG00000225126 | RP4-549F15.1 | −0.093 | 6.82E-01 | 0.698 | 3.06E-04 | 2.95 | 3.22E-03 | 0/+ |
| CAMTA1 | ENSG00000269978 | RP11-338N10.3 | 0.817 | 3.41E-06 | 0.139 | 5.36E-01 | −3.11 | 1.87E-03 | +/0 |
| CDKN2B | ENSG00000240498 | CDKN2B-AS1 | −0.541 | 9.31E-03 | 0.269 | 2.27E-01 | 2.72 | 6.61E-03 | −/0 |
| CEBPA | ENSG00000267727 | CTD-2540B15.7 | 0.788 | 1.31E-05 | −0.013 | 9.55E-01 | −3.33 | 8.74E-04 | +/0 |
| CTCF | ENSG00000237718 | AC009095.4 | −0.249 | 2.63E-01 | 0.577 | 4.96E-03 | 2.81 | 4.93E-03 | 0/+ |
| GNA11 | ENSG00000267139 | AC005262.3 | −0.135 | 5.49E-01 | 0.483 | 2.28E-02 | 2.04 | 4.11E-02 | 0/+ |
| GNAS | ENSG00000235590 | GNAS-AS1 | −0.599 | 3.22E-03 | 0.261 | 2.41E-01 | 2.96 | 3.12E-03 | −/0 |
| HLF | ENSG00000263096 | RP11-515O17.2 | −0.494 | 1.95E-02 | 0.142 | 5.29E-01 | 2.11 | 3.50E-02 | −/0 |
| HNF1A | ENSG00000241388 | HNF1A-AS1 | 0.023 | 9.18E-01 | 0.668 | 6.87E-04 | 2.41 | 1.58E-02 | 0/+ |
| HOXC11 | ENSG00000228630 | HOTAIR | 0.078 | 7.29E-01 | 0.625 | 1.87E-03 | 2.02 | 4.36E-02 | 0/+ |
| MAP2K1 | ENSG00000269999 | CTD-3185P2.2 | −0.523 | 1.26E-02 | 0.099 | 6.62E-01 | 2.09 | 3.64E-02 | −/0 |
| MSH2 | ENSG00000236558 | AC138655.6 | 0.762 | 3.74E-05 | 0.156 | 4.88E-01 | −2.60 | 9.26E-03 | +/0 |
| MSH6 | ENSG00000224058 | AC006509.7 | 0.567 | 5.89E-03 | −0.100 | 6.58E-01 | −2.29 | 2.18E-02 | +/0 |
| MYH11 | ENSG00000263065 | AF001548.6 | 0.662 | 7.94E-04 | 0.030 | 8.95E-01 | −2.36 | 1.82E-02 | +/0 |
| PPP6C | ENSG00000232630 | PRPS1P2 | −0.187 | 4.05E-01 | 0.453 | 3.41E-02 | 2.09 | 3.66E-02 | 0/+ |
| RAP1GDS1 | ENSG00000214559 | RP11-323J4.1 | 0.159 | 4.78E-01 | 0.911 | 3.99E-09 | 4.23 | 2.38E-05 | 0/+ |
| UBR5 | ENSG00000246263 | KB-431C1.4 | 0.164 | 4.65E-01 | 0.813 | 4.16E-06 | 2.99 | 2.76E-03 | 0/+ |
| UBR5 | ENSG00000272037 | KB-431C1.5 | 0.235 | 2.91E-01 | 0.735 | 9.92E-05 | 2.15 | 3.13E-02 | 0/+ |
| WT1 | ENSG00000183242 | WT1-AS | 0.392 | 7.09E-02 | 0.885 | 4.46E-08 | 3.03 | 2.42E-03 | 0/+ |
| ZFHX3 | ENSG00000259901 | RP5-991G20.4 | −0.322 | 1.44E-01 | 0.503 | 1.71E-02 | 2.73 | 6.26E-03 | 0/+ |
PC genes exhibiting deregulated corresponding ncNAT expression in tumor are preferentially related to survival of breast cancer patients.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Nb genes in list | 441 | 738 | 610 | 540 | 582 | 582 |
| Nb genes also present in TCGA dataset | 440 | 729 | 604 | 533 | 582 | 582 |
| Nb genes w/ log-rank p-val <= 0.05 | 71 | 118 | 96 | 84 | 57.38 ± 7.32 | 41.48 ± 6.61 |
| Genes % w/ log-rank p-val <= 0.05 | 16.1% | 16.2% | 15.9% | 15.8% | 13.83 ± 1.78% | 12.27 ± 1.91% |
| Average log-rank p-val | 0.392 | 0.387 | 0.388 | 0.391 | 0.404 ± 0.0315 | 0.483 ± 0.357 |
Protein coding genes of the 3 gene lists DiffCor, ncNATDiffExp and VarRatio (left and right) were tested for association with survival by means of a TCGA RNA-Seq dataset of breast cancers. The percentage of genes associated with survival in each list has been compared with 2 control distributions containing randomly selected protein coding genes.