| Literature DB >> 27058892 |
Giulia Pintarelli1, Alice Dassano1, Chiara E Cotroneo1,2, Antonella Galvan3, Sara Noci1, Rocco Piazza4,5, Alessandra Pirola4, Roberta Spinelli6, Matteo Incarbone7, Alessandro Palleschi8, Lorenzo Rosso8, Luigi Santambrogio8, Tommaso A Dragani1, Francesca Colombo1.
Abstract
Read-through transcripts result from the continuous transcription of adjacent, similarly oriented genes, with the splicing out of the intergenic region. They have been found in several neoplastic and normal tissues, but their pathophysiological significance is unclear. We used high-throughput sequencing of cDNA fragments (RNA-Seq) to identify read-through transcripts in the non-involved lung tissue of 64 surgically treated lung adenocarcinoma patients. A total of 52 distinct read-through species was identified, with 24 patients having at least one read-through event, up to a maximum of 17 such transcripts in one patient. Sanger sequencing validated 28 of these transcripts and identified an additional 15, for a total of 43 distinct read-through events involving 35 gene pairs. Expression levels of 10 validated read-through transcripts were measured by quantitative PCR in pairs of matched non-involved lung tissue and lung adenocarcinoma tissue from 45 patients. Higher expression levels were observed in normal lung tissue than in the tumor counterpart, with median relative quantification ratios between normal and tumor varying from 1.90 to 7.78; the difference was statistically significant (P < 0.001, Wilcoxon's signed-rank test for paired samples) for eight transcripts: ELAVL1-TIMM44, FAM162B-ZUFSP, IFNAR2-IL10RB, INMT-FAM188B, KIAA1841-C2orf74, NFATC3-PLA2G15, SIRPB1-SIRPD, and SHANK3-ACR. This report documents the presence of read-through transcripts in apparently normal lung tissue, with inter-individual differences in patterns and abundance. It also shows their down-regulation in tumors, suggesting that these chimeric transcripts may function as tumor suppressors in lung tissue.Entities:
Keywords: RNA-Seq; conjoined genes; gene fusion; lung adenocarcinoma; read-through transcripts
Mesh:
Substances:
Year: 2016 PMID: 27058892 PMCID: PMC5053695 DOI: 10.18632/oncotarget.8556
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Clinical characteristics of the 64 lung adenocarcinoma patients from whom total RNA from resected non-tumor lung tissue was analyzed by RNA-Seq and of 45 lung adenocarcinoma patients from whom total RNA from non-tumor and tumor lung tissue was analyzed by qPCR (these 45 patients included 6 patients analyzed by RNA-Seq)
| RNA-Seq | qPCR | |
|---|---|---|
| Sex, | ||
| Men | 50 (78.1) | 34 (75.6) |
| Women | 14 (21.9) | 11 (24.4) |
| Age at surgery, years, mean (SD) | 63.8 (7.3) | 62.6 (10.7) |
| Smoking status, | ||
| Smoker | 61 | 31 |
| Non-smoker | 1 | 11 |
| Missing | 2 | 3 |
| Stage, | ||
| I | 55 | 18 |
| II | 3 | 8 |
| III or IV | 5 | 13 |
| Missing | 1 | 6 |
Read-through transcripts identified by RNA-Seq analysis
| Read-through name | Chr. | First transcript (exon) | Second transcript (exon) | Intergenic region (bp) | N. patients | PCR validated | Maintained frame |
|---|---|---|---|---|---|---|---|
| AGRP-ATP6V0D1 | 16 | NM_001138 (3) | NM_004691 (2) | 1,334 | 1 | Yes | No |
| ARF3-FKBP11 | 12 | NM_001659 (4) | NM_001143781 (4) | 9,249 | 1 | Different | See Table |
| ARPC4-TTLL3 | 3 | NM_001024959 (5) | NM_001025930 (2) | 360 | 10 | Yes | No |
| CD59-C11orf91 | 11 | NM_203331 (4) | NM_001166692 (2) | NA | 2 | Different | See Table |
| CHIA-PIFO | 1 | NM_021797 (8) | NM_181643 (2) | 25,722 | 1 | Yes | Yes |
| CTSC-RAB38 | 11 | NM_001814 (5) | NM_022337 (2) | 118,125 | 2 | Yes | Yes |
| NM_001814 (4) | NM_022337 (2) | 1 | No | − | |||
| CTSD-IFITM10 | 11 | NM_001909 (8) | NM_001170820 (2) | 2,161 | 3 | Yes | Yes |
| NM_001909 (7) | NM_001170820 (2) | 2 | No | − | |||
| ELAVL1-TIMM44 | 19 | NM_001419 (4) | NM_006351 (3) | 14,658 | 2 | Different | See Table |
| FAM162B-ZUFSP | 6 | NM_001085480 (3) | NM_145062 (2) | 83,406 | 1 | Yes | No |
| FARSA-SYCE2 | 19 | NM_004461 (12) | NM_001105578 (3) | 3,203 | 1 | Yes | Yes |
| HARS2-ZMAT2 | 5 | NM_012208 (13) | NM_144723 (2) | NA | 1 | Different | See Table |
| HSD17B11-HSD17B13 | 4 | NM_016245 (5) | NM_178135 (2) | 13,704 | 1 | Different | See Table |
| IFNAR2-IL10RB | 21 | NM_207584 (7) | NM_000628 (2) | 683 | 1 | Yes | Yes |
| NM_207584 (7) | NM_000628 (3) | 1 | Yes | No | |||
| INMT-FAM188B | 7 | NM_001199219 (2) | NM_032222 (2) | 13,815 | 4 | Yes | No |
| NM_001199219 (2) | NM_032222 (3) | 7 | No | − | |||
| NM_001199219 (3) | NM_032222 (3) | 1 | No | − | |||
| KIAA1841-C2orf74 | 2 | NM_032506 (21) | NM_001143960 (2) | NA | 1 | Yes | No |
| LMAN2-MXD3 | 5 | NM_006816 (7) | NM_001142935 (4) | 18,805 | 1 | Yes | Yes |
| MBD1-CFAP53 | 18 | NM_001204139 (14) | NM_145020 (2) | 360 | 1 | Yes | No |
| MED22-SURF6 | 9 | NM_133640 (4) | NM_006753 (2) | 1,925 | 1 | Yes | No |
| MRPS17-GBAS | 7 | NM_015969 (1) | NM_001202469 (3) | NA | 1 | No | − |
| NAA60-CLUAP1 | 16 | NM_001083601 (1) | NM_015041 (2) | 13,961 | 1 | Different | See Table |
| NDUFB8-SEC31B | 10 | NM_005004 (4) | NM_015490 (2) | NA | 1 | Yes | No |
| NFATC3-PLA2G15 | 16 | NM_173165 (9) | NM_012320 (2) | 16,045 | 2 | Yes | Yes |
| NKX2-1-SFTA3 | 14 | NM_001079668 (2) | NM_001101341 (4) | 2,568 | 5 | Yes | No |
| NM_001079668 (2) | NM_001101341 (2) | 2 | Yes | No | |||
| NM_001079668 (1) | NM_001101341 (2) | 1 | Yes | No | |||
| NM_001079668 (1) | NM_001101341 (4) | 3 | Different | See Table | |||
| NM_001079668 (1) | NM_001101341 (3) | 5 | No | − | |||
| PACSIN2-ARFGAP3 | 22 | NM_001184970 (10) | NM_001142293 (3) | NA | 1 | Yes | No |
| NM_001184970 (10) | NM_001142293 (4) | 1 | Different | See Table | |||
| PIR-FIGF | X | NM_001018109 (9) | NM_004469 (2) | 423 | 1 | Yes | No |
| PLEKHO2-ANKDD1A | 15 | NM_001195059 (2) | NM_182703 (5) | 43,895 | 1 | Yes | Yes |
| NM_001195059 (4) | NM_182703 (5) | 1 | Different | See Table | |||
| PPRC1-NOLC1 | 10 | NM_015062 (13) | NM_004741 (4) | 1,851 | 1 | Different | See Table |
| SCNN1A-TNFRSF1A | 12 | NM_001159575 (12) | NM_001065 (2) | 4,729 | 2 | Yes | Yes |
| NM_001159575 (12) | NM_001065 (3) | 2 | No | − | |||
| NM_001159575 (13) | NM_001065 (2) | 1 | No | - | |||
| NM_001159575 (10) | NM_001065 (2) | 1 | No | - | |||
| NM_001159575 (9) | NM_001065 (2) | 1 | No | - | |||
| SFTPC-BMP1 | 8 | NM_003018 (3) | NM_001199 (2) | 257 | 2 | Yes | No |
| SHANK3-ACR | 22 | NM_033517 (22) | NM_001097 (2) | 4,898 | 1 | Yes | Yes |
| SIRPB1-SIRPD | 20 | NM_001083910 (2) | NM_178460 (2) | 4,678 | 1 | Yes | Yes |
| SNTB2-VPS4A | 16 | NM_006750 (6) | NM_013245 (3) | 2,304 | 1 | Different | See Table |
| SPECC1L-ADORA2A | 22 | NM_015330 (13) | NM_000675 (3) | 139 | 1 | Different | See Table |
| TSTD1-F11R | 1 | NM_001113205 (1) | NM_016946 (3) | 16,283 | 2 | Different | See Table |
| VAMP8-VAMP5 | 2 | NM_003761 (2) | NM_006634 (2) | 2,377 | 5 | Yes | Yes |
| ZDHHC1-TPPP3 | 16 | NM_013304 (10) | NM_016140 (3) | 884 | 1 | Yes | No |
| ZNF343-SNRPB | 20 | NM_024325 (6) | NM_198216 (2) | 10,964 | 1 | No | - |
Gene1-Gene2 symbol.
Chr.: chromosome.
Exons joined by the read-through event.
Number of patients in which the read-through transcript was identified by bioinformatic analysis of RNA-Seq data.
NA: not available; there is no intergenic distance since genes are overlapping.
Identification of different read-through fusion points by Sanger-sequencing (see Table 3).
No sequence amplified at PCR
Read-through transcripts identified by Sanger sequencing of PCR amplicons
| Read-through name | First transcript (exon) | Second transcript (exon) | Maintained frame |
|---|---|---|---|
| ARF3-FKBP11 | NM_001659 (4) | NM_001143781 (2) | No |
| NM_001659 (4) | NM_001143781 (3) | No | |
| CD59-C11orf91 | NM_203331 (4) | NM_001166692 (2) | No |
| ELAVL1-TIMM44 | NM_001419 (4) | NM_006351 (2) | No |
| HARS2-ZMAT2 | NM_012208 (13) | NM_144723 (2) | No |
| HSD17B11-HSD17B13 | NM_016245 (6) | NM_178135 (2) | No |
| NAA60-CLUAP1 | NM_001083601 (2) | NM_015041 (2) | No |
| NKX2-1-SFTA3 | NM_001079668 (1) | NM_001101341 (2) | No |
| PACSIN2-ARFGAP3 | NM_001184970 (10) | NM_001142293 (2) | No |
| PLEKHO2-ANKDD1A | NM_001195059 (5) | NM_182703 (4) | Yes |
| NM_001195059 (5) | NM_182703 (5) | Yes | |
| PPRC1-NOLC1 | NM_015062 (13) | NM_004741 (2) | No |
| SNTB2-VPS4A | NM_006750 (6) | NM_013245 (2) | Yes |
| SPECC1L-ADORA2A | NM_015330 (14) | NM_000675 (3) | Yes |
| TSTD1-F11R | NM_001113205 (1) | NM_016946 (2) | Yes |
Gene1-Gene2 symbol.
Exons joined by the read-through event.
A portion of intragenic region is included in the read-through transcript.
Incomplete exon.
Figure 1Relative quantification (RQ) of read-through transcripts, in pairs of matched non-involved and tumor tissue from lung adenocarcinoma patients, reveals that read-through transcripts were expressed at lower levels in tumor tissue (T) than in the normal counterpart (N); n = number of normal-tumor tissue pairs examined
The line within each box represents the median RQ; upper and lower edges of each box are 75th and 25th percentiles, respectively; top and bottom whiskers indicate the greatest and least RQ values, respectively. ***P < 0.001, paired Wilcoxon's signed-rank test for paired samples.
Influence of clinical characteristics on read-through transcript levels in lung adenocarcinoma tissue versus non-involved lung tissue, determined by testing in a multivariate linear model, using disease stage, smoking habit and sex as covariates
| Read-through | Variable/Covariate | Coefficient estimate | SE | |
|---|---|---|---|---|
| Tissue | −11.88 | 4.85 | 0.0184 | |
| Tissue | −13.58 | 4.44 | 0.004 | |
| Tissue | −18.69 | 1.87 | 1.45 × 10−10 | |
| Sex | −4.982 | 2.00 | 0.019 | |
| Stage | 6.945 | 1.96 | 0.001 | |
| Tissue | −10.25 | 4.33 | 0.024 | |
| Tissue | −20.33 | 3.65 | 2.43 × 10−6 | |
| Tissue | −16.50 | 3.76 | 1.22 × 10−4 | |
| Tissue | −15.23 | 4.40 | 0.001 | |
| Sex | −11.23 | 5.28 | 0.040 | |
| Tissue | −25.61 | 3.95 | 4.27 × 10−8 | |
| Tissue | −18.94 | 3.76 | 2.29 × 10−5 |
Quantitative levels were rank-transformed to improve the normality distribution of values.
Tumor tissue as compared to non-involved tissue; males as compared to females; ever smoker as compared to never smoker; stage > I as compared to stage = I.
SE, standard error.
P values are reported only if P < 0.05; CTSC-RAB38 showed P > 0.05 for all variables.