| Literature DB >> 31992760 |
Bharati Mehani1,2, Kiran Narta1,2, Deepanjan Paul1,2, Anurag Raj2,3, Deepak Kumar1,4, Anchal Sharma1,2, Lalit Kaurani1, Subhashree Nayak5, Debasis Dash2,3, Ashish Suri6, Chitra Sarkar5, Arijit Mukhopadhyay7,8.
Abstract
Fusion transcripts can contribute to diversity of molecular networks in the human cortex. In this study, we explored the occurrence of fusion transcripts in normal human cortex along with single neurons and astrocytes. We identified 1305 non-redundant fusion events from 388 transcriptomes representing 59 human cortices and 329 single cells. Our results indicate while the majority of fusion transcripts in human cortex are intra-chromosomal (85%), events found in single neurons and astrocytes were primarily inter-chromosomal (80%). The number of fusions in single neurons was significantly higher than that in single astrocytes (p < 0.05), indicating fusion as a possible contributor towards transcriptome diversity in neuronal cells. The identified fusions were largely private and 4 specific recurring events were found both in cortex and in single neurons but not in astrocytes. We found a significant increase in the number of fusion transcripts in human brain with increasing age both in single cells and whole cortex (p < 0.0005 and < 0.005, respectively). This is likely one of the many possible contributors for the inherent plasticity of the adult brain. The fusion transcripts in fetal brain were enriched for genes for long-term depression; while those in adult brain involved genes enriched for long-term potentiation pathways. Our findings demonstrate fusion transcripts are naturally occurring phenomenon spanning across the health-disease continuum, and likely contribute to the diverse molecular network of human brain.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31992760 PMCID: PMC6987184 DOI: 10.1038/s41598-020-58165-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Fusion transcripts identified in human cortex. (a) Circos plot representing 38 unique fusions, where periphery of the plot represents different chromosomes and links denote fusion events. Red links are inter-chromosomal event involving genes from two different chromosomes and links that are in green are intra-chromosomal fusion between genes from same chromosome. (b) Percentage of fusion breakpoints containing exon boundaries from one, both or neither of their parent genes. (c) Percentages of fusion transcripts that use canonical splice site signature GT-AG, CT-AC, or another donor-acceptor sequential preference in brain tissues. (d) Preferred sequences around fusion breakpoint matching with the signature for canonical splicing.
Fusion transcripts identified in 59 brain samples. The asterisk (*) represents events retaining exon boundaries from both of its partner genes.
| Gene left | Gene right | Chr left | Cord left | Chr right | Cord right | Recurrence | Dinucleotide preference | Left exon intact | Right exon intact |
|---|---|---|---|---|---|---|---|---|---|
| KANSL1* | ARL17A* | chr17 | 44171925 | chr17 | 44430295 | 10 | GT-AG | yes | yes |
| KANSL1* | ARL17A* | chr17 | 44171925 | chr17 | 44648234 | 9 | GT-AG | yes | yes |
| ENSG00000212127* | PRH1-PRR4* | chr12 | 11126253 | chr12 | 11001005 | 5 | GT-AG | yes | yes |
| ENSG00000227733* | HYDIN* | chr1 | 146126403 | chr16 | 71196632 | 4 | GT-AG | yes | yes |
| FAM78B | ENSG00000229588 | chr1 | 166135290 | chr1 | 166304564 | 4 | GT-AG | no | yes |
| FLJ39739 | NBPF9 | chr1 | 147917467 | chr1 | 144676872 | 3 | CC-TA | no | no |
| CCDC7* | C10orf68* | chr10 | 32854485 | chr10 | 32873231 | 3 | CT-AC | yes | yes |
| ABLIM1 | ENSG00000228484 | chr10 | 116361719 | chr10 | 116527819 | 3 | CT-AC | yes | no |
| LOC100129961* | ENSG00000224043* | chr2 | 135635089 | chr2 | 135493150 | 3 | GT-AG | yes | yes |
| ENSG00000225065* | NCOA6* | chr20 | 33302393 | chr20 | 33303168 | 3 | GT-AG | yes | yes |
| ENSG00000243795 | C3orf17 | chr3 | 112862700 | chr3 | 112738552 | 3 | CT-AC | no | yes |
| ENSG00000236537* | TULP4* | chr6 | 158733082 | chr6 | 158735299 | 3 | CT-AC | yes | yes |
| LOC729852 | ENSG00000233108 | chr7 | 7841373 | chr7 | 8007329 | 3 | GT-AG | yes | no |
| ENSG00000218328 | KAZN | chr1 | 14507086 | chr1 | 14925478 | 2 | GT-AG | yes | no |
| PDE4DIP | FLJ39739 | chr1 | 145013720 | chr1 | 147931723 | 2 | CT-AC | no | yes |
| CCDC7* | C10orf68* | chr10 | 32832227 | chr10 | 32873231 | 2 | CT-AC | yes | yes |
| HSPA12A | ENO4 | chr10 | 118466795 | chr10 | 118609076 | 2 | CT-AC | yes | no |
| ENSG00000231121* | NAV3* | chr12 | 77966045 | chr12 | 78334098 | 2 | GT-AG | yes | yes |
| ENSG00000230021 | PCBD2 | chr1 | 564469 | chr5 | 134261453 | 1 | AC-TG | no | no |
| ENSG00000198744 | DHFR | chr1 | 570067 | chr5 | 79946838 | 1 | GC-CA | no | no |
| MTOR | UBIAD1 | chr1 | 11316632 | chr1 | 11334116 | 1 | CT-AC | no | yes |
| ENSG00000231485* | JAK1* | chr1 | 65532310 | chr1 | 65352023 | 1 | GT-AG | yes | yes |
| PFKFB3* | LOC399715* | chr10 | 6268327 | chr10 | 6368508 | 1 | GT-AG | yes | yes |
| ENSG00000249456 | ZRANB1 | chr10 | 126628942 | chr10 | 126631875 | 1 | CT-AC | no | yes |
| ALG1L9P | FAM66C | chr11 | 71518527 | chr12 | 8346798 | 1 | GT-AG | no | yes |
| OPCML* | NTM* | chr11 | 132812820 | chr11 | 132081915 | 1 | GT-AG | yes | yes |
| ENSG00000245482 | ALG10B | chr12 | 34185072 | chr12 | 38720271 | 1 | CT-AA | no | no |
| ACAD10 | MAPKAPK5 | chr12 | 112182444 | chr12 | 112308981 | 1 | AA-CT | yes | yes |
| PSMC1 | ENSG00000205533 | chr14 | 90738533 | chr14 | 90739625 | 1 | AG-CC | no | no |
| ENSG00000186031* | FMN1* | chr15 | 33445248 | chr15 | 33300275 | 1 | GT-AG | yes | yes |
| LRRK1 | CHSY1 | chr15 | 101590983 | chr15 | 101775286 | 1 | CT-AC | no | yes |
| THRB | THRB-AS1 | chr3 | 24378861 | chr3 | 24536623 | 1 | CT-AC | yes | no |
| ENSG00000249598 | SDHAP1 | chr3 | 195685882 | chr3 | 195686956 | 1 | GT-AG | no | yes |
| ENSG00000250859 | HNRNPK | chr5 | 126847434 | chr9 | 86585718 | 1 | AT-AG | no | yes |
| ENSG00000236537* | TULP4* | chr6 | 158703294 | chr6 | 158735299 | 1 | CT-AC | yes | yes |
| CPQ | TSPYL5 | chr8 | 97919555 | chr8 | 98287867 | 1 | CT-AC | no | no |
| CLIC2 | ENSG00000224216 | chrX | 154563678 | chrX | 154564556 | 1 | TA-GG | yes | yes |
| TTTY14 | NCRNA00185 | chrY | 21239153 | chrY | 21039090 | 1 | GT-AG | no | no |
Figure 2Fusion specific PCR product was run on 2% agarose gel and further confirmed by using Sanger sequencing. Validation of (a) KANSL1-ARL17 fusion was validated in FC171, FC151 & FC156 with a product size of 239 bp; (b) C3ORF17-RP11-572M11.3 fusion in FC171 & FC202 with a product size of 227 bp and (c) mTOR-UBIAD1 fusion in FC171 with a product size of 236 bp. Strategy used to design the fusion specific primer is depicted in the Supplementary Fig. S7.
Figure 3Fusion transcripts identified in single cells from adult human brain. (a) Circos plot represents 911 unique fusions and signifies 82% (749/911) of the identified events are Inter-chromosomal. Here, periphery of the plot represents different chromosomes and links denote fusion events. Red links are inter-chromosomal event involving genes from two different chromosomes and links that are in green are intra-chromosomal fusion between genes from same chromosome. (b) Neurons (blue) harbor significantly higher number of fusions compared to astrocytes (red) (one tailed Wilcoxon p value < 10−3 marked by **), where boxplot represents number of fusion transcripts identified in each cell. Horizontal axis denotes brain cells while vertical axis represents number of fusion transcripts normalized with total data generated in that cell.
Figure 4Boxplots represent fusion transcripts are enriched in adult brain compared to the fetal brain. (a) In case of single cells, adult neurons (red) harbor significantly higher number of fusions with respect to fetal neurons (blue) (one tailed Wilcoxon p value < 10−4 is marked by ***). (b) Similarly, bulk tissue from adult brain (red) also demonstrated higher number of fusions compared to fetal (blue) (one tailed Wilcoxon p value < 10−3 is marked by **) where horizontal axis denotes brain samples while vertical axis represents number of fusion transcripts normalized with total data generated in a samples. Pathway enrichment analysis of genes harboring fusions has revealed enrichment for genes implicated in processes involved in synaptic plasticity. In the figure the horizontal axis shows the negative logarithm of FDR corrected p-value while vertical axis has different biological processes. Top biological processes that were significantly enriched in adult brain (c) and fetal brain (d) (both from tissue and single cells).
Table represents putative translated junction for identified fusions a reading frame of their parent genes. Highlighted events are identified in multiple proteomics samples.
| GeneA | GeneB | Putative translated fusion peptide | Junction peptide from MS data | GeneA and its reading frame | GeneB and its reading frame |
|---|---|---|---|---|---|
| GDI2 | FAM208B | IIVQNGKVIGVKSEGENLLRKGGHTEIEPQHF | VIGVKSEGENLLR | intact | intact |
| ENSG00000225302 | HSPA12A | AKNKMKCDSRWEIAASETAPTSAYSSPARSLGD | WEIAASETAPTSAYSSPAR | intact | intact |
| ENSG00000212127 | PRH1-PRR4 | LPAGTCCIYSRVEVFTDVNYEDFTFTIPGKSQ* | LPAGTCCIYSRVEVFTDVNYEDFTFTIPGK | intact | intact |
| MAP1B | DDX3× | EKIERTTKSPSDSGYSYRPCVVYGGADIGQQIR | SPSDSGYSYRPCVVYGGADIGQQIR | intact | intact |
| YLPM1 | ZNF207 | MVPPYQGGPPRPPMGMRPPPPLPPPPPVIKPQT | PPYQGGPPRPPMGMRPPPPLPPPPPVIKPQT | intact | intact |
| KLHL8 | SMARCAD1 | LLRFYENGELCDVTLKLIESTSTMDGAIAAAL | FYENGELCDVTLKLIESTSTMDGAIAAAL | intact | intact |
| SLC7A5P2 | PTPRD | CPVPEEAAKLVACHSVPPPRFTRTPVDQTGVSG | LVACHSVPPPRFTR | intact | intact |
| CSAD | ARHGEF12 | CA*WKEMSIPLKSSFLLWQDLICRMAASVKEQS | SSFLLWQDLICR | intact | no |
| GABRB2 | ZNF451 | MNIDIASIDMVSEVNMNVTVMITWVPKILQ*E | NIDIASIDMVSEVNMNVTVMITWVPK | intact | no |
| PPP2R4 | NPAP1 | IDTSDMNTTPPSKTVILQSGQGLRLALV*ESCF | IDTSDMNTTPPSKTVILQSGQGLR | intact | no |
| ACAD10 | MAPKAPK5 | QGDLMTPQFTPYYVAPQGKQAPHMRNKLES*PN | QGDLMTPQFTPYYVAPQGKQAPHMR | intact | no |
| EIF4G3 | FRMPD2 | RSILNKLTPQMFNQLMKHLPGARHYSRPPSMLR | LTPQMFNQLMKHLPGAR | intact | no |
| OPCML | NTM | KAMDNVTVRQGESATLSISQNCRDFFRYLH**R | QGESATLSISQNCRDFFR | intact | no |
| MALAT1 | MLL2 | RCEPPRLAGSPFFLTPTNLPSTISPGLGSLPSK | CEPPRLAGSPFFLTPTNLPSTISPGLGSLPSK | intact | no |
| WDR6 | CBFA2T3 | SGPGGVVACLEISAAPSGICVCGVARGLASVRV | SGPGGVVACLEISAAPSGICVCGVARGLASVR | intact | no |
| WDR48 | AATF | LFKDKGGPEFSSALKNMNLGGLLLQALLEYWPR | GGPEFSSALKNMNLGGLLLQALLEYWPR | intact | no |
| GRAMD2 | HSP90AA1 | PEGLKGEEIKKCGREGVNLWVVEQKLSYT*KKT | CGREGVNLWVVEQK | intact | no |
| SYNE1 | SSBP1 | PPKEPMDMEAQLMDCQASETWHINM*KRGKLKR | PPKEPMDMEAQLMDCQASETWHINM | intact | no |
| NUMB | ARHGAP24 | YLPGLSKPLPYCEELFYILFSMQVKTHNIDFIN | YLPGLSKPLPYCEELFYILFSMQVKTHNIDFIN | intact | no |
| SH3PXD2A | BCLAF1 | MGRASHLVHDMQRLPEDQEALDYFSDKESGKQK | ASHLVHDMQRLPEDQEALDYFSDK | no | intact |
| ZNF292 | ARNT2 | SQGLSIQSLRNTIGLLIHIFNKHNDKHKAHLIR | SQGLSIQSLRNTIGLLIHIFNK | no | intact |
| NCKAP1 | KIAA1109 | RSIVGMTMYNQATQEIALAADHHSKHEAQRHFL | SIVGMTMYNQATQEIALAADHHSK | no | intact |
| USP11 | ATL2 | ARVGENVHCGPAKAGENYEDDDLVNSDEVMKK | VGENVHCGPAKAGENYEDDDLVNSDEVMK | no | intact |
| DPP6 | MLLT3 | RRRTWTVSILAWLCTPEKPSKDSREHKSAFKE | TWTVSILAWLCTPEKPSKDSR | no | intact |
| USP34 | DDHD1 | MRKCVVQLCQGGWLIGQKMDQGRIIKNTAM*VL | CVVQLCQGGWLIGQKMDQGR | no | intact |
| FAM185A | FBXL13 | HCQKNHTAKTSPNSWPTGNITLQSKMGNITVGM | TSPNSWPTGNITLQSKMGNITVGM | no | intact |
| NMD3 | MKLN1 | ADDYNCKQC*RLALGSEDGLDFYYSSKQHAQKM | LALGSEDGLDFYYSSK | no | intact |
| ENSG00000178440 | TIMM23 | TGEVLRRSYTKGSKVLDTDEFILPTGANKTRG | GSKVLDTDEFILPTGANK | no | intact |
| ENSG00000147421 | PCDH9 | QVMKWKASLTAVISSDTLISHPLPLVQPQDEF | ASLTAVISSDTLISHPLPLVQPQDEF | no | intact |
| GRIA4 | ENSG00000234873 | LDQ*QKQVTHQGTGQPWSWSWLVLLCPPPRGE | QVTHQGTGQPWSWSWLVLLCPPPRGE | no | intact |
| FAM226B | NAP1L2 | SSLFSSPVASMSSSSSSSFTATERNWGGCHSLW | SSLFSSPVASMSSSSSSSFTATER | no | intact |
Figure 5Spatial proximity might be a possible trigger for fusion transcripts. (a) Chromatin interaction map identified in 10 human cell lines. Horizontal axis in the plot represents different cell lines and vertical axis represents number of chromatin interaction identified in each cell line at 40 Kb resolutions. Blue bars represent raw numbers of interactions while that are in red are significant interaction identified with Z score > 1.96 in each cell line. (b) Chromosome pairs assisting chromatin interactions and their numbers. Horizontal axis represents different chromosome pairs and vertical axis represents number of trans-interactions identified.
Fusion partners that are also in physical proximity.
| Fusion_Pair | Fusion breakpoint1 | Fusion breakpoint2 | Chromatin Interacting locus1 | Chromatin Interacting locus2 | Chromatin Interaction identified in cell line |
|---|---|---|---|---|---|
| ENSG00000249456_ZRANB1 | chr10:126631875 | chr10:126628942 | chr10:126640000-126680000 | chr10:126600000-126640000 | A549, NCI-H460, RPMI-7951, T47D |
| ZFYVE9_ENSG00000154222 | chr1:52815403 | chr1:52805352 | chr1:52800000-52840000 | chr1:52760000-52800000 | A549, RPMI-7951, SK-MEL-5, T47D |
| ENO2_ATN1 | chr12:7033739 | chr12:7031013 | chr12:7040000-7080000 | chr12:7000000-7040000 | A549, RPMI-7951, SK-MEL-5, G401 |
| ENSG00000225065_NCOA6 | chr20:33303168 | chr20:33302393 | chr20:33320000-33360000 | chr20:33280000-33320000 | NCI-H460, RPMI-7951, T47D, G401 |
| FAM177A1_PPP2R3C | chr14:35557754 | chr14:35548998 | chr14:35560000-35600000 | chr14:35520000-35560000 | A549, NCI-H460, G401 |
| ENSG00000227733_HYDIN | chr1: 146126403 | chr16:71196632 | chr1:146100000-146140000 | chr16:71160000-71200000 | NCI-H460, T47D, G401 |
| GDI2_FAM208B | chr10:5798575 | chr10:5815804 | chr10:5800000-5840000 | chr10:5760000-5800000 | A549, Caki2 |
| SLC7A6_SLC7A6OS | chr16:68338040 | chr16:68333926 | chr16:68320000-68360000 | chr16:68280000-68320000 | A549, RPMI-7951 |
| RENBP_SSR4 | chrX:153062912 | chrX:153208306 | chrX:153160000-153200000 | chrX:153080000-153120000 | A549, SK-N-MC |
Fusion transcripts having CNV signature around their breakpoints.
| Sample | Gene A | Chr Gene A | Cord Gene A | Gene B | Chr Gene B | Cord Gene B | chr | CNV satrt | CNV end | CNV state | CNV data source |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 156 fc | chr17 | 44171925 | chr17 | 44430295 | chr17 | 44757175 | 44782177 | state2,cn = 1 | 660quad microarray | ||
| 156 fc | chr17 | 44171925 | chr17 | 44648234 | chr17 | 44757175 | 44782177 | state2,cn = 1 | 660quad microarray | ||
| 171 fc | chr1 | 11316632 | chr1 | 11334116 | chr1 | 12057354 | 12062160 | dup | Exome Sequencing |
Figure 6Validation of KANSL1-ARL17 fusion in 7/9 GBM. Fusion specific PCR products (239 bp) were ran on 2% gel with 100 bp ladder and further confirmed by Sanger sequencing. Lower panel denotes a representative electropherogram generated by using cDNA from GBM_8163 which was ran on lane 4 of the agarose gel. Rest other electropherograms are shown in the supplementary file as Supplementary Fig. S6. (b) Confirmation of H1 β duplication haplotype in 6/10 in-house samples (FC). PCR products (307 bp) were run on 2% gel with 100 bp ladder. Relative expression of fusion transcript along with its parent genes was checked in GBM compared to controls. We performed real time PCR using (c) KANSL1-ARL17 fusion, (d) KANSL1 primers with cDNAs from 6 in-house normal brains along with 6 GBM samples. In both the boxplots, vertical axis represents Log2ΔCt calculated using the expression of B2M. We also performed the same set of experiment using fusion specific primers for ARL17 and depicted in the Supplementary Fig. S8. Strategy used to design the fusion specific primer is depicted in the Supplementary Fig. S7.