| Literature DB >> 35992789 |
Liwei Song1,2, Mengjun Yu1, Renjing Jin1, Meng Gu1, Ziyu Wang1, Dailun Hou3, Shaofa Xu2, Jinghui Wang1,4, Teng Ma1.
Abstract
The DNA-dependent protein kinase catalytic subunit (DNA-PKcs) with a Ku70/Ku80 heterodimer constitutes the intact DNA-PK kinase, which is an upstream component of the DNA repair machinery that signals the DNA damage, orchestrates the DNA repair, and serves to maintain genome integrity. Beyond its role in DNA damage repair, the DNA-PK kinase is also implicated in transcriptional regulation and RNA metabolism, with an illuminated impact on tumor progression and therapeutic responses. However, the efforts to identify DNA-PK regulated transcriptomes are limited by short-read sequencing to resolve the full complexity of the transcriptome. Therefore, we leveraged the PacBio Single Molecule, Real-Time (SMRT) Sequencing platform to study the transcriptome after DNA-PK inactivation to further underscore the importance of its role in diseases. Our analysis revealed additional novel transcriptome and complex gene structures in the DNA-PK inactivated cells, identifying 8,355 high-confidence new isoforms from 3,197 annotated genes and 523 novel genes. Among them, 380 lncRNAs were identified. We validated these findings using computational approaches and confirmatory transcript quantification with short-read sequencing. Several novel isoforms representing distinct splicing events have been validated through PCR experiments. Our analyses provide novel insights into DNA-PK function in transcriptome regulation and RNA metabolism.Entities:
Keywords: DNA-PK; alternative splicing; long-read sequencing; short-read sequencing; transcriptome
Year: 2022 PMID: 35992789 PMCID: PMC9382581 DOI: 10.3389/fonc.2022.941638
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 3Transcriptome structure analysis and alternative splicing profiling in the long-read data. (A) The circus display of chromosome sequence; alternative splicing loci (light blue for intron retention, green for A3 alternative splicing, yellow for A5 alternative splicing, purple for exon skipping, red for mutual exclusive exons, brown for alternative first exon, dark blue for alternative last exon); alternative polyadenylation loci; the distribution of new transcripts; novel gene distribution; lncRNA density distribution; gene fusions, purple line (same), yellow line (different) genes on chromosomes fused. (B) Alternative splicing patterns in the long-read data. The different splicing patterns were depicted in the left panel. The number of genes with different splicing events between control cells and NU7441-treated cells were summarized. The percentage of different splicing patterns was also shown in the pie chart. (C) Validation of the alternative splicing events in long-read data. Different splicing events for AGO3, ACTN4, GRIK2, HK2, and POM121 were validated by qPCR.
Figure 1The long-read sequencing flow chart and long-read data characteristics. (A) The experimental design and the data processing pipeline of the long-read sequencing. (B, D) The chromosomal distribution of sequencing reads between control cells and NU7441 treated cells. (C) Summary of the reads mapped to the reference genome. (E) The number of genes covered by the long-read sequencing data between control cells and NU7441-treated cells.
Figure 2Annotations of the long-read sequencing data. (A) Annotations of the unmapped transcripts in control cells and NU7441-treated cells (p <0.001). (B) Annotations of the novel genes in control cells and NU7441-treated cells (p <0.001). (C) Annotations of the isoforms in control cells and NU7441-treated cells in (p <0.001). (D) GO analysis of novel genes identified in control cells and NU7441-treated cells. (E) KEGG analysis of novel genes identified in control cells and NU7441-treated cells.
Figure 4Differential expression of genes analysis in the short-read data. (A) Volcano plot for differentially expressed genes between control and NU7441-treated cells. (B) Venn diagram for differentially or co-expressed genes between control and NU7441-treated cells. (C) Functional enrichment of the differentially expressed genes between control and NU7441-treated cells.