| Literature DB >> 34341964 |
Yu-Sheng Chen1,2,3, Shuaiyao Lu4, Bing Zhang1,2, Tingfu Du4, Wen-Jie Li1,2, Meng Lei1,2, Yanan Zhou4, Yong Zhang4, Penghui Liu4, Yong-Qiao Sun1,2, Yong-Liang Zhao1,2,5, Ying Yang6,7,8,9, Xiaozhong Peng10,11, Yun-Gui Yang12,13,14,15.
Abstract
Entities:
Mesh:
Substances:
Year: 2022 PMID: 34341964 PMCID: PMC8327898 DOI: 10.1007/s13238-021-00861-8
Source DB: PubMed Journal: Protein Cell ISSN: 1674-800X Impact factor: 14.870
Figure 1Chimeric genes of human and SARS-CoV-2 were frequently detected among highly expressed gene by RNA-seq. (A) Heatmap displaying the chimeric level for filtered chimeric genes in each sample. The chimeric level was defined as CPM of chimeric reads and normalized by depth of reads from SARS-CoV-2. (B) Venn diagram showing the number of shared and specific chimeric genes among 3 human cell lines. (C) Pie chart displaying proportion of RNA types for 132 common chimeric genes. (D) Integrative genomics viewer (IGV) tracks displaying the junction loci in both human RNA and SARS-CoV-2 RNA for common chimeric genes RPL3 (upper) and GPI (lower). The lines with different colors indicate the sources of chimeric reads, where yellow, blue and pink represents 293T, Huh-7 and Calu-3 cells, respectively. Blocks with different colors represent 5’UTR, ORF1ab, S, ORF3a, E, M, ORF6, ORF7ab, ORF8, N and 3’UTR along the SARS-CoV-2 genome. (E) Venn diagrams showing the number of shared chimeric genes for each two cell lines (top) and conserved chimeric events on shared chimeric genes at different resolutions (1 nt, 10 nt, 50 nt and 100 nt). For venn diagrams of chimeric events, the value in parenthesis represents the number of chimeric genes containing the shared chimeric events. (F) Scatter plot displaying the chimeric levels for chimeric genes in two replicates. The sizes of scatters represent number of chimeric genes with the same distribution of chimeric levels in two replicates. (G) Boxplot displaying the distributions of expression level for chimeric genes (dark color) and non-chimeric genes (light color) in each cell line. The grey block represents expression levels of 132 common chimeric genes
Figure 2Whole genome sequencing and mixed RNA-seq library reveal chimeric reads are falsely generated during library construction but not integration of SARS-CoV-2 into host genome. (A) Heatmap showed the accumulated expression score of each bin along human genome, and scatter plot below displayed the accumulated chimeric levels of corresponding bins for each cell line. (B) IGV tracks displaying the expression level (top), coverage of whole genome sequencing (middle) and genomic insertion signal (bottom) along chr14: 39,385,404-49,852,821. No viral integrating signal could be detected among all samples. (C-E) IGV tracks displaying the expression level (top), coverage of whole genome sequencing (middle) and genomic insertion signal (bottom) along bins in 293T (C), Huh-7 (D) and Calu-3 (E). (F) Proportions of viral chimeric reads with human (blue) and zebrafish RNA (green) were shown in left, while the corresponding ratio of sequencing depth for human and zebrafish, which defined as number of perfect matched reads, were shown in right. (G) Distribution of chimeric (dark blue) and non-chimeric reads (light blue) across the length of chimeric human mRNAs. 5’UTRs, CDSs, and 3’UTRs of human mRNAs were individually binned into regions spanning 1% of their total length, and the percentages of chimeric and non-chimeric reads that fall within each bin were determined, respectively. (H) Scatter plot displaying the correlation between chimeric level and counts of perfect matched fragments (CPM) to chimeric genes for human (left) and zebrafish (right) from the same mixed library