| Literature DB >> 31142650 |
Hailei Zhang1, Huan Zhong1, Shoudong Zhang1,2,3, Xiaojian Shao4, Min Ni1, Zongwei Cai4, Xuemei Chen5, Yiji Xia6,3,4.
Abstract
The 5' end of a eukaryotic mRNA transcript generally has a 7-methylguanosine (m7G) cap that protects mRNA from degradation and mediates almost all other aspects of gene expression. Some RNAs in Escherichia coli, yeast, and mammals were recently found to contain an NAD+ cap. Here, we report the development of the method NAD tagSeq for transcriptome-wide identification and quantification of NAD+-capped RNAs (NAD-RNAs). The method uses an enzymatic reaction and then a click chemistry reaction to label NAD-RNAs with a synthetic RNA tag. The tagged RNA molecules can be enriched and directly sequenced using the Oxford Nanopore sequencing technology. NAD tagSeq can allow more accurate identification and quantification of NAD-RNAs, as well as reveal the sequences of whole NAD-RNA transcripts using single-molecule RNA sequencing. Using NAD tagSeq, we found that NAD-RNAs in Arabidopsis were produced by at least several thousand genes, most of which are protein-coding genes, with the majority of these transcripts coming from <200 genes. For some Arabidopsis genes, over 5% of their transcripts were NAD capped. Gene ontology terms overrepresented in the 2,000 genes that produced the highest numbers of NAD-RNAs are related to photosynthesis, protein synthesis, and responses to cytokinin and stresses. The NAD-RNAs in Arabidopsis generally have the same overall sequence structures as the canonical m7G-capped mRNAs, although most of them appear to have a shorter 5' untranslated region (5' UTR). The identification and quantification of NAD-RNAs and revelation of their sequence features can provide essential steps toward understanding the functions of NAD-RNAs.Entities:
Keywords: Arabidopsis; NAD tagSeq; NAD+ cap; Oxford Nanopore sequencing; RNA cap
Year: 2019 PMID: 31142650 PMCID: PMC6575590 DOI: 10.1073/pnas.1903683116
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Detection of NAD+ in total RNA extract from Arabidopsis seedling. Total RNAs were digested with P1. The digest was separated by HPLC, and the fraction containing NAD+ was collected and analyzed by LC-MS. (A) Representative LC-MS chromatograph of the NAD+ standard, the NAD+ fraction from the P1-digested RNAs, and the control sample (the RNA sample treated with heat-inactivated P1). The experiment was repeated 3 times with similar results. (B) Product ions from NAD+ of the P1 digest were identical to those from the NAD+ standard.
Fig. 2.The NAD tagSeq method. (A) Diagram illustrating the reactions for tagging NAD-RNA with a synthetic RNA. In the presence of ADPRC, the alkyne (4-pentyn-1-ol) replaces the nicotinamide of NAD-RNA, resulting in NAD-RNA being functionalized with an alkyne group. In the second step, through the CuAAC reaction, the alkyne-functionalized NAD-RNA is ligated to a synthetic RNA (tagRNA) with an azide group at its 3′ end. (B) Workflow for identification of NAD-RNAs from poly(A)-containing RNAs. Poly(A)-enriched RNAs are tagged with the tagRNA through the reactions shown in A. Tagged NAD-RNAs are enriched by hybridization to the 5′ biotin-DNA probe attached to the streptavidin beads. The eluted RNAs are subjected to Oxford Nanopore library preparation and sequencing. (C) Tagging of a 38-nt NAD-RNA with a synthetic 25-nt RNA-azide. The NAD-RNAs were reacted with 4-pentyn-1-ol in the presence or absence of ADPRC and then with the RNA-azide through CuAAC, resulting in its ligation with the RNA tag. No such product was detected in the absence of ADPRC. The RNAs were resolved on a denaturing polyacrylamide gel.
Fig. 3.NAD-RNAs and their relative abundance compared with total transcripts. (A) The numbers of NAD-RNAs identified from the 3 replicates (Rep 1, 2, and 3) in the poly(A) RNA tagging experiment with (w/) (Left) or without (w/o) (Right) the tagged RNA enrichment step. Only the NAD-RNAs with a normalized TPM value differing by 7 or more in the comparison between the ADPRC+ sample and ADPRC− sample were selected for the analysis. (B) Percentage of NAD-RNA reads in the total transcript reads from the same genes detected from the poly(A) RNA tagging experiment with (Left) or without (Right) the tagged RNAs enrichment step. (C) Ten Arabidopsis genes with the highest ratios of NAD-RNA reads to total RNA reads. Shown are the counts of NAD-RNA reads and total transcript reads from the 2 poly(A) RNA tagging experiments with or without the enrichment step of tagged RNAs.
Fig. 4.NAD-RNA–producing genes tend to be highly expressed. (A) Scatter plots showing abundance of NAD-RNAs and non–NAD-RNAs from the same genes in the poly(A) RNA tagging experiments with (w/) (Left) or without (w/o) (Right) enrichment of tagged RNAs. The different colors of the dots indicate transcript density: red (highest) > yellow > green > blue (lowest). The RNA levels shown were normalized TPM and under the fourth root to better represent genes with no NAD-RNA read. The solid red line is the identity line; the dashed red line represents the locally weighted regression line, which indicates a high positive correlation between NAD-RNA read values and non–NAD-RNA read values. (B) Overlap between the 500 most abundant NAD-RNAs and the 500 most abundant total transcripts detected in the poly(A) RNA tagging experiment. (C) Comparison of transcript half-lives between the 2,000 NAD-RNA–producing genes and all genes. The half-lives of transcripts were based on ref. 13. (D) Comparison of abundance of proteins from NAD-RNA–producing genes and all genes. The protein abundances were based on normalized spectral counts from the proteomic data from ref. 14.
Fig. 5.Sequence organization of NAD-RNAs. (A) Alignment of NAD-RNA reads and other RNA reads to the genomic sequences of 2 genes. Aligned sequencing reads were visualized using the Integrative Genomics Viewer, with the density of reads denoted above the genes. For the organization of the genes, boxes represent exons, and the lines represent introns. (B) Locations of 5′ ends of the NAD-RNAs relative to annotated TSSs. The reads in the positive x axis had the identified 5′ ends located downstream of the annotated TSSs. (C) Locations of 5′ ends of the NAD-RNAs relative to annotated translation start sites. The reads in the negative x axis had their 5′ ends located upstream of translation start sites. (D) Locations of the 3′ ends of the NAD-RNA reads relative to annotated translation end sites. The reads in the positive x axis contained a 3′ UTR. (E) Distribution of 3′ ends of the NAD-RNA reads (black) and all RNA reads (gray) relative to annotated polyadenylation sites.