Literature DB >> 28180294

Removing bias against short sequences enables northern blotting to better complement RNA-seq for the study of small RNAs.

Yun S Choi1, Lanelle O Edwards1, Aubrey DiBello1, Antony M Jose1.   

Abstract

Changes in small non-coding RNAs such as micro RNAs (miRNAs) can serve as indicators of disease and can be measured using next-generation sequencing of RNA (RNA-seq). Here, we highlight the need for approaches that complement RNA-seq, discover that northern blotting of small RNAs is biased against short sequences and develop a protocol that removes this bias. We found that multiple small RNA-seq datasets from the worm Caenorhabditis elegans had shorter forms of miRNAs that appear to be degradation products that arose during the preparatory steps required for RNA-seq. When using northern blotting during these studies, we discovered that miRNA-length probes can have ∼1000-fold bias against detecting even synthetic sequences that are 8 nt shorter. By using shorter probes and by performing hybridization and washes at low temperatures, we greatly reduced this bias to enable nearly equivalent detection of 24 to 14 nt RNAs. Our protocol can discriminate RNAs that differ by a single nucleotide and can detect specific miRNAs present in total RNA from C. elegans and pRNAs in total RNA from bacteria. This improved northern blotting is particularly useful to analyze products of RNA processing or turnover, and functional RNAs that are shorter than typical miRNAs.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28180294      PMCID: PMC5449620          DOI: 10.1093/nar/gkx091

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Small non-coding RNAs such as microRNAs (miRNAs) regulate much of our genome (1) and thus can be broad indicators of health and disease. Next generation RNA sequencing (RNA-seq) (2) has quickly developed to become a powerful method to measure the expression of small RNAs. Because changes in miRNA expression have been observed in multiple disease states (3), there is great interest in measuring miRNA levels accurately for use in clinical diagnosis (4,5). Therefore, several additional approaches that augment RNA-seq for the accurate measurement of miRNA levels are being pursued (see ref. 6 for an overview). A particular challenge in measuring miRNA abundance is that each miRNA gene can be expressed as a diverse population of related sequences (7) called isomiRs that differ at both 5΄ and 3΄ ends (8). Heterogeneity at the 3΄ end was discovered early (9–11) and remains the dominant variation of miRNA sequence that is detected. These 3΄ end variants can have both trimming and extension, which could be due to variation in miRNA precursor processing, activity of RNA exonucleases and/or non-templated nucleotide addition by nucleotidyltransferases (reviewed in 12). Modifications at the 3΄ end (e.g. adenylation (13) and deadenylation (14)) have been associated with specific functional consequences. Recent pulse-chase experiments (15) have revealed that fast decaying miRNAs are associated with 3΄ tailed and 3΄ trimmed isomiRs. In contrast, the rarity of 5΄ end variants has been attributed to the functional importance of the 5΄ end seed sequence in determining the targets regulated by miRNAs (16). Yet, there are now reports that some isomiRs with different 5΄ ends can behave as functionally distinct miRNAs and regulate different sets of target genes (17–20). Despite these advances, the biological significance of isomiRs, in most cases, remains unknown. While miRNA variants are often first detected using RNA-seq, their validation and further study requires additional approaches to ensure that a detected RNA is present in the biological sample and not generated during the procedures required for RNA-seq. Although RNA-seq is sensitive, it has limitations and sources of bias (reviewed in 21). For example, RNA-seq requires amplification, which can vary based on sequence composition (22,23), and ligation, which can vary based on modifications of terminal nucleotides (24) or presence of complementary sequences (25). In addition to these limitations that are intrinsic to RNA-seq, certain RNA purification methods can selectively miss sequences based on their composition (26). These considerations heighten the need for complementary approaches to accurately analyze populations of small RNAs. One of the best approaches for validating the expression of a new RNA without the need for ligation or reverse transcription and irrespective of terminal modifications on the RNA is northern blotting. Therefore, substantial effort has been devoted to improving the sensitivity of this technique for the analysis of small RNAs. These include the use of locked nucleic acid (LNA)-modified probes (27), the use of chemical crosslinking of RNA to the membrane (28) and the demonstration that digoxigenin-labeled probes (29,30) can be more sensitive than radioactive probes (31). Together, improvements in sensitivity have made it possible to detect as little as 50 attomoles of a miRNA (31). Furthermore, the recent development of a technique to unmask small RNAs bound to complementary RNAs (25) and an approach to multiplex probes (32) continue to improve the ease of use and applicability of northern blotting. Here we detect unexpected forms of well-studied Caenorhabditis elegans miRNAs in some RNA-seq datasets, discover that northern blotting can have drastic biases against short RNAs and develop an improved northern blotting protocol that reduces this bias to enable specific detection of RNAs in the 14 to 24 nt size range including endogenous miRNAs in C. elegans and product RNAs (pRNAs) in bacteria.

MATERIALS AND METHODS

Analysis of RNA-seq datasets

All datasets analyzed in this study are listed in Supplementary Table S1.

Detection of length variants of miRNAs in RNA-seq datasets

FASTQ or FASTA datasets were aligned using Bowtie to the C. elegans genome (Ce6) after removing adaptor sequences in the Galaxy interface (33–36). The bowtie parameters used were: bowtie -q -p 8 -S -n 2 -e 70 -l 28 –maxbts 125 -k 4 –phred33-quals . For adapter trimming, a minimum sequence length after trimming was set to 15 nt and all sequences with unknown (N) bases were discarded. Reads that aligned to genomic intervals of known miRNA genes (miRBase WBcel235) were filtered and used to calculate the average length of reads per miRNA gene. All reads (if total <1000) or 1000 randomly selected reads (if total >1000) that aligned to miRNAs that showed shortening were included to determine the coverage per nucleotide using Matlab (Mathworks).

Detection of miR-52 variants

Wild-type (N2) datasets that were deposited as compressed FASTA files or as SRA files were retrieved from the NCBI Sequence Read Archive. SRA datasets were converted to FASTQ format using SRA Toolkit V.2.4.2 and used regardless of quality scores. Prevalence of miR-52 with missing 5΄ end nucleotides (5΄-truncated) in N2 datasets was calculated by comparing the number of reads that exactly matched the full, annotated length (24 nt) to the number of reads that included the 3΄-most 15 bases of miR-52 (ATATGTTTCCGTGCT) using grep function in unix. This 15-mer showed only one match to the C. elegans genome (miR-52) when searched with BLASTn. Only reads that exactly matched the reference genome were kept and reads with 3΄ end variation were excluded. The number of reads required to consider a variant was scaled with the size of the RNA-seq dataset. In our detailed examination of miR-52 variants, we counted only those forms of miR-52 that had a perfect match to the annotated 3΄ end of miR-52. In addition to these 5΄ truncated versions of miRNAs, there were additional forms that varied at both the 5΄ and 3΄ ends as expected because of the known 3΄ end heterogeneity of miRNAs (12). Such miRNA variants with 3΄ truncation and/or 3΄ nucleotide addition were excluded from our counts. Circularly permuted forms of miR-52 were identified by searching for reads (using grep) that exactly matched all the rearranged full-length miR-52 sequences. These sequences were used to search the nucleotide database with BLASTn and no alignments other than miR-52 were found in the C. elegans genome. All plots were created using Excel (Microsoft), Matlab (Mathworks) and Illustrator (Adobe).

Generation of mut-2(jam9)

The mut-2 locus (37) was edited by homology-based repair using the method described earlier (38,39). Briefly, plasmids that separately express Cas9, sgRNA, a homology repair template and mCherry markers were co-injected into worms. Transgenic worms that expressed the mCherry marker were screened for edits in the mut-2 locus. The sgRNA targeting mut-2 was designed using E-CRISP (40) and cloned into the pU6::unc-119 sgRNA expression vector (Addgene 46169, (39)). The unc-119 sgRNA was removed from the plasmid by restriction digest (MfeI and XbaI; all restriction enzymes were from NEB). The large fragment of the plasmid digest was gel purified (28706, Qiagen) and used as the vector backbone for cloning of the new sgRNA. To prepare the insert containing the sgRNA targeting mut-2, two overlapping polymerase chain reaction (PCR) products were amplified with Phusion DNA polymerase (NEB, used for all PCRs except as indicated) in separate reactions using pU6::unc-119 as a template. Primer pair YC001 and YC128 was used for the first reaction and primer pair YC127 and YC020 was used for the second reaction. Both PCR products were used as templates and the fused product was amplified using YC001 and YC020 as primers. The amplified product was column purified (28106, Qiagen), digested (MfeI and XbaI), gel purified (28706, Qiagen) and cloned into the prepared pU6::unc-119 sgRNA expression vector with T4 DNA ligase (NEB). The homology repair template was prepared by initially cloning the mut-2 locus and then replacing the mut-2 coding regions with that of gfp. The mut-2 coding sequence and ∼1.4 kb of flanking sequences were amplified from wild-type (N2) genomic DNA using the primers YC087 and YC085. The PCR product was column purified (28106, Qiagen), digested (ApaI and PmlI) and used as the insert. The plasmid pCFJ-151 (Addgene 19330, (41)) was digested (ApaI and PmlI) and the ∼2.3 kb fragment was gel purified, and used as vector. The resultant cloned plasmid (pmut-2_genomic) was used to amplify the 5΄ (primers YC039 and YC134) and 3΄ (primers YC137 and YC138) flanking sequences of mut-2 in separate reactions. The gfp coding region was then amplified from pGC305 (Addgene 19646, (42)) with primers (YC135 and YC136) that also overlap mut-2 flanking sequences. The three fragments were combined by overlap extension PCR using primers YC039 and YC138. The resultant ∼1.8kb fusion product was gel purified, digested (PstI and BamHI), column purified (28106, Qiagen) and used as insert. The plasmid pmut-2_genomic was digested (PstI and BamHI), ∼3.8 kb fragment was gel purified (28706, Qiagen) and used as vector to generate pmut-2_to_eGFP_HRT. Plasmids for microinjection were suspended in 10 mM Tris–Cl, pH 8.5 to the following final concentrations: 50 ng/μl Peft-3::cas9-SV40_NLS::tbb-2 3΄UTR (Addgene 46168, (39)); 22.5 ng/μl PU6::mut-2_sgRNA (based on Addgene 46169, (39)); 10 ng/μl pmut-2_to_eGFP_HRT; 10 ng/μl pMA122 (Addgene 34873, (43)); 10 ng/μl pGH8 (Addgene 19359, (41)); 2.5 ng/μl pCFJ90 (Addgene 19327, (41)); 5.0 ng/μl pCFJ104 (Addgene 19328, (41)). F1 progeny of injected worms that expressed the mCherry coinjection markers were selected and allowed to have progeny. Twenty sets of five F2 progeny that did not express mCherry coinjection markers were moved to a plate. Pools of worms from each plate were screened for editing by PCR (primers YC176 and YC171). Individuals with successful editing were identified, homozygosed, confirmed by genomic DNA sequencing and designated as mut-2(jam9).

RNA purification

Worms grown on 35 mm plates of LB agar and fed Escherichia coli (OP50) were collected from starved plates by rinsing with M9 buffer (22 mM KH2PO4, 42 mM Na2HPO4, 86 mM NaCl and 1 μM MgSO4). Worms were pelleted (3 min at 750 × g tabletop microcentrifuge) and excess M9 buffer was aspirated leaving behind ∼50 μl. 1.0 ml TRIzol (LifeTechnologies) was added to the worm pellet and frozen at −80°C. After thawing, the samples were processed according to manufacturer's instructions. RNA was resuspended in nuclease free water (IDT) and its concentration was measured by ultraviolet (UV) spectrophotometry (Nanovue GE).

Induction and purification of pRNA

Escherichia coli strains were grown in LB for 24 h at 37°C in a shaking incubator. pRNAs were induced by transferring 10 ml of saturated culture to 40 ml of pre-warmed LB and placed back in the shaking incubator. Aliquots (5 ml) of induced culture were removed after 3 and 5 min of shaking, placed on ice and then collected in a 4°C centrifuge (3 min, 1500 × g). The growth medium was removed and cells were resuspended in 1.0 ml of Trizol and then transferred to 1.7 ml microcentrifuge tubes. Cells in Trizol were disrupted by freeze-thaw cycles of flash freezing in liquid N2 and then incubated at 65°C for 5 min. Total RNA was then collected as per manufacturer's protocol (Life Technologies).

RT-PCR

A total of 250 ng of total RNA was reverse transcribed with SuperScript III (LifeTechnologies) using 500 ng oligo-dT18 primer (IDT) according to manufacturer's protocol. Gene specific primers and Taq polymerase were used to amplify target genes from 1 μl of the cDNA synthesis reaction. Cycling conditions were an initial incubation at 95°C for 30s followed by 30 cycles of: 95°C, 30s; annealing temperature (52°C, GFP; 55°C, mut-2; and 55°C, ttb-2), 30s; 72°C, 30s; and a final 72°C incubation of 5 min.

Northern blotting

Northern blotting was done as described earlier (31) with some essential modifications as noted below. In brief, 5 μg (mut-2(jam9)) or 10 μg (miRNA) of total worm RNA or 100 fmol of each synthetic RNA was separated by denaturing 20% polyacrylamide gel electrophoresis (PAGE) (7M urea) and transferred (25 V, 1.0 A, 30 min, BioRad Transblot-Turbo) to a positively charged nylon membrane (11209299001, Roche). RNA was crosslinked to the membrane using UV irradiation (1200 μJ/m2, VWR UV Crosslinker). Synthetic DNA (IDT) probes were made with Roche DIG Oligonucleotide Tailing kit following manufacturer's protocol or bought as 5΄-monodigoxigenin-labeled (mono-DIG) oligonucleotides (IDT). Probes were hybridized to membranes (2.5 pmol probe in 5 ml ULTRAhyb buffer, Ambion LifeTechnologies) by incubating over night (with rotation at 8 RPM in a VWR Hybridization Oven) at 42°C for full-length probes and at room temperature (RT, ∼22°C) for short probes. Membranes were then washed (2 × 15 min at 37°C for full-length miR-52 probe; 2 × 15 min at 42°C for full-length miR-53 probe; and 2 × 15 min at RT for all short probes). Membranes were blocked (31), incubated with DIG antibodies (31), washed (4 × 15 min at RT for long probes and 2 × 5 min for short probes), developed (31) and imaged (Fujifilm LAS-3000). Blots were stripped by incubation in ∼5–10 ml of 5% sodium dodecyl sulphate at 80°C for 15 min and washed twice with ∼5–10 ml 2× Saline-Sodium Citrate (SSC) at 80°C for 5 min. The total RNA blot was probed first for miR-52, stripped and then probed for U6. For the miR-52 and miR-53 blot, the probes were used in the following order: short miR-52, short miR-53, full-length miR-52 and full-length miR-53. The synthetic RNA blot was probed in the following order: miR-1, let-7, miR-58 and miR-52. The miRNA in total RNA preparations blot was probed in the following order: miR-52, miR-58 and U6. The blot for comparing mono-DIG labeling with DIG tailing was probed in the following order: 15 nt DIG-tailed, 15 nt mono-DIG, 24 nt DIG-tailed, 24 nt mono-DIG and U6. Northern blotting of miR-1 from total RNA was done as above with 15 and 17 nt DIG-tailed DNA oligos. Northern blotting of pRNA was done in a similar manner using 10 μg of total RNA purified from E. coli strains. The membrane was first probed with a 15 nt DIG-tailed DNA oligo for pRNAs, stripped and reprobed with a 22 nt DIG-tailed DNA oligo for 5S rRNA.

Image analysis

Northern blot images were superimposed as separate layers in Photoshop (Adobe) and arranged so that the same area of the membrane was cropped for analysis. Cropped images were analyzed with ImageJ64 1.48v (44). Background was subtracted (50 pixels radius) before measuring band intensities using lanes of equal widths. Areas of peaks were determined by closing the base of each peak with a line segment. Peak areas were plotted with Excel and Illustrator.

E. coli strains

Wild-type (K-12) and ΔssrS1 (KW73).

Worm strains

Wild-type (N2), miR-52(n4114), miR-58(n4640), mir-1(n4101) and mut-2(jam9[gfp]) were maintained at 20°C using standard methods (45).

RESULTS

Some miRNAs show substantial 5΄ truncation in multiple RNA-seq datasets

A growing body of work shows that processes that change the length of a miRNA can be regulated (17–19) and are biologically important. To look for such variations in the length of miRNAs from C. elegans, we examined some published small RNA-seq datasets (Supplementary Table S1). Because non-templated nucleotide addition to 3΄ ends of miRNAs is a common source of size heterogeneity (7), we compared datasets prepared from wild-type worms and from worms lacking a putative nucleotidyltransferase, MUT-2, which is predicted to add nucleotides to the 3΄ ends of RNA (37). Surprisingly, we found that the average lengths of many miRNAs were substantially shorter in a dataset obtained from mut-2(ne3364) animals (Figure 1A), but to a lesser extent in a dataset obtained from mut-2(ne298) animals (data not shown). We also observed a similar shortening of miRNAs in a dataset from wild-type animals undergoing pos-1 RNAi (Figure 1A) but not in a dataset from animals lacking MUT-16 (Figure 1A), which promotes formation of perinuclear foci where MUT-2 is thought to extend small RNAs (46,47). Despite the two datasets that showed an abundant population of shorter miRNAs being generated in different labs (see Supplementary Table S1), the datasets shared a subset of miRNAs that were substantially truncated. These included all members of an essential miRNA family in C. elegansmiR-51 to miR-56 (48) (Figure 1B for mut-2(ne3364) and wild-type animals undergoing RNAi). Furthermore, miRNA truncation was specific to the 5΄ end of the more frequently sequenced arm (miRNA-3p or miRNA-5p) of the miRNA precursor (Figure 1C and Supplementary Figure S1).
Figure 1.

A subset of miRNAs is detected predominantly with missing nucleotides at the 5΄ end in some small RNA-seq datasets. (A) Many miRNAs show altered average lengths in two different RNA-seq datasets. The average length of miRNAs detected in RNA-seq datasets both from wild-type animals (black) and from mut-16(-) (magenta), mut-2(-) (blue) or wild-type animals undergoing pos-1 RNAi (green) were determined. The average length of each miRNA was normalized to that detected in the dataset from wild-type animals and plotted in rank order starting with the miRNAs that show the most proportional shortening. Total numbers of miRNAs present in both datasets (shared miRNAs) for each pair of datasets compared are indicated. (B) A common set of miRNAs, including the entire miR-51 to 56 family, show the most shortening in two different RNA-seq datasets. The average lengths of the most shortened miRNAs detected (A) were plotted for all datasets. Dataset labels as in (A) and the miR-51 to 56 family (brown) is highlighted. (C) The shorter length of miRNAs in both datasets is due to a shorter 5΄ end and is specific to the more abundantly sequenced arm of the miRNA. One thousand RNA reads that map to a miRNA (miR-52 shown as an example) were sampled from wild-type (black), mut-2(-) (blue) or wild-type animals undergoing pos-1 RNAi (green) and the resultant coverage (reads per base) of both arms of the miRNA gene (-5p and -3p, which are also in numbers per 1000 5p reads in insets) were plotted. Also see Supplementary Figure S1.

A subset of miRNAs is detected predominantly with missing nucleotides at the 5΄ end in some small RNA-seq datasets. (A) Many miRNAs show altered average lengths in two different RNA-seq datasets. The average length of miRNAs detected in RNA-seq datasets both from wild-type animals (black) and from mut-16(-) (magenta), mut-2(-) (blue) or wild-type animals undergoing pos-1 RNAi (green) were determined. The average length of each miRNA was normalized to that detected in the dataset from wild-type animals and plotted in rank order starting with the miRNAs that show the most proportional shortening. Total numbers of miRNAs present in both datasets (shared miRNAs) for each pair of datasets compared are indicated. (B) A common set of miRNAs, including the entire miR-51 to 56 family, show the most shortening in two different RNA-seq datasets. The average lengths of the most shortened miRNAs detected (A) were plotted for all datasets. Dataset labels as in (A) and the miR-51 to 56 family (brown) is highlighted. (C) The shorter length of miRNAs in both datasets is due to a shorter 5΄ end and is specific to the more abundantly sequenced arm of the miRNA. One thousand RNA reads that map to a miRNA (miR-52 shown as an example) were sampled from wild-type (black), mut-2(-) (blue) or wild-type animals undergoing pos-1 RNAi (green) and the resultant coverage (reads per base) of both arms of the miRNA gene (-5p and -3p, which are also in numbers per 1000 5p reads in insets) were plotted. Also see Supplementary Figure S1. The 5΄-truncated miRNAs could have been generated during the preparation of RNA for sequencing or during the lifecycle of miRNAs in vivo. The severely shortened forms observed in miRNAs from mut-2(ne3364) animals are surprising because mut-2 has been reported to be required for silencing transposons through the generation of two other classes of small RNAs (22G and 21U) but not for the generation or stability of miRNAs (49). One possibility is that either this allele of mut-2 or other mutations resulting from unregulated transposon activity are responsible for miRNA truncation.

An abundant 5΄-truncated miRNA observed in the mut-2(ne3364) RNA-seq dataset is not detectable by northern blotting in newly generated mut-2(jam9) null mutants

To test if the wild-type activity of MUT-2 is required to protect miR-51 to miR-56 family from 5΄ truncation, we examined levels of the most highly expressed and most severely truncated member, miR-52, using northern blotting of total RNA prepared from mut-2(-) animals. To control for mutations induced by unregulated transposon activity in mut-2(-) animals (50), we analyzed independent passages of a newly generated mut-2 null mutant. Specifically, we replaced the entire coding portion of mut-2 with DNA encoding gfp (Supplementary Figure S2A), which resulted in the expression of gfp instead of mut-2 (Supplementary Figure S2B). When miR-52 was examined in three independent passages of the mut-2 null mutant using northern blotting, no evidence for shortening was detectable (Figure 2). For each passage, all detectable miR-52—from wild-type and from mut-2(-) animals—was ∼24 nt long, which is the annotated length of miR-52. This result is in contrast to the 9 nt truncation that was observed in ∼94% of the miR-52 reads by RNA-seq of mut-2(ne3364) worms (Figure 1C). Thus, our results suggest that MUT-2 is not required for the expression of full-length miR-52.
Figure 2.

Newly generated mut-2 null mutants do not show detectable shortening of miR-52 and short synthetic sequences are not effectively detected by conventional northern blotting. 5΄ shortened versions of miR-52 are not detectable in mut-2(jam9) animals using conventional northern blotting. Total RNA prepared from wild-type, mir-52 null mutant (mir-52(n4114)) or three isolates (a, b, c) of mut-2 null mutant (mut-2(jam9)) animals was probed for miR-52 or U6 (loading control). When used as size standards (DNA oligos), 24 nt of mir-52 sequence (10 fmol) was detected more easily than 5× molar excess of 16 nt of 5΄ shortened miR-52 sequence (50 fmol).

Newly generated mut-2 null mutants do not show detectable shortening of miR-52 and short synthetic sequences are not effectively detected by conventional northern blotting. 5΄ shortened versions of miR-52 are not detectable in mut-2(jam9) animals using conventional northern blotting. Total RNA prepared from wild-type, mir-52 null mutant (mir-52(n4114)) or three isolates (a, b, c) of mut-2 null mutant (mut-2(jam9)) animals was probed for miR-52 or U6 (loading control). When used as size standards (DNA oligos), 24 nt of mir-52 sequence (10 fmol) was detected more easily than 5× molar excess of 16 nt of 5΄ shortened miR-52 sequence (50 fmol).

5΄ truncated miRNAs are likely degradation products that arose during RNA-seq

Our failure to detect 5΄-truncated miR-52 in mut-2 null mutants raised the possibility that the apparent 5΄ truncation observed in RNA-seq datasets results from the preparatory procedures required for RNA-seq. Consistently, we observed 5΄-truncated miR-52 in 17 of 59 datasets with prevalence ranging from 79 to 19% of the miR-52 reads (Supplementary Figure S3A). Such 5΄ truncation of miR-52 was not restricted to datasets from any lab, developmental stage, sex or cloning method (Supplementary Figure S3A and Table S1). Additionally, all variations of circularly permuted miR-52 could also be detected (Supplementary Figure S3B). The simplest explanation for the presence of these circularly permuted sequences could be that the miRNA was circularized and then randomly cleaved along the phosphate backbone (Supplementary Figure S3C). If the linearized forms had the necessary 5΄- and 3΄-functional groups, it could then be captured in the RNA-seq library cloning protocol. Permuted miR-52 variants were rare compared to the total number of miR-52 reads but some forms were nevertheless sequenced more frequently than many other annotated miRNAs. Taken together, these observations suggest that RNA-seq datasets contain 5΄-truncated and other processed versions of miRNAs whose provenance is currently unclear.

Northern blotting using miRNA-length probes is biased against detecting short RNAs

To complement RNA-seq data, additional methods that effectively detect small RNAs are needed. Although the average size of miRNAs is ∼23 nt, miRbase (V.21) contains a wide range of miRNA sizes (from the 15 nt miR-7238-3p from Mus musculus to the 34 nt miR-5971 from Haemonchus contortus) with varying degrees of functional validation. Therefore, a method that can effectively detect RNAs in the range of 15 to 34 nt would be useful to independently evaluate the expression of some miRNAs and other short RNAs like the truncated variants that we detected in RNA-seq datasets. When examining the levels of the 24 nt long miR-52, we made an observation that suggests that typical northern blotting has a strong bias against shorter sequences. The 24 nt probe was able to detect 10 fmol of 24 nt DNA analogue of miR-52 effectively but barely detected 50 fmol of 16 nt DNA representing 5΄-truncated miR-52 (Figure 2, DNA oligos lane). Because both these DNA sequences were synthetic, the difference in their detection is likely due to differences in the length of the sequence and not due to any chemical changes to the nucleotides that could have occurred in vivo. Thus, northern blotting needs to be modified for effectively detecting short RNAs.

Northern blotting can be improved to reduce bias against short RNAs and maintain target specificity

A previous study reported the detection of a specific ∼14 nt RNA called pRNA in samples prepared from two species of bacteria using a modified northern blotting protocol (51). This approach used LNA probes that are 5΄ digoxigenin-labeled, chemical crosslinking of RNA to nylon membranes using 1-ethyl-3(3-dimethylaminopropyl)-carbodiimide (EDC), native PAGE and hybridization at 50 to 68°C. While this approach is powerful, the authors noted a few shortcomings: (i) LNA probes are expensive and detection is affected by the distribution of LNA modifications on the probe; (ii) EDC linking requires a terminal phosphate, in the absence of which, the signal was reduced by ∼10-fold; and (iii) to avoid EDC crosslinking to urea, RNA samples have to be run on native PAGE, which can affect the migration of small RNAs with secondary structure or that interact with complementary sequences (e.g. stable 6S RNA:pRNA hybrids). Therefore, we wondered whether the detection of miRNAs as well as truncated forms of miRNAs or other such shorter RNAs using cheaper DNA probes and under denaturing conditions could be achieved by UV crosslinking and performing hybridization at lower temperatures. To begin testing this possibility, we used two different length probes to detect progressively 5΄-shortened versions of synthetic miR-52 and miR-53. In addition to the 24 nt probes, we designed shorter probes (15 nt for miR-52 and 16 nt for miR-53) that target the 3΄ end of both miRNAs and include the single nucleotide polymorphism that distinguishes miR-52 from miR-53 (Figure 3A). To account for the differential base pairing potential of different length probes, we hybridized at 42°C for 24 nt probes and at room temperature (∼22°C) for short probes. The 24 nt probe could only detect the 24 to 18 nt versions of the miRNAs, with rapidly diminishing signal intensity for shorter RNAs (Figure 3A and B, black). However, the short probes were able to detect all of the different length RNAs with comparable efficiency (Figure 3A and B, red). The stark bias against shorter RNAs when 24 nt probes were used (e.g. ∼30-fold for 18 nt miR-53 and ∼1000-fold for 16 nt miR-52) was greatly reduced when 16 or 15 nt probes were used (a maximum of ∼7-fold difference across 24 to 14 nt).
Figure 3.

Short probes enable effective detection of short RNAs using northern blotting. (A and B) Short probes reduce length bias in northern blotting and enable detection of RNAs as short as 14 nt with single nucleotide discrimination. Equimolar (100 fmol) amounts of synthetic RNA representing miR-52 and miR-53 (differs from miR-52 by a single nucleotide, asterisk) with progressive 5΄ shortening in 2 nt increments were analyzed by northern blotting using short probes (red, 15 or 16 nt) or using full-length probes (black, 24 nt). Northern blot (A) and its quantification (B) are shown. Relative intensity of each band was determined using that of 24 nt RNAs as reference (long and short probes are as in (A)). Error bars represent the standard deviation of the signals for three blots using RNA from three independent dilutions and are omitted when negative (*). While blotting using full-length probes showed an extreme bias for longer RNA and substantial cross-reactivity (also see ref. 34), blotting with short probes detected all sizes with less cross-reactivity. (C) The selectivity of short DNA probes is comparable to long DNA probes. The selectivity of a probe was calculated as the ratio of the signal from precisely complementary RNA (target RNA) to that from 1 nt mismatched complementary RNA (mismatch RNA) of the same length. The relative selectivity of the short probe (x-axis) compared to the long probe was calculated by taking the ratio of the ratios. Error bars indicate standard deviations from three independent measurements. (D) Short probes with differing GC content are able to selectively detect different miRNAs. One membrane with equimolar (100 fmol) amounts of synthetic RNA representing miR-1, let-7, miR-58 and miR-52 was sequentially probed using the corresponding 15 nt probes with the indicated GC content (% GC).

Short probes enable effective detection of short RNAs using northern blotting. (A and B) Short probes reduce length bias in northern blotting and enable detection of RNAs as short as 14 nt with single nucleotide discrimination. Equimolar (100 fmol) amounts of synthetic RNA representing miR-52 and miR-53 (differs from miR-52 by a single nucleotide, asterisk) with progressive 5΄ shortening in 2 nt increments were analyzed by northern blotting using short probes (red, 15 or 16 nt) or using full-length probes (black, 24 nt). Northern blot (A) and its quantification (B) are shown. Relative intensity of each band was determined using that of 24 nt RNAs as reference (long and short probes are as in (A)). Error bars represent the standard deviation of the signals for three blots using RNA from three independent dilutions and are omitted when negative (*). While blotting using full-length probes showed an extreme bias for longer RNA and substantial cross-reactivity (also see ref. 34), blotting with short probes detected all sizes with less cross-reactivity. (C) The selectivity of short DNA probes is comparable to long DNA probes. The selectivity of a probe was calculated as the ratio of the signal from precisely complementary RNA (target RNA) to that from 1 nt mismatched complementary RNA (mismatch RNA) of the same length. The relative selectivity of the short probe (x-axis) compared to the long probe was calculated by taking the ratio of the ratios. Error bars indicate standard deviations from three independent measurements. (D) Short probes with differing GC content are able to selectively detect different miRNAs. One membrane with equimolar (100 fmol) amounts of synthetic RNA representing miR-1, let-7, miR-58 and miR-52 was sequentially probed using the corresponding 15 nt probes with the indicated GC content (% GC). Another important consideration for northern blotting is specific recognition of a target RNA. We measured the specificity of short and long probes by comparing their ability to detect perfectly complementary RNA (target) with that to detect RNA that differed by a single nucleotide. The 24 nt miR-52 probe showed almost equal reactivity to 24 nt miR-52 and miR-53 (Figure 3A), consistent with previous reports of cross-reactivity (52). The 24 nt miR-53 probe, which was subjected to washes at higher temperature (42°C washes compared to 37°C used for miR-52), showed reduced cross-reactivity. However, the increased stringency of the washes increased the bias against detecting shorter RNAs as evidenced by the exponential decrease in signal as the length of target RNA decreased (Figure 3A and B). In contrast, short probes were able to detect all targets with specificity comparable to full-length probes (Figure 3A–C; Supplementary Figures S4 and 5). Finally, short probes were able to specifically detect different small RNAs of varying GC content (Figure 3D). A 15 nt probe with only 27% GC could still bind specifically to its miR-1 target with no cross-reactivity to three other abundant synthetic RNAs on the same membrane (Figure 3D). Thus, short probes and hybridization at low temperatures can reduce the bias against the detection of short RNAs that arise when long probes and typical hybridization temperatures are used.

Fifteen nucleotide probes can specifically detect miRNAs in total RNA from C. elegans and pRNAs in total RNA from E. coli

Although our method demonstrates the specific detection of synthetic RNAs, it will be of limited utility unless it can be applied to detect specific sequences in complex mixtures of RNA. Therefore, we attempted the detection of three specific miRNAs (miR-1, miR-42 and miR-58) with 15 nt probes of varying GC content in preparations of C. elegans total RNA. These miRNAs were chosen based on the availability of viable deletion mutants that can be used as essential controls and to cover a range of GC content (miR-1, 27% GC; miR-52, 40% GC and miR-58, 47% GC). Total RNA from wild-type worms and from mutant worms with a deletion for a miRNA gene was purified and used for northern blotting. Both miR-58 and miR-52 were detected in total RNA from wild-type worms but not in total RNA from the corresponding mutants (Figure 4). However, miR-1 could not be detected in any case despite seven attempts (data not shown) and could only be detected using a 17 nt probe (Supplementary Figure S6), possibly because the low GC content of the 15 nt probe used (27% GC) is below the threshold for stable binding at room temperature.
Figure 4.

Short probes can specifically detect miRNAs in total RNA. Total RNA prepared from wild-type and miRNA deletion (mir-58(n4640), mir-52(n4114) and mir-1(n4101)) worm strains was used for northern blotting with short (15 nt) probes against mir-58 or mir-52. Synthetic RNAs that matched the sequence of full length (10 fmol of 24 nt, weakly detected) and shortened forms (50 fmol of 16 nt, clearly detected) of miR-58 and miR-52 were run as size markers. U6 was probed to indicate loading.

Short probes can specifically detect miRNAs in total RNA. Total RNA prepared from wild-type and miRNA deletion (mir-58(n4640), mir-52(n4114) and mir-1(n4101)) worm strains was used for northern blotting with short (15 nt) probes against mir-58 or mir-52. Synthetic RNAs that matched the sequence of full length (10 fmol of 24 nt, weakly detected) and shortened forms (50 fmol of 16 nt, clearly detected) of miR-58 and miR-52 were run as size markers. U6 was probed to indicate loading. Finally, we set out to test if our method can be applied to detect endogenous RNAs that are smaller than typical miRNAs in complex biological samples. A well-studied example of such biologically important small RNAs is pRNAs (reviewed in 53 and 54). These small RNAs are transcribed using 6S RNA as a template and can be detected shortly after starved cells are shifted to abundant nutrient conditions. Total RNA was purified from wild-type and 6S RNA (ssrS1) null E. coli 3 and 5 min after transferring stationary cells to fresh growth media. A DIG-tailed 15 nt DNA oligo hybridized at room temperature was able to specifically detect pRNAs from 14 nt elongated to 23 nt with no signal seen in the 6S RNA deletion strain (Figure 5). These results demonstrate the utility of the modifications we propose – shorter probes and lower hybridization temperatures—for the effective detection of a specific small RNA in total RNA.
Figure 5.

Short probes can specifically detect pRNAs in total RNA. Total RNA prepared from wild-type and ssrS1 null (ΔssrS1) bacterial strains was used for northern blotting with a 15 nt probe for pRNAs. Synthetic RNA matching the sequence of 14 nt pRNA was run as a size marker. 5S rRNA was probed to indicate loading.

Short probes can specifically detect pRNAs in total RNA. Total RNA prepared from wild-type and ssrS1 null (ΔssrS1) bacterial strains was used for northern blotting with a 15 nt probe for pRNAs. Synthetic RNA matching the sequence of 14 nt pRNA was run as a size marker. 5S rRNA was probed to indicate loading.

DISCUSSION

We detected unexpected forms of miRNAs in some RNA-seq datasets and found that northern blotting using miRNA-length probes can be biased against detecting shorter RNAs. We demonstrate that adjusting probe size and hybridization conditions can allow equivalent detection of 24 to 14 nt RNAs with single-nucleotide sequence discrimination using affordable DNA oligos as probes.

Possible origins of miRNA variants in RNA-seq datasets

We detect two unusual miRNA variants in RNA-seq datasets—5΄ truncated forms of miRNA and circularly permuted forms of miRNA. The truncated forms of miRNA could arise during sample preparation before RNA extraction. We observed that in most cases truncated miRNAs were missing sequences from the 5΄ end (Figure 1 and Supplementary Figure S1). Such truncations were particularly pronounced in all members of the miR-51 family. These observations suggest possible degradation of miRNAs by a 5΄ to 3΄ exoribonuclease. Consistently, C. elegans has a 5΄ to 3΄ exoribonuclease called XRN-2 that can act on miRNAs in general (55) and for which the miR-51 family could be an especially good substrate (56). Such mechanisms that can degrade miRNAs raise concerns about the accuracy of inferring the levels of miRNAs that function in vivo from RNA-seq and suggest the use of truncated versions of miR-51 family members as indicators of poor RNA quality. Circularly permuted forms of miRNA could arise during ligation steps required for library preparation. The number of reads that we count as circularly permuted is likely an underestimation. The ∼0.16% of miR-52 detected as circularly permuted in Supplementary Figure S3B reflect only the sub-population of circularized RNAs that became linear and were then successfully joined to sequencing adaptors. Those RNAs that remained as circles or did not get ligated to adaptors would escape detection. Although circular RNAs that result from RNA splicing have been detected in C. elegans (57,58), there is no evidence yet for the generation of circular miRNAs in vivo.

Technical considerations for the use of northern blotting to complement RNA-seq

Our results suggest that the hybridization temperature and GC content of probes are critical variables for the detection of short RNAs. Long probes hybridized at 22°C had off-target binding and retained a bias against short RNAs (Supplementary Figure S4), while short probes hybridized at 37–42°C did not have any detectable signal (data not shown). The significance of GC content can be seen in the miRNA northerns. The miR-52 probe with 40% GC had a markedly weaker signal compared to the miR-58 probe with 47% GC (Figure 4)—a difference of one GC base pair. We failed to detect miR-1 using a 15 nt probes with only four GC bases (27%), By using a 17 nt probe while still keeping four GC bases, we were able to successfully detect miR-1 in total RNA (Supplementary Figure S6), suggesting a minimum nucleotide composition and length requirement to use this northern blotting method. We also noted that incubation below 20°C results in components of the hybridization buffer becoming insoluble limiting how low we can adjust temperature to promote stable probe hybridization. Low incubation temperatures and appropriate washing conditions can enable short sequences to specifically detect complementary RNA. This specificity was despite the addition of a long tail (>100 nt) of random dA and DIG-conjugated dU to generate the DIG-tailed probe. Mono-DIG-labeled 15 nt DNA probes failed to detect miR-52 and mono-DIG-labeled 24 nt DNA probes detected miR-52 with a weaker signal than DIG-tailed probes (Supplementary Figure S4). Thus, using DIG-tailed probes reduces cost (∼20× compared with mono-DIG probes and ∼150× compared to DIG-LNA probes) and enhances the detectable signal without apparent off-target interactions from the long tail.

Applications of the modified northern blotting described in this study

The drastic bias against short RNAs observed when miRNA-length probes are used (Figure 3A), suggest that the careful study of isomiRs (12) will require designing probes that match the shortest form of the miRNA. Furthermore, this bias raises the possibility that RNA quantified by northern blotting in previous studies might have under-reported RNAs that are shorter than the length of the probe. Such shorter RNAs include biologically relevant RNAs such as pRNAs (∼14 nt) in bacteria (59), NU RNAs (<18 nt) in C. elegans (60) or unusually small RNAs (∼17 nt) that were reported to associate with Argonaute proteins in mammalian cells (61). Unlike a previous method that detected pRNAs (51), our approach does not need to sacrifice specificity for sensitivity. The ability to better discriminate RNAs that differ by single nucleotides when using short probes (Figure 3C) could help resolve expression from two different alleles without the amplification necessary for RNA-seq (62) or PCR (63) approaches. Finally, reducing the bias against shorter RNAs could enable tracking intermediates of RNA processing or small RNA turnover (64) in mechanistic studies and is thus an essential consideration when probing the realm of tiny RNAs. Click here for additional data file.
  64 in total

1.  A member of the polymerase beta nucleotidyltransferase superfamily is required for RNA interference in C. elegans.

Authors:  Chun-Chieh G Chen; Martin J Simard; Hiroaki Tabara; Daniel R Brownell; Jennifer A McCollough; Craig C Mello
Journal:  Curr Biol       Date:  2005-02-22       Impact factor: 10.834

2.  Sensitive and specific detection of microRNAs by northern blot analysis using LNA-modified oligonucleotide probes.

Authors:  Anna Válóczi; Csaba Hornyik; Nóra Varga; József Burgyán; Sakari Kauppinen; Zoltán Havelda
Journal:  Nucleic Acids Res       Date:  2004-12-14       Impact factor: 16.971

Review 3.  IsomiRs--the overlooked repertoire in the dynamic microRNAome.

Authors:  Corine T Neilsen; Gregory J Goodall; Cameron P Bracken
Journal:  Trends Genet       Date:  2012-08-08       Impact factor: 11.639

Review 4.  Using next-generation RNA sequencing to identify imprinted genes.

Authors:  X Wang; A G Clark
Journal:  Heredity (Edinb)       Date:  2014-03-12       Impact factor: 3.821

5.  Dicer partner proteins tune the length of mature miRNAs in flies and mammals.

Authors:  Ryuya Fukunaga; Bo W Han; Jui-Hung Hung; Jia Xu; Zhiping Weng; Phillip D Zamore
Journal:  Cell       Date:  2012-10-11       Impact factor: 41.582

6.  A mammalian microRNA expression atlas based on small RNA library sequencing.

Authors:  Pablo Landgraf; Mirabela Rusu; Robert Sheridan; Alain Sewer; Nicola Iovino; Alexei Aravin; Sébastien Pfeffer; Amanda Rice; Alice O Kamphorst; Markus Landthaler; Carolina Lin; Nicholas D Socci; Leandro Hermida; Valerio Fulci; Sabina Chiaretti; Robin Foà; Julia Schliwka; Uta Fuchs; Astrid Novosel; Roman-Ulrich Müller; Bernhard Schermer; Ute Bissels; Jason Inman; Quang Phan; Minchen Chien; David B Weir; Ruchi Choksi; Gabriella De Vita; Daniela Frezzetti; Hans-Ingo Trompeter; Veit Hornung; Grace Teng; Gunther Hartmann; Miklos Palkovits; Roberto Di Lauro; Peter Wernet; Giuseppe Macino; Charles E Rogler; James W Nagle; Jingyue Ju; F Nina Papavasiliou; Thomas Benzing; Peter Lichter; Wayne Tam; Michael J Brownstein; Andreas Bosio; Arndt Borkhardt; James J Russo; Chris Sander; Mihaela Zavolan; Thomas Tuschl
Journal:  Cell       Date:  2007-06-29       Impact factor: 41.582

7.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences.

Authors:  Jeremy Goecks; Anton Nekrutenko; James Taylor
Journal:  Genome Biol       Date:  2010-08-25       Impact factor: 13.583

8.  Carbodiimide-mediated cross-linking of RNA to nylon membranes improves the detection of siRNA, miRNA and piRNA by northern blot.

Authors:  Gurman Singh Pall; Carles Codony-Servat; Jane Byrne; Leigh Ritchie; Andrew Hamilton
Journal:  Nucleic Acids Res       Date:  2007-04-02       Impact factor: 16.971

9.  Destabilization of microRNAs in human cells by 3' deadenylation mediated by PARN and CUGBP1.

Authors:  Takayuki Katoh; Hiroaki Hojo; Tsutomu Suzuki
Journal:  Nucleic Acids Res       Date:  2015-06-30       Impact factor: 16.971

Review 10.  Biases in small RNA deep sequencing data.

Authors:  Carsten A Raabe; Thean-Hock Tang; Juergen Brosius; Timofey S Rozhdestvensky
Journal:  Nucleic Acids Res       Date:  2013-11-05       Impact factor: 16.971

View more
  3 in total

1.  Photoelectrochemical biosensor for microRNA detection based on multiple amplification strategies.

Authors:  Minghui Wang; Huanshun Yin; Yunlei Zhou; Jingrui Han; Tingqun He; Lin Cui; Shiyun Ai
Journal:  Mikrochim Acta       Date:  2018-04-20       Impact factor: 5.833

2.  Electrophoretic mobility shift as a molecular beacon-based readout for miRNA detection.

Authors:  Getulio P Oliveira-Jr; Raquel H Barbosa; Lauren Thompson; Brandy Pinckney; Moriah Murphy-Thornley; Shulin Lu; Jennifer Jones; Clinton H Hansen; John Tigges; Wesley P Wong; Ionita C Ghiran
Journal:  Biosens Bioelectron       Date:  2021-05-15       Impact factor: 12.545

3.  Endogenous microRNA triggered enzyme-free DNA logic self-assembly for amplified bioimaging and enhanced gene therapy via in situ generation of siRNAs.

Authors:  Qinghua Jiang; Shuzhen Yue; Kaixin Yu; Tian Tian; Jian Zhang; Huijun Chu; Zhumei Cui; Sai Bi
Journal:  J Nanobiotechnology       Date:  2021-09-26       Impact factor: 10.435

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.