Literature DB >> 32161805

RNA profile diversity across arthropoda: guidelines, methodological artifacts, and expected outcomes.

Danielle M DeLeo1, Jorge L Pérez-Moreno1, Hernán Vázquez-Miranda1, Heather D Bracken-Grissom1.   

Abstract

High-quality RNA is an important precursor for high-throughput RNA sequencing (RNAseq) and subsequent analyses. However, the primary metric used to assess RNA quality, the RNA Integrity Number (RIN), was developed based on model bacterial and vertebrate organisms. Though the phenomenon is not widely recognized, invertebrate 28S ribosomal RNA (rRNA) is highly prone to a form of denaturation known as gap deletion, in which the subunit collapses into two smaller fragments. In many nonmodel invertebrates, this collapse of the 28S subunit appears as a single band similar in size to the 18S rRNA subunit. This phenomenon is hypothesized to be commonplace among arthropods and is often misinterpreted as a "degraded" rRNA profile. The limited characterization of gap deletion in arthropods, a highly diverse group, as well as other nonmodel invertebrates, often biases RNA quality assessments. To test whether the collapse of 28S is a general pattern or a methodological artifact, we sampled more than half of the major lineages within Arthropoda. We found that the 28S collapse is present in ∼90% of the species sampled. Nevertheless, RNA profiles exhibit considerable diversity with a range of banding patterns. High-throughput RNAseq and subsequent assembly of high-quality transcriptomes from select arthropod species exhibiting collapsed 28S subunits further illustrates the limitations of current RIN proxies in accurately characterizing RNA quality in nonmodel organisms. Furthermore, we show that this form of 28S denaturation, which is often mistaken for true "degradation," can occur at relatively low temperatures.
© The Author(s) 2018. Published by Oxford University Press.

Entities:  

Keywords:  RNAseq; crustaceans; denaturation; gap deletion; genomics; insects; invertebrates; nucleic acids; transcriptomics

Year:  2018        PMID: 32161805      PMCID: PMC6994094          DOI: 10.1093/biomethods/bpy012

Source DB:  PubMed          Journal:  Biol Methods Protoc        ISSN: 2396-8923


Introduction

Next-generation RNA sequencing (RNAseq) has become a revolutionary tool for RNA-based characterization studies. Similar to traditional quantification methods (i.e., quantitative polymerase chain reaction (PCR)), it can now be performed in most modern laboratories with relative ease and accuracy [1] while yielding more precise and robust datasets [2]. For this reason, RNAseq has become a powerful technique for studying an array of biological questions in both model and nonmodel organisms, including de novo transcriptome reconstruction and profiling, gene discovery and functional analyses, differential gene expression studies [3], and evolutionary investigations (e.g., [4]). Moreover, continued advancements to sequencing technologies and analysis software have expanded the reach and utility of RNAseq to include microscopic inputs (singe-cell RNAseq), enabling spatial mapping and tissue reconstruction [5, 6]. Compared with DNA, total RNA extraction and sequencing poses nontrivial methodological challenges to maintain RNA stability. Due to its ephemeral nature, RNA is prone to cellular degradation [7] from the initial cellular lysis step to final visualization and quantitation. In addition, ribonucleases (RNases), ubiquitous enzymes present in cells and the environment can further degrade RNA if proper care is not taken during sample preservation, extraction, and downstream protocols [7]. To prevent degradation and loss of transcriptomic information, most extraction guidelines suggest using dedicated equipment in isolated areas and frequent, thorough RNase decontamination throughout the protocol. Ideally, samples should be flash frozen or preserved in specialized salt-saturated buffers (e.g., RNAlater®; [8]), when the use of fresh tissue is not feasible, and kept frozen at −80°C prior to extraction and post-isolation to prevent native degradation. These rigorous methodological demands portray the labile nature of RNA. Historically, RNA was assumed to be of high quality if the ratio of large to small ribosomal subunits (28S:18S) was ∼2 or higher [9], though this method of assessment is sample consuming and imprecise [10]. Presently, there are two common methods used to test for RNA integrity and degradation after extraction: (i) visualization via denaturing (agarose) gel electrophoresis and (ii) fragment size chip electropherograms on a Bioanalyzer (Agilent). In particular, the Bioanalyzer step calculates the RNA integrity number (RIN) [11], now considered as the gold standard for successful library preparation and sequencing [10, 12–15]. The RIN model is estimated from RNA fragment distribution peaks and areas under the curve, and ranges from 1 to 10 (low to pristine quality, respectively). Sample RIN values of ≥7 are typically deemed as “high-quality” and appropriate for RNAseq (e.g., [16]), and often used as a cutoff for which sequencing facilities will guarantee results. Another common standard used to assess RNA integrity is ribosomal subunit size. Using rodent and human models, size has been estimated to be between 4400 and 5000 base pairs (bps) in the large 28S subunit and ∼2000 bp in the small 18S subunit [17]. Since RNA degradation can occur at any step in most protocols, it is paramount to determine appropriate reference standards and metrics for accurate RNA quality control [12]. Although relevant ribosomal RNA (rRNA) profile standards exist for several model organisms [18], most branches across the “Tree of Life” lack genomic references and resources. This paucity of genomic data is particularly visible across Arthropoda, a globally distributed group that encompasses >80% of animal diversity [19] and occupies a variety of niches from the poles to the tropics, high mountains, and deep-sea trenches. They fulfill significant roles in ecosystems (e.g., pollination and pests) and are of medical (i.e., disease vectors) and economic importance (e.g., fisheries) [20]. Due to their global relevance, the need for increased genomic references and reliable metrics to evaluate nucleic acid quality is crucial for the understanding of multiple evolutionary and ecological phenomena. There are few recent studies that have evaluated the universality and suitability of “standard” RNA quality metrics for nonmodel organisms (e.g., [21]). Early observations of RNA profiles in insects documented a “hidden break” of the 28S subunit, also referred to as gap deletion, in which the 28S denatures and collapses (decouples) into two smaller fragments (α and β) [22, 23] and a short, variable stretch of rRNA is removed or deleted [24, 25). This produces an apparent “degraded” RNA profile where both 18S and 28S overlap in a single band. As RNAseq has become a common method for RNA investigations in recent years, more and more invertebrate rRNA profiles suggest that this “hidden break” – a form of 28S denaturation – is the norm rather than the exception [21, 26, 27]. Hidden breaks in RNA are correlated with the presence of a cleavage site characterized as an Uracil (U) and Adenine (A) or UAAU-rich segment in the rRNA loop [28]. High temperature exposure is believed to result in UAAU decoupling and the subsequent dissociation (collapse) of the 28S [21-23]. This dissociation was hypothesized to occur as Hydrogen-bonds in this AU-rich region melt – the strength of the bonds are comparable to the strength of the chemical changes associated with denaturing conditions [22]; the double bonds of the AU pairs are weaker relative to the triple bonds of Guanine-Cytosine (GC) associations [29]. Differences in base-stacking stability [30], secondary structures and enzyme-dependent cleavage events are also thought to play a role [25]. Likewise, mutations in this UAAU-rich segment are thought to prevent this dissociation [28]. In a recent study, McCarthy et al. [21] hypothesized that exposing RNA to heat at 70°C, mainly for electropherogram preparation in a Bioanalyzer, was responsible for the 28S subunit collapse. However, there are additional steps often used in RNA extraction protocols, such as gel electrophoresis and deoxyribonuclease (DNase) applications for DNA removal, which may also expose samples to temperatures high enough to trigger 28S cleavage. As DNase treatments can be performed at relatively low temperatures (∼37°C) for enzymes that do not require deactivation at 70°C, and temperature can be regulated during electrophoresis (e.g., ice bath and chilled buffers), 70°C may be an exaggerated upper limit for this denaturation to occur. Given the UAAU loop region is sensitive to heat, DNase enzyme activation temperatures of 37°C may result in 28S denaturation prior to 70°C exposure, and possibly result in different “degraded” profiles. Moreover, wider taxon sampling may reveal variable processing of this gap (loop) region, including insertions/deletions of additional UAAU-rich segments, yielding additional RNA profiles. Here we document the range of variation among RNA extraction profiles for the major lineages within Arthropoda, with dense taxonomic sampling. Our primary motivation is to disentangle natural patterns of subunit denaturing from true degradation and methodological artifacts. Data available from prior studies are a composite of multiple extraction protocols from different laboratories that can further obscure these artifacts. This study was done using standardized reagents and conditions to improve comparisons among taxa. High-throughput RNAseq and de novo transcriptome assembly for two representative arthropod species exhibiting seemingly denatured RNA profiles provide further support for high-quality RNA despite evidence of 28S subunit collapse. Findings from this effort will aid future nucleic acid quality assessments for arthropods and other nonmodel invertebrates and will provide methodological insight for laboratories beginning to conduct RNA-based research.

Materials and methods

Arthropod lineages

We selected representative lineages covering nine classes across the four extant Arthropod subphyla [31, 32] (Table 1, n = total number of individuals sampled within each subphyla): Chelicerata (n = 4), Myriapoda (n = 11), Crustacea (n = 24), and Hexapoda (n = 20). Within Chelicerata, we sampled Arachnida (e.g., spiders and scorpions) and Merostomata (horseshoe crabs); within Myriapoda, we sampled Chilopoda (centipedes) and Diplopoda (millipedes); within Crustacea, we sampled Branchiopoda (water fleas), Oligostraca (e.g., barnacles), Multicrustacea (e.g., copepods), and Malacostraca (e.g., crabs, shrimp, lobsters, isopods, and amphipods); and within Hexapoda, we sampled Insecta (insects) and Entognatha (springtails). Our lineage selection encompasses all current Arthropod subphyla, approximately two-thirds of the classes in both Chelicerata and Crustacea, all classes within Hexapoda and half of all classes in Myriapoda. To account for the large diversity within Hexapoda and Crustacea (“Pancrustacea,” [33]), we sampled some groups more heavily; approximately half of all orders in Insecta [34] and infraorders in Decapoda [35]. We were not able to sample Pycnogonida, Pauropoda, Symphyla, Remipedia, Ostracoda, Ichthyostraca, and Cephalocarida due to lack of available RNA quality tissue. Onychophora and Tardigrada are panarthropods considered as sister to Arthropoda, but their rarity and/or small size made their inclusion in this study unfeasible.
Table 1:

Samples utilized in this study

Fig. 1HBGSubPhylumClassOrderSpeciesCommon nameTissueRNA bands
A014858CrustaceaMalacostracaIsopodaCaecidotea cf. communisIsopodWhole2b
E072867CrustaceaMalacostracaIsopodaAsellus aquaticusRiver isopod 1Whole3
E082873CrustaceaMalacostracaIsopodaAsellus aquaticusRiver isopod 2Whole3
E092861-2CrustaceaMalacostracaIsopodaAsellus aquaticusRiver isopod 3Whole (2x)3
F055062CrustaceaMalacostracaIsopodaCubaris cf. murinusLand isopod 1Whole5Δ
F065063CrustaceaMalacostracaIsopodaCubaris cf. murinusLand isopod 2Whole5
F075064CrustaceaMalacostracaIsopodaCubaris cf. murinusLand isopod 3Whole5
A094866CrustaceaMalacostracaAmphipodaGammarus pulexAmphipodWhole1
E052850CrustaceaMalacostracaAmphipodaNiphargus hrabeiRiver Amphipod 2Whole1
E062855CrustaceaMalacostracaAmphipodaNiphargus hrabeiRiver Amphipod 3Whole1
B112843-4CrustaceaMalacostracaAmphipodaNiphargus hrabeiRiver amphipod 1Whole (2x)1
B024870CrustaceaMalacostracaDecapodaStenopus hispidusCoral-banded Shrimp1 chelae2b
E023047CrustaceaMalacostracaDecapodaOplophorus spinosusPelagic shrimpEyes2b
E043180CrustaceaMalacostracaDecapodaFarfantepenneaeus duorarumPink shrimp1 eye1
D024893CrustaceaMalacostracaDecapodaClibanarius tricolorThree-colored hermit crabAll legs4
E033162CrustaceaMalacostracaDecapodaEurypanopeus depressusMud crabGill2
F085273CrustaceaMalacostracaDecapodaCalinectes sapidusBlue CrabOne claw3
F093018.ECrustaceaMalacostracaDecapodaBarbouria cubensisCave shrimp 1Eyes2
F103018.ACrustaceaMalacostracaDecapodaBarbouria cubensisCave shrimp 2Antennae2
F113018.SCrustaceaMalacostracaDecapodaBarbouria cubensisCave shrimp 3SDO2
B084876CrustaceaMalacostracaHolocaridaGonodactylus spMantis shrimpCephalothorax1
C064886CrustaceaBranchiopodaCladoceraDaphnia magnaWater fleaWhole (10x)3
B094878CrustaceaCopepodaaCalanoidaCyclops spFreshwater copepodWhole2b
C054885CrustaceaThecostracaaSessilia Amphibalanus eburneusIvory barnacleWhole (4x)1
A024859ChelicerataArachnidaAraneaeNephila clavipes Golden web Orb spiderHead1
B044872ChelicerataArachnidaScorpionesHoffmannius spinigerusStripe-tailed scorpion1 leg1
D054898ChelicerataArachnidaAraneaeGasteracantha cancriformisCrab SpiderHead1
B054873ChelicerataXiphosuraXiphosuridaLimulus polyphemusAtlantic horseshoe crab1 leg2b
A034860HexapodaInsectaThysanuraLepisma saccharinaSilverfish 1Whole1
B064874HexapodaInsectaThysanuraLepisma saccharinaSilverfish 2Whole1
A044861HexapodaInsectaDipteraDrosophila melanogasterFruitflyWhole (10x)1
C014880HexapodaInsectaDipteraTipula sayiCrane flyWhole2b
A064863HexapodaInsectaHymenopteraApis melliferaBeeHead1
A104867HexapodaInsectaHymenopteraCamponatus floridanusFlorida carpenter antTwo heads1
A074864HexapodaInsectaLepidopteraAscia monusteGreat Southern WhiteHead + abdomen1
A084865HexapodaInsectaColeopteraTenebrio molitorMealwormhead1
C044884HexapodaInsectaColeopteraIgnelater havaniensisGlowing click beetleAll photo phores1
A114868HexapodaInsectaOrthopteraAcheta domesticaBrown cricketHead1
B074875HexapodaInsectaEphemeropteraCallibaetis floridianusMayflyWhole1
B104879HexapodaInsectaBlattodeaBlattella germanicaGerman cockroachHead2b
C024881HexapodaInsectaBlattodeaReticulitermes flavipesSubterranean termiteWhole2
D014892HexapodaInsectaBlattodeaPeriplaneta americanaAmerican cockroachHead1
D034894HexapodaInsectaOdonataBrachymesia gravidaFour-spotted PennantHeadD
D044895HexapodaInsectaOdonataBrachymesia gravidaFour-spotted PennantHead2b
C034883HexapodaInsectaMantodeaSphodromantis viridisAfrican giant mantis1 leg1
C074887HexapodaInsectaMecopteraPanorpa debilisScorpion fly1 smallWhole1
C084888HexapodaInsectaMecopteraPanorpa debilisScorpion fly2 bigWhole1
B034871HexapodaEntognataCollembolaFolsomia candidaGiant springtailWhole (20x)1
A054862MyriapodaDiplopodaSpirobolidaAnadenobolus monilicornisBumble bee millipede 1Head+1/3 body2
D065053MyriapodaDiplopodaSpirobolidaAnadenobolus monilicornisBumble bee millipede 2Head2Δ
D075054MyriapodaDiplopodaSpirobolidaAnadenobolus monilicornisBumble bee millipede 3Head2Δ
D085055MyriapodaDiplopodaSpirobolidaAnadenobolus monilicornisBumble bee millipede 4Head2Δ
B014869MyriapodaChilopodaScolopendromorphaScolopendra polymorphaTiger centipedeHead4
F015058MyriapodaChilopodaScolopendromorphaHemiscolopendra marginataBlue centipede 1Head1
F025059MyriapodaChilopodaScolopendromorphaHemiscolopendra marginataBlue centipede 2Head1
F035060MyriapodaChilopodaScolopendromorphaHemiscolopendra marginataBlue centipede 3Head1
F045061MyriapodaChilopodaScolopendromorphaHemiscolopendra marginataBlue centipede 4Head1
E01REFProteo- bacteriaGamma- proteobacteriaEnterobacterialesEscherichia coliQubit S21 μL2Δ

The first column corresponds to lanes on electropherograms (Fig. 1). HBG corresponds to voucher numbers in the Zoological collection at FIU. Subphylum to Species columns correspond to taxonomic classification (aSubclass). Common name stands for labels on Fig. 1. Tissue column indicates what part of the body was used for extraction and whether multiple individuals were pooled together. RNA bands column indicates the number of ribosomal fragments detected in Fig. 1.

D denotes a degraded sample, whereas Δ marks samples with shifted RNA bands (which would require manual adjustment for proper size estimation) and bdenotes samples with denatured profiles despite a two-banded pattern.

Samples utilized in this study The first column corresponds to lanes on electropherograms (Fig. 1). HBG corresponds to voucher numbers in the Zoological collection at FIU. Subphylum to Species columns correspond to taxonomic classification (aSubclass). Common name stands for labels on Fig. 1. Tissue column indicates what part of the body was used for extraction and whether multiple individuals were pooled together. RNA bands column indicates the number of ribosomal fragments detected in Fig. 1.
Figure 1:

Bioanalyzer electropherograms of total RNA for a diverse array of Arthropod lineages. Sample information including taxonomic classifications are given in Table 1. Samples in Panels a–f are randomized (nontaxonomic order) but corresponding lane information is in Table 1. Ladders correspond to the picochip RNA size standard. Red (or highlighted) cells above a lane indicate that the RIN could not be calculated.

D denotes a degraded sample, whereas Δ marks samples with shifted RNA bands (which would require manual adjustment for proper size estimation) and bdenotes samples with denatured profiles despite a two-banded pattern.

Specimen preservation and tissue harvesting

We collected live specimens from the Biscayne Bay Campus of Florida International University (FIU) in North Miami, FL, USA, and brought them back to the lab for preservation. Some specimens were ordered from a commercial vendor (Carolina Biology, NC, USA) or donated by colleagues (see Acknowledgements section). Smaller individuals (bee size and below) were flash frozen and preserved at −80°C. Larger individuals were placed in RNAlater (Life Technologies, Foster City, CA, USA) with an abdominal incision and then placed at −80°C until RNA extraction. All specimens/or tissues are cataloged and deposited as vouchers (specimen or tissue) in the Florida International Crustacean Collection at FIU (Table 1).

RNA extraction

We extracted RNA from 40 species across the four Arthropod subphyla. RNA was extracted from replicates where possible, including: the isopods Asellus aquaticus (n = 3), Cubaris cf. murinus (n = 3), amphipod Niphargus hrabei (n = 3), cave shrimp Barbouria cubensis (n = 3), silverfish Lepisma saccharina (n = 2), the insects Brachymesia gravida (n = 2) and Panorpa debilis (n = 2), the millipede Anadenobolus monilicornis (n = 4), and centipede Hemiscolopendra marginata (n = 4). Though RNA quality tissue replicates were difficult to obtain for some species, all species were included in the study regardless as an effort to more thoroughly survey RNA profile diversity across Arthropoda. All stages of our RNA extraction protocol were carried out under a fume hood in a dedicated RNase-free room. Bleach (2% solution in distilled water) was applied to all surfaces, equipment, and reagent containers to remove RNases and secondarily sterilized with RNaseZAP® (Life Technologies, Carlsbad, CA, USA). We isolated total RNA from varied tissues or body parts depending on the specimen size (see Table 1). For very small specimens, we used the entire body or pooled multiple individuals. Tissues were homogenized in 1 mL TRIZOL® Reagent (Life Technologies, Carlsbad, CA, USA). Approximately 10–12 sterile 2 mm zirconia/ceramic beads were used in a MiniBead Beater (BioSpec, Bartlesville, OK, USA) for homogenization in a 2 mL screwcap tube, followed by the addition of chloroform (0.2 mL). Samples were mixed (shaken) vigorously by hand for 15  s and incubated at room temperature for 2–3 min. Phase separation was carried out by centrifuging samples (12 000g for 15 min) at 4°C, following manufacturer’s instructions, and carefully transferring only the top (clear) aqueous phase containing the RNA into new vials. RNA was then precipitated from the clear aqueous phase by mixing in (∼100%) isopropyl alcohol (0.5 mL) and glycogen (0.1 μL). Samples were then incubated at room temperature for 10 min and centrifuged (12 000g for 15 min) at 4°C. The supernatant was discarded, the RNA pellet washed twice with freshly made 75% ethanol (EtOH) and the supernatant was again discarded; 1 mL EtOH was added before centrifugation at 7500g for 5 min at 4°C (X2). Before elution, precipitated RNA was treated with DNases (Clonetech, Mountain View, CA, USA) to remove genomic DNA (15-min incubation at 37°C) following manufacturer’s guidelines. After repeating the isopropyl precipitation and ethanol washes (2×), total RNA was eluted in 50 μL of RNase-free water (Sigma, St Louis, MO, USA) and stored at −80°C. RNA integrity was determined initially on a 1% agarose gel (made with Tris base, acetic acid and Edetic acid (EDTA) or TAE) treated with a 1% bleach solution and ran in cold buffer at 4°C to prevent RNA degradation during electrophoresis (e.g., 500 μL bleach diluted in 50 mL agarose TAE solution [36]). In order to determine the size of the 18S and 28S rRNA subunits and check for degraded samples, we added a RiboRuler RNA ladder (ThermoFisher, USA) to the agarose gel. The RNA concentration of each extraction was measured using a Qubit 2.0 fluorometer (Life Technologies, USA). Finally, we diluted a subsample from every extraction to 5 ng/μL and determined the RNA profile by measuring ribosomal number, size, and RIN number using a BioAnalyzer 2100 with RNA 6000 picochips (Agilent, USA) in the DNA Core Facility at FIU; all samples were processed without the heat exposure (70°C) typical of bioanalyzer protocols. Gloves were changed between each protocol step and RNaseZap was used liberally.

High-throughput sequencing of RNA exhibiting 28S denaturation

RNA aliquots of the isopod A. aquaticus and amphipod N. hrabei, each exhibiting denatured RNA profiles indicative of 28S collapse (Fig. 3a and b, respectively), underwent mRNA isolation and complementary DNA (cDNA) library prep using the NEBNext® UltraTM II Directional RNA Library Prep Kit for Illumina®. Libraries were then sequenced on an Illumina HiSeq4000 (for more details, see [37]). These data are available as part of the BioProject PRJNA476149. Raw data were quality checked, trimmed and assembled de novo according to the methods described in Pérez-Moreno et al. [37]. Briefly, the quality of raw sequencing data was assessed with FastQC [38] and trimmed with Trimmomatic v0.36 [39] using the following parameters: ILLUMINACLIP: 2: 30: 10 CROP: 140 HEADCROP: 20 LEADING: 15 TRAILING: 15 SLIDINGWINDOW: 4: 20 MINLEN: 36. High-quality adaptor trimmed data were assembled with Trinity v2.5.0 (minimum transcript length 200 bp; k-mer size 23). The resulting de novo transcriptome assembly was assessed using Transrate v1.0.3 [40] and Benchmarking Universal Single-Copy Orthologs (BUSCO) v3.0.2 [41] using OrthoDB’s Arthropod database of orthologous groups (n = 1066) [42] to determine assembly quality and completeness.
Figure 3:

De novo transcriptome assembly statistics for the isopod Asellus aquaticus (a) and amphipod Niphargus hrabei (b), each exhibiting denatured RNA profiles indicative of 28S rRNA subunit collapse (left). Raw data were generated on an Illumina HiSeq4000 (BioProject PRJNA476149) using methods described in [37]. Transcriptomes were assembled using Trinity and assessed for quality and completeness using Transrate and BUSCO. Metrics include: total number of assembled transcripts, longest generated transcript (bp), mean length of all transcripts (bp), number of transcripts with an open reading frame (ORF), mean ORF percent, N50 statistic and overall GC content (see [37] for additional details).

Distinguishing true RNA degradation from analytical artifacts

High temperature exposure is considered as one of the determining factors behind the 28S subunit collapse [21, 22]. To test whether denaturation could occur at temperatures <70°C, we chose three representative lineages with differing rRNA profiles: millipede, centipede, and crab. We used a bacterial reference (total) RNA (Escherichia coli, Life Technologies, USA) as a noncleaved 28S positive control. RNA aliquots were taken from the three samples prior to the DNase treatment, precipitated, eluted, and kept at 4°C. We prepared a heat treatment gradient as recommended by the Bioanalyzer manufacturer, heating all samples in a MJ Research PTC-200 thermocycler, incubating for 2 min at a designated temperature using a nonheated lid, prior to cooling at 4°C. RNA from the four samples (including the positive control) were aliquoted into eight different temperature treatment sets (0.2 mL PCR strip tubes). Each set was placed across a temperature gradient ranging from 20°C to 90°C, at increments of 10°C. This range encompasses “room temperature” and standard DNase treatment temperatures, as well as temperatures 20°C higher than the recommended denaturing step in an attempt to force denaturation for comparison. After the eight tube sets were incubated at their corresponding temperature on the thermal gradient, we kept the samples at 4°C and stored them permanently at −80°C until they were run on both a 1% TAE bleach gel in cool buffer and Bioanalyzer picochips as described previously.

Results

An array of RNA profile patterns within arthropoda

In total, we found six distinct patterns of 28S denaturation (Table 1), and banding patterns were consistent across replicates. In most arthropod lineages, we found a consistent “denatured” or “collapsed” RNA profile, where the 28S subunit is denatured into equally sized α-β fragments (Fig. 1). This dominant pattern was found in 22 species and appeared on the gel as a single thick band resulting from the collapse of 28S and subsequent overlap with 18S at around 1900 bp (Fig. 1a–f); a size estimate reported previously for the honey bee Apis mellifera (27). We detected two prominent bands in the fruit fly, cranefly (Diptera), pelagic shrimp, banded-coral shrimp, blue crab, and mudcrab (Decapoda), at ∼2000 and 1900 bp, respectively. Three well-defined bands were detected in river isopods (Isopoda), marine crabs (Decapoda), and the water flea (Cladocera) at ∼2200, 2000, and 1900 bp. The hermit crab (Decapoda) appeared to have four bands of similar size between 1800 and 2200 bp although it is possible that the presence of minor degradation may have confounded this profile. The only lineage with four distinct bands was the tiger centipede (Chilopoda) at ∼2000, 1800, 1000 and 800 bp. Land isopods (Isopoda) showed five distinct bands at ∼2200, 2000, 1900, 1700 and 600 bp though some degradation was present. Lastly, we found a nondenatured, two-band profile for cave shrimp (Decapoda) and bumblebee millipede (Diplopoda) – with subunit bands at ∼3900 bp and 1900 bp, similar to the model rRNA references [17] (Fig. 1). It is important to note that the rRNA fragments of some replicate lanes appear to have shifted on the picochip (e.g., Fig. 1f, lane 05) and would require manual adjustments for accurate size estimation (see Table 1), though the banding patterns remain consistent. Bioanalyzer electropherograms of total RNA for a diverse array of Arthropod lineages. Sample information including taxonomic classifications are given in Table 1. Samples in Panels a–f are randomized (nontaxonomic order) but corresponding lane information is in Table 1. Ladders correspond to the picochip RNA size standard. Red (or highlighted) cells above a lane indicate that the RIN could not be calculated. Some samples did show “true” degradation <1800 bp including the hermit crab Clibanarius tricolor and one replicate of the dragonfly Brachymesia gravida (Table 1 and Fig. 1), which appeared as multiple bands on the Bioanalyzer traces. Likewise, residual DNA contamination was evident in the mud crab Eurypanopeus depressus after the DNase treatment (Fig. 1e–l); it is possible that the DNase was not as effective because a higher volume of tissue was available and used for this particular extraction. Many samples also exhibited remnants of the original size of the 28S subunit below the 4000 bp mark, appearing as a faint band and a shorter peak on the electropherograms (Fig. 1). Mean sizes for the original 28S and the 18S peaks suggest that, in arthropods, RNA profiles with no denaturation can be measured at ∼3900 bp and 1900 bp, respectively. Several samples showed multiple faint bands beyond 1900 bp, which may imply some level of degradation and/or remnants of extracted 5/5.8S ribosomal subunits and tRNAs (Fig. 1).

Ribosomal 28S denaturation can occur at “low” temperatures

The temperature gradient revealed that samples with the 28S “hidden break” (blue centipede and blue crab) had at least a portion of their 28S subunit already collapsed at 20°C (Fig. 2 and Supplementary Fig. S1). Blue crab RNA maintained some copies of its 28S subunit at ∼4000 bp until reaching 80°C. The positive bacterial control (E. coli) and the millipede (A. monilicornis) maintained 28S subunit integrity throughout the 20–90°C gradient, exhibiting a two-band profile reminiscent of the vertebrate rRNA model reference. The blue centipede (H. marginata) showed a single band throughout the entire gradient (Fig. 2 and Supplementary Fig. S1). Some lanes on the electropherograms (Fig. 2) showed RNA degradation not found on the agarose gel (Supplementary Fig. S1).
Figure 2:

Temperature gradient (denaturing treatment) for three representative taxa and a positive control (See Table 1): Escherichia coli (Qubit Standard 2), Millipede (HBG5053), Centipede (HBG5058), and Blue crab (HBG5273). (a) 20–30°C, (b) 40–50°C, (c) 60–70°C, and (d) 80–90°C. Ladder corresponds to the picochip RNA size standard. These samples match those used in Supplementary Fig. S1.

Temperature gradient (denaturing treatment) for three representative taxa and a positive control (See Table 1): Escherichia coli (Qubit Standard 2), Millipede (HBG5053), Centipede (HBG5058), and Blue crab (HBG5273). (a) 20–30°C, (b) 40–50°C, (c) 60–70°C, and (d) 80–90°C. Ladder corresponds to the picochip RNA size standard. These samples match those used in Supplementary Fig. S1.

High-throughput sequencing indicates high-quality RNA despite denaturation

Approximately 32 M paired-end reads were generated for the arthropods A. aquaticus and N. hrabei, exhibiting 28S denaturation (Fig. 3). De novo transcriptome assemblies yielded ∼98.3k and 134.5k transcripts/contigs, respectively (for more details, see [37]). Correspondingly, these assemblies had mean transcript lengths of 938 and 881 bp and N50 statistics of 1737 and 1648 bp (Fig. 3). BUSCO assessments, used to evaluate the completeness of the de novo transcriptomes, revealed 90.1% (C: 90.1%, F: 6.2%, M: 3.7%, n: 1066) and 90.5% (C: 90.5%, F: 5.3%, M: 4.2%, n: 1066), respectively, of the single-copy orthologs employed for benchmarking in Arthropoda were present, indicating fairly complete, high-quality assemblies. De novo transcriptome assembly statistics for the isopod Asellus aquaticus (a) and amphipod Niphargus hrabei (b), each exhibiting denatured RNA profiles indicative of 28S rRNA subunit collapse (left). Raw data were generated on an Illumina HiSeq4000 (BioProject PRJNA476149) using methods described in [37]. Transcriptomes were assembled using Trinity and assessed for quality and completeness using Transrate and BUSCO. Metrics include: total number of assembled transcripts, longest generated transcript (bp), mean length of all transcripts (bp), number of transcripts with an open reading frame (ORF), mean ORF percent, N50 statistic and overall GC content (see [37] for additional details).

Discussion

One size does not fit all: diversity of RNA banding profiles across arthropoda

There is a growing body of evidence suggesting that the current vertebrate RNA profile references are not reliable proxies to assess the quality of invertebrate RNA with seemingly “degraded” RNA profile patterns [21–23, 27, 28]. Our results suggest that 28S “collapse” or “denaturation” is a common occurrence across Arthropoda, but is likely a complex phenomenon resulting in diverse banding patterns, some of which are newly reported here. This renders model organism RNA profiles of limited use for various nonmodel taxa. Moreover, RINs could not be estimated for a majority of the samples exhibiting the denatured 28S rRNA subunits; paradoxically, RINs were only estimated for a few samples showing some level of “true” degradation [e.g., Fig. 1f (L01-03)]. Therefore, RIN estimation may be impossible or biased if algorithms do not account for denaturation and collapsed ribosomal subunits in a variety of invertebrates though manual bioanalyzer corrections (i.e., higher anomaly thresholds upon detection of the additional peak(s) or signal) can partially reduce bias [11]. In addition to deviations from the common rRNA two-banded profile, our results further show that rRNA 28S in invertebrates is smaller than that in vertebrates, 3900 bp versus 4400–5000 bp, though the 18S sizes appear to be within a 100 bp differential margin. Bacterial RNA profile sizes are even smaller (E. coli 2900 bp for 23S and 1700 bp for 16S; Fig. 1e). These estimated size differences will allow for a more accurate and prompt assessment of rRNA in nonmodel invertebrates following RNA extraction, including the presence of bacterial and human contamination. The diversity of RNA profiles, ranging from one to five bands (Fig. 1 and Table 1), indicates that even though a one-band profile is the most frequently observed pattern among arthropods, individual lineages may vary. Some single bands within Insecta were also relatively thick, suggesting a possible artifact resulting from two high-quality overlapping bands of similar size. Denatured two-banded profiles, evident in several decapod crustaceans and insects [e.g., Fig.1a (L01), not to be mistaken with the banding profile of the vertebrate model reference], suggest that one of the two 28S fragments (α or β) is at least 100 bp longer than 18S and the other is very similar to 18S in length. The three-banded profile found among river isopods, mud and blue crabs, and Daphnia water fleas, suggests a significant translocation of the UAAU loop with 400 bp separating the two 28S fragments. Likewise, the possibility exists that a second UAAU region, or underlying secondary structure, is creating an additional cleavage point. A four-banded profile in tiger centipedes also implies that complex secondary structures within the loop region are creating additional cleavage points, possibly making them more accessible to enzymes (e.g., RNases). Alternatively, a secondary UAAU-rich loop may be present, resulting in three 28S fragments (α, β, and a novel γ) as tiger centipede RNA showed no signs of degradation (i.e., smears or a multitude of fragments). The third and fourth bands in tiger centipedes are also much smaller than bacterial rRNA subunits, rendering contamination from mitochondrial DNA or microorganisms unlikely. However, this four-banded pattern was not found in the blue centipede species (single band). This suggests that the four-banded pattern is not unanimous among Chilopoda. Similar to the RNA profiles observed in aphids [28, 42], millipedes (Diplopoda), the sister clade to centipedes, maintained the integrity of the 28S subunit, suggesting this four-banded pattern could be a genus or species-specific phenomenon. The observed variation in RNA profiles and 28S denaturation found in this study is comparable to other systems. For example, a majority of mammals have a typical nondenatured two-banded vertebrate profile, but rodents in the genus Ctenomys have a denatured 28S subunit [17]. Thus, an important distinction exists between subunit collapse/denaturing and degradation. The generation of high-quality de novo transcriptome assemblies from RNAseq data for two arthropod species exhibiting denatured RNA profiles provides additional support for high-quality RNA despite 28S subunit collapse [37]. Assembly statistics for both species compare favorably with previously published RNAseq studies on A. aquaticus [43] and other amphipods [44], as well as arthropod species with nondenatured RNA profiles [45]. Denaturation appears to be an arthropod-wide phenomenon with some lineage-specific exceptions, whereas degradation remains constant due to the ephemeral nature of RNA [46]. For researchers working on organisms with unknown rRNA profiles, we therefore suggest documenting (replicated) rRNA banding patterns to make accurate assessments about RNA integrity and avoid falsely labeling 28S rRNA denaturing as “true” degradation.

It's getting hot in here: the effect of temperature on 28S denaturation

A prior study by McCarthy et al. [21] examining rRNA degradation in a few arthropod lineages theorized 28S cleavage would occur at temperatures around 70°C as part of the Bioanalyzer protocol. Our temperature-gradient experiments showed that 28S denaturing can occur across a broad array of temperatures in arthropods, including room temperature (∼20°C). This can have important implications given human body temperatures are ∼37°C, and consequently excessive sample handling could contribute to unpredicted denaturing. DNase treatments, which should be a standard practice in RNA extraction protocols, typically require heating samples to 37°C for enzyme activation. Therefore, 28S fragmentation will likely occur as part of the extraction protocol in all organisms with a UAAU-rich loop. Our findings also suggest that the large ribosomal subunit of some species may be resistant to denaturation as millipedes were able to maintain 28S integrity regardless of heat exposures to 90°C. This is possibly a result of a missing UAAU loop region similar to aphids (Hemiptera, Insecta; [28, 42]). Our results further show that if samples are exposed to temperatures beyond 70°C for enzymes requiring heat deactivation, RNA integrity can be severely impacted. However, it appears that DNases can be largely removed with a second round of alcohol precipitation; this allowed us to avoid high temperature incubations by employing an enzyme brand that does not require temperature deactivation. Thus, we recommend avoiding enzymes or denaturing treatments that require high temperatures in order to ensure RNA integrity.

Denaturing gels versus electropherograms

Here, we show that both methods for evaluating RNA quality, denaturing gel electrophoresis and Bioanalyzer electropherograms, have their individual strengths while also being subjected to methodological artifacts. Though RIN values are unreliable in profiles where 28S is collapsed, electropherograms should still be used to assess RNA quality as they provide more accurate size estimates. Bleach agarose gels [36] can also be used for size estimation and to identify degradation or DNA contamination, in place of more toxic denaturing gels (e.g., Dimethyl sulfoxide (DMSO) and formaldehyde), to minimize time, effort, and costs. Furthermore, assessing RNA integrity via electrophoresis prior to diluting and running on a Bioanalyzer will allow researchers to more readily detect sources of degradation. For example, electropherograms for our gradient revealed degradation on a few lanes that were not initially present in the samples, indicating this occurred at some point during Bioanalyzer prep and/or transport. For these reasons, we recommend running samples on denaturing gels prior to or in conjunction with the bioanalyzer.

Arthropod extraction techniques

Although comparing multiple RNA extraction methods are outside the scope of this study, we found one in particular that worked well for arthropods and downstream RNAseq studies. We ultimately selected solvent-based (phenol-chloroform; e.g., TRIZOL) over column-based methods (e.g., QIAGEN and NUCLEOSPIN), as it was cost-effective while also yielding higher concentrations of high-molecular weight nucleic acids as well as small and micro-RNAs. Column membranes sequestered too much material and often resulted in the isolation of ≪1 μg of total RNA per sample. Obtaining high RNA yields becomes particularly important when working with small-bodied organisms, a challenge faced by many invertebrate researchers. Lastly, solvent-based methods prove to be reliable across taxa and laboratories as the RNA profiles reported here are consistent with past arthropod RNA studies (e.g., one band in spiders, [47]; three bands in crabs [48]).

Conclusions

Here we compared RNA profiles across Arthropoda from a diverse array of lineages including insects, crustaceans, arachnids, millipedes, and centipedes. We found that a majority of lineages displayed what would be incorrectly interpreted as a “degraded” RNA profile when compared with vertebrate references. Specifically, six distinct RNA banding patterns were revealed, providing insight into the high degree of variation that can be found across Arthropoda. However, 18S size (1900–2000 bp) remained constant among arthropods, making it a useful proxy for RNA integrity. Lastly, our study demonstrates that RNA is substantially labile and extraction protocol temperatures (>37°C) will denature 28S that contains the UAAU-rich loop region.

Data accessibility

Raw sequencing data are available on the National Center for Biotechnology Information (NCBI)’s Sequence Read Archive (SRA) database under Bioproject PRJNA476149. Click here for additional data file.
  39 in total

1.  Validation of two ribosomal RNA removal methods for microbial metatranscriptomics.

Authors:  Shaomei He; Omri Wurtzel; Kanwar Singh; Jeff L Froula; Suzan Yilmaz; Susannah G Tringe; Zhong Wang; Feng Chen; Erika A Lindquist; Rotem Sorek; Philip Hugenholtz
Journal:  Nat Methods       Date:  2010-09-19       Impact factor: 28.547

2.  High-throughput spatial mapping of single-cell RNA-seq data to tissue of origin.

Authors:  Kaia Achim; Jean-Baptiste Pettit; Luis R Saraiva; Daria Gavriouchkina; Tomas Larsson; Detlev Arendt; John C Marioni
Journal:  Nat Biotechnol       Date:  2015-04-13       Impact factor: 54.908

3.  Molecular adaptation in the world's deepest-living animal: Insights from transcriptome sequencing of the hadal amphipod Hirondellea gigas.

Authors:  Yi Lan; Jin Sun; Renmao Tian; Douglas H Bartlett; Runsheng Li; Yue Him Wong; Weipeng Zhang; Jian-Wen Qiu; Ting Xu; Li-Sheng He; Harry G Tabata; Pei-Yuan Qian
Journal:  Mol Ecol       Date:  2017-05-18       Impact factor: 6.185

4.  rRNA processing: removal of only nineteen bases at the gap between 28S alpha and 28S beta rRNAs in Sciara coprophila.

Authors:  V C Ware; R Renkawitz; S A Gerbi
Journal:  Nucleic Acids Res       Date:  1985-05-24       Impact factor: 16.971

5.  Molecular mechanism of introduction of the hidden break into the 28S rRNA of insects: implication based on structural studies.

Authors:  H Fujiwara; H Ishikawa
Journal:  Nucleic Acids Res       Date:  1986-08-26       Impact factor: 16.971

6.  Ecdysozoan mitogenomics: evidence for a common origin of the legged invertebrates, the Panarthropoda.

Authors:  Omar Rota-Stabelli; Ehsan Kayal; Dianne Gleeson; Jennifer Daub; Jeffrey L Boore; Maximilian J Telford; Davide Pisani; Mark Blaxter; Dennis V Lavrov
Journal:  Genome Biol Evol       Date:  2010-07-12       Impact factor: 3.416

7.  Base-stacking and base-pairing contributions into thermal stability of the DNA double helix.

Authors:  Peter Yakovchuk; Ekaterina Protozanova; Maxim D Frank-Kamenetskii
Journal:  Nucleic Acids Res       Date:  2006-01-31       Impact factor: 16.971

8.  Towards standardization of RNA quality assessment using user-independent classifiers of microcapillary electrophoresis traces.

Authors:  Sandrine Imbeaud; Esther Graudens; Virginie Boulanger; Xavier Barlet; Patrick Zaborski; Eric Eveno; Odilo Mueller; Andreas Schroeder; Charles Auffray
Journal:  Nucleic Acids Res       Date:  2005-03-30       Impact factor: 16.971

9.  A Transcriptomic Analysis of Cave, Surface, and Hybrid Isopod Crustaceans of the Species Asellus aquaticus.

Authors:  Bethany A Stahl; Joshua B Gross; Daniel I Speiser; Todd H Oakley; Nipam H Patel; Douglas B Gould; Meredith E Protas
Journal:  PLoS One       Date:  2015-10-13       Impact factor: 3.240

Review 10.  A survey of best practices for RNA-seq data analysis.

Authors:  Ana Conesa; Pedro Madrigal; Sonia Tarazona; David Gomez-Cabrero; Alejandra Cervera; Andrew McPherson; Michał Wojciech Szcześniak; Daniel J Gaffney; Laura L Elo; Xuegong Zhang; Ali Mortazavi
Journal:  Genome Biol       Date:  2016-01-26       Impact factor: 13.583

View more
  3 in total

1.  Assessment of RNA extraction protocols from cladocerans.

Authors:  Muhammad Raznisyafiq Razak; Ahmad Zaharin Aris; Fatimah Md Yusoff; Zetty Norhana Balia Yusof; Sang Don Kim; Kyoung Woong Kim
Journal:  PLoS One       Date:  2022-04-26       Impact factor: 3.752

2.  Daily transcriptomes of the copepod Calanus finmarchicus during the summer solstice at high Arctic latitudes.

Authors:  Laura Payton; Céline Noirot; Claire Hoede; Lukas Hüppe; Kim Last; David Wilcockson; Elizaveta A Ershova; Sophie Valière; Bettina Meyer
Journal:  Sci Data       Date:  2020-11-24       Impact factor: 6.444

3.  Annual transcriptome of a key zooplankton species, the copepod Calanus finmarchicus.

Authors:  Laura Payton; Céline Noirot; Kim S Last; Jordan Grigor; Lukas Hüppe; David V P Conway; Mona Dannemeyer; Amandine Suin; Bettina Meyer
Journal:  Ecol Evol       Date:  2022-02-22       Impact factor: 2.912

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.