| Literature DB >> 27713560 |
Nikhil Kumar1, Mingqun Lin2, Xuechu Zhao1, Sandra Ott1, Ivette Santana-Cruz1, Sean Daugherty1, Yasuko Rikihisa2, Lisa Sadzewicz1, Luke J Tallon1, Claire M Fraser1,3, Julie C Dunning Hotopp1,4.
Abstract
Despite numerous advances in genomics and bioinformatics, technological hurdles remain to examine host-microbe transcriptomics. Sometimes the transcriptome of either or both can be ascertained merely by generating more sequencing reads. However, many cases exist where bacterial mRNA needs to be enriched further to enable cost-effective sequencing of the pathogen or endosymbiont. While a suitable method is commercially available for mammalian samples of this type, development of such methods has languished for invertebrate samples. Furthermore, a common method across multiple taxa would facilitate comparisons between bacteria in invertebrate vectors and their vertebrate hosts. Here, a method is described to concurrently remove polyadenylated transcripts, prokaryotic rRNA, and eukaryotic rRNA, including those with low amounts of starting material (e.g. 100 ng). In a Wolbachia-Drosophila system, this bacterial mRNA enrichment yielded a 3-fold increase in Wolbachia mRNA abundance and a concomitant 3.3-fold increase in the percentage of transcripts detected. More specifically, 70% of the genome could be recovered by transcriptome sequencing compared to 21% in the total RNA. Sequencing of similar bacterial mRNA-enriched samples generated from Ehrlichia-infected canine cells covers 93% of the Ehrlichia genome, suggesting ubiquitous transcription across the entire Ehrlichia chaffeensis genome. This technique can potentially be used to enrich bacterial mRNA in many studies of host-microbe interactions.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27713560 PMCID: PMC5054355 DOI: 10.1038/srep34850
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Bioanalyzer analysis of total RNA and bacterial mRNA-enriched samples from Drosophila ananassae colonized by its Wolbachia endosymbiont.
The subtraction of Drosophila rRNA was assessed by running equivalent amounts of total RNA (blue) and Ribo-Zero reduced RNA (pink) on a Bioanalyzer. The software calculated the concentration of each sample by integrating the area under the rRNA peaks. Total RNA was 331 ng/μL and Ribo-Zero reduced RNA was 8 ng/μL, for an RNA loss of >97%, most of which is in the rRNA peaks for both the bacterial endosymbiont and invertebrate host.
Transcripts assessed by qRT-PCR.
| RNA Type | Removal Component | Post-enrichment Abundance | Target | Gene Description | ΔCt | ΔCt Drosophila-Wolbachia Replicate 2 | ΔCt Drosophila-Wolbachia Replicate 3 | ΔCt Brugia-Wolbachia Replicate 1 | ΔCt Brugia-Wolbachia Replicate 2 |
|---|---|---|---|---|---|---|---|---|---|
| Dynabeads | Decreased | Act5c | Actin 5C | −1.98 | −1.30 | −2.12 | NA | NA | |
| Dynabeads | Decreased | RpL32 exon | Rbosomal protein L32 (RPL32) within an exon | −3.06 | −2.41 | −3.38 | NA | NA | |
| Dynabeads | Decreased | RpL32 boundary | Ribosomal protein L32 (RPL32) across intron/exon boundary | −3.08 | −2.51 | −3.37 | NA | NA | |
| Ribo-Zero (H/M/R) | Decreased | 28S rRNA | NQ | NQ | NQ | NA | NA | ||
| Ribo-Zero (H/M/R) | Decreased | 18S rRNA | NQ | NQ | NQ | NA | NA | ||
| Ribo-Zero (Gram-Neg) | Decreased | 16S rRNA | −5.48 | −13.9 | −13.9 | −7.06 | −8.00 | ||
| No removal | No change | WD_1289 | Ribosomal protein S10 | 0.17 | 0.475 | 0.69 | NA | NA | |
| No removal | No change | WD_0443 | Hypothetical protein | 0.66 | −0.33 | −0.02 | NA | NA | |
| No removal | No change | WD_0880 | Coenzyme PQQ synthesis protein C | 0.69 | −0.15 | 0.36 | NA | NA | |
| Dynabeads | Decreased | Actin | actin (partial mRNA) | NA | NA | NA | −2.08 | −2.68 | |
| Dynabeads | Decreased | Bm1_03910 | 40S ribosomal protein S27 | NA | NA | NA | −3.02 | −3.95 | |
| Ribo-Zero (H/M/R) | Decreased | Bm_18S | NA | NA | NA | −16.4 | −17.4 | ||
| No removal | No change | Wbm0276 | DnaA from | NA | NA | NA | 0.91 | −1.02 | |
| No removal | No change | Wbm0350 | GRoEL from | NA | NA | NA | 0.84 | −1.16 |
*The ΔCt was calculated as the difference between the Ct value of the original RNA sample and the sample following bacterial mRNA enrichment such that a negative value reflects loss of the molecule being tested.
**NQ = not quantifiable. More specifically, values for the Drosophila rRNA are not shown because they are not in the linear range under these conditions. Modifying the conditions to make them linear would have prevented the detection of the Wolbachia mRNA. However, a large not quantifiable decrease in the Ct was also observed for the Drosophila rRNA following rRNA depletion.
***NA = not applicable.
Figure 2Depletion of Drosophila rRNA and actin transcripts.
Coverage was compared for the Drosophila rRNA (panel A) and the actin gene (panel B) for the total (blue), polyA-selected (pink), and the bacterial mRNA-enriched (gray) RNA samples after normalizing for the number of reads sequenced, as calculated as NCPM, or normalized coverage per million reads sequenced. rRNA is highly abundant in the total RNA, but significantly reduced in the polyA-selected and the bacterial mRNA-enriched samples. In contrast, the actin transcript was enriched only in the polyA-enriched sample. Therefore the method of bacterial mRNA enrichment was effective at removing both eukaryotic mRNA and rRNA.
Figure 3Depletion of Wolbachia rRNA and enrichment of Wolbachia mRNA.
Coverage was compared for the Wolbachia rRNA (panel A) and the WRi_010910 gene (panel B) for the total (blue), polyA-selected (pink), and bacterial mRNA-enriched (gray) samples after normalizing for the number of reads sequenced, as calculated as NCPB, or normalized coverage per billion reads sequenced. rRNA was highly abundant in the total RNA, but significantly reduced in the polyA-selected and the bacterial mRNA-enriched samples. In contrast, the WRi_010910 transcript was enriched in the bacterial mRNA-enriched sample compared to the total RNA. Therefore the method was effective at enriching for bacterial mRNA.
Figure 4Bioanalyzer analysis of Ehrlichia and canine total RNA abundance.
Using the Bioanalyzer, the Ehrlichia-canine total RNA (blue, left axis) was compared to the RNA 6000 ladder (pink, right axis), which contains 0.2, 0.5, 1.0, 2.0, 4.0, and 6.0 kbp fragments. Only canine rRNA was evident with no detectable bacterial rRNAs.
Figure 5Ehrlichia transcriptome sequencing coverage across a genome segment.
The bacterial mRNA-enriched transcriptome sequencing coverage was plotted for the first 20 kbp of the Wakulla genome (Panel A) and compared to the predicted genes for this region (Panel B). While 99% of the genome is transcribed, troughs are apparent for tRNAs, which are too small to be recovered with the RNA isolation method used here. The coverage peaks at the 5′-end of transcripts and decays over the length of the transcript, as has been observed for other bacterial transcriptomes. Troughs are seen at transcriptional start sites, but not always at the 3′-ends of transcripts. This suggests that either the 3′-end of the transcripts overlap the ends of other transcripts, or that this strain lacks discrete transcription termination sites.
Approximate Costs.
| Standard Input Bacterial mRNA Enrichment | Low Input Bacterial mRNA Enrichment | Standard rRNA Reduction | Low Input rRNA Reduction | No reduction | Agilent SureSelect | |
|---|---|---|---|---|---|---|
| Ribo-Zero Human/Mouse/Rat | $64.81 | $16.01 | $64.81 | $16.01 | $0 | $0 |
| Ribo-Zero Gram Negative Bacteria | $12.88 | $3.22 | $12.88 | $3.22 | $0 | $0 |
| DynaBeads | $46.20 | $46.20 | $0 | $0 | $0 | $0 |
| Agilent SureSelect RNA capture | $0 | $0 | $0 | $0 | $0 | $562.50 |
| mRNA library construction | $325 | $325 | $325 | $325 | $325 | $325 |
| Sequencing costs (insect-bacteria) | $12,000 | $12,000 | $12,000 | $12,000 | $600,000 | — |
| Total (insect-bacteria) | $12,448.89 | $12,390.43 | $12,402.69 | $12,344.23 | — | — |
| Sequencing costs (canine-bacteria) | $839 | $839 | $839 | $839 | — | — |
| Total (canine-bacteria) | $1,288.05 | $1,229.59 | $1,241.85 | $1,183.39 | — | — |
*Based on a $1,830 kit of 24 reactions and using 8.5 μL for the standard input and 1.7 μL for the low input, as described in the methods.
**Based on a $515 kit of 6 reactions and using 1.5 μL in the standard input and 0.3 μL in the low input, as described in the methods.
***Based on a $462 kit of 10 reactions.
****Based on the $7,500 list price for 16 reactions of a Tier 2 custom capture of a 0.5–2.9 Mbp genome and a $1,500 reagent kit.
*****Based on a $2,400 100-bp paired end HiSeq channel generating 200 million read pairs and targeting the acquisition of 10 million mapped bacterial read pairs/transcriptome, using the Drosophila-Wolbachia sequencing results, specifically that 1.0% of reads matched Wolbachia sequences as opposed to 0.02% in the total RNA; In reality the samples without reduction would perform markedly worse since the reads would most likely map to the rRNA.
******Based on a $2,400 100-bp paired end HiSeq channel generating 200 million read pairs and targeting the acquisition of 10 million mapped bacterial read pairs/transcriptome, using the Ehrlichia-canine sequencing results, specifically that 14.3% of reads mapped to Ehrlichia following enrichment.
Primer sequences.
| Name | Forward Name | Forward Sequence | Reverse Name | Reverse Sequence | Size |
|---|---|---|---|---|---|
| Act5c | Act5CF | GTCATCTTCTCACGGTTAGC | Act5CR | AGATCTGGCATCACACCTTC | 109 |
| RpL32 exon | RpL32_exonF | TCGCTTCAAGGGTCAGTACC | RpL32_exonR | TCTGCATCAGCAGGACCTC | 134 |
| RpL32 boundary | RpL32_boundaryF | CGAAGTTGTCGCACAAATGG | RpL32_boundaryR | GGTGCGCTTGTTGGAACC | 113 |
| Dana_UNIQ_1 | Dana_UNIQ_1_F | CTGAGCTGCGAATACTGCAC | Dana_UNIQ_1_R | CAAGTCCGGCTTAATCTTGG | 186 |
| Dana_28S | Dana_28S_F | CCAAAGAGTCGTGTTGCTTG | Dana_28S_R | AACGGATATTCAGGTTCATCG | 187 |
| Dana_18S | Dana_18S_F | TGGTCTTGTACCGACGACAG | Dana_18S_R | GCTGCCTTCCTTAGATGTGG | 156 |
| Wana_16S | Wana_16S_F | GCTCGTGTCGTGAGATGTTG | Wana_16S_R | AAGGGCCATGATGACTTGAC | 146 |
| WD_qPCR_1289 | WD_qPCR_1289F | TTTGCACTCGGTGCATTTAC | WD_qPCR_1289R | CAAGCAGTTGCACCATTTTTAC | 101 |
| WD_qPCR_0443 | WD_qPCR_0443F | TGCAATTGCCAATGGTTATG | WD_qPCR_0443R | ATTCTGCCTTCAACGTCAGG | 117 |
| WD_qPCR_0880 | WD_qPCR_0880F | AATGGCATTCTGAGGAATGTG | WD_qPCR_0880R | CAAGAAAACCCCACAAGAGC | 111 |
| WD_qPCR_1012 | WD_qPCR_1012F | AGCGAAAGATGGAAGTGGTG | WD_qPCR_1012R | CATTTCCTTCCACTCCAAGC | 139 |
| WD_qPCR_1094 | WD_qPCR_1094F | GGAAACGAGGAATTAATCAAGC | WD_qPCR_1094R | CCTGTTCCATCGCAGTAACC | 104 |
| Actin | Actin_F | TGCTGATCGTATGCAGAAGG | Actin_R | GGAGAGTGACGCCAGGATAG | 124 |
| Bm1_03910 | Bm1_03910_F | GCCGTTAGCACGAGATTTATTG | Bm1_03910_R | AGGGCACTTTACATCCATGAAG | 112 |
| Wbm0276 | Wbm0276_F | GGTTCGCCACTAGATCCAAG | Wbm0276_R | CCCACTCCGCCATATAGAAAC | 146 |
| Wbm0350 | Wbm0350_F | CGTTGCTGTGCTTAAAGTCG | Wbm0350_R | AAGTGCAACTCCACCACCTG | 133 |
| Bm_18S | Bm_18S_F | ACTTCATGCGGCTAAACACC | Bm_18S_R | TGGTGGAGTGATTTGTCTGG | 124 |