| Literature DB >> 28207848 |
Alexander J Westermann1, Lars Barquist1, Jörg Vogel1,2.
Abstract
The transcriptome is a powerful proxy for the physiological state of a cell, healthy or diseased. As a result, transcriptome analysis has become a key tool in understanding the molecular changes that accompany bacterial infections of eukaryotic cells. Until recently, such transcriptomic studies have been technically limited to analyzing mRNA expression changes in either the bacterial pathogen or the infected eukaryotic host cell. However, the increasing sensitivity of high-throughput RNA sequencing now enables "dual RNA-seq" studies, simultaneously capturing all classes of coding and noncoding transcripts in both the pathogen and the host. In the five years since the concept of dual RNA-seq was introduced, the technique has been applied to a range of infection models. This has not only led to a better understanding of the physiological changes in pathogen and host during the course of an infection but has also revealed hidden molecular phenotypes of virulence-associated small noncoding RNAs that were not visible in standard infection assays. Here, we use the knowledge gained from these recent studies to suggest experimental and computational guidelines for the design of future dual RNA-seq studies. We conclude this review by discussing prospective applications of the technique.Entities:
Mesh:
Year: 2017 PMID: 28207848 PMCID: PMC5313147 DOI: 10.1371/journal.ppat.1006033
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Fig 1Methods for RNA sequencing of bacterial infections.
A. Concept of dual RNA-seq. Total RNA is extracted from infected cells and analyzed by RNA-seq. The mixed sequencing reads are assigned to their originating genomes in silico. B. Different approaches to quantify gene expression of bacteria in context with mammalian host cells. Traditionally, host material was depleted prior to analysis, either by detergent-mediated lysis of host cells (left) or by sequence-specific removal of host transcripts (middle). Instead, dual RNA-seq omits host depletion (right) and analyzes pathogen and host gene expression in parallel.
Overview of dual RNA-seq and related studies published to date.
“Dual SAGE” refers to the simultaneous analysis of host and pathogen by Serial Analysis of Gene Expression (SAGE), and “Multi RNA-seq” refers to a metatranscriptomic analysis of bacterial species constituting the airway microbiota in conjunction with nasal epithelial host cells. “M,” million; “TPM,” transcripts per million; “RPKMO,” reads per kilobase pairs of a gene per million reads aligning to annotated ORFs. Databases containing raw sequencing data: NCBI (National Center for Biotechnology Information), ENA (European Nucleotide Archive), GEO (Gene Expression Omnibus).
| One-sided (focus on bacterial gene expression) | Dual RNA-seq | Dual SAGE | "Multi" RNA-seq | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Mandlik et al. [ | Lamont et al. [ | Szafranska et al. [ | Srikumar et al. [ | Avican et al. [ | Humphrys et al. [ | Vannucci et al. [ | Mavromatis et al. [ | Rienksma et al. [ | Baddal et al. [ | Avraham et al. [ | Westermann et al. [ | Aprianto et al. [ | Afonso-Grunz et al. [ | Pérez-Losada et al. [ | |
| Bacterial species | uropathogenic | nontypeable | airway microbiota | ||||||||||||
| Host model | infant rabbits and mice | bovine epithelial cells (MAC-T); primary bovine monocyte–derived macrophages | mouse model of osteomyelitis | murine monocytic cells (RAW 264.7) | FVB/N mice | human epithelial cells (HEp-2) | primary porcine enterocytes | mouse bone marrow–derived macrophages | human monocytic cells (THP-1) | primary normal human bronchial epithelial cells | mouse bone marrow–derived macrophages | diverse cell culture models (human, murine, porcine) | human lung alveolar epithelial cells (A549) | human epithelial cells (HeLa) | nasal epithelium from human donors |
| Intracellular/extracellular | extracellular | intracellular | extracellular | intracellular | extracellular | obligate intracellular | obligate intracellular | intracellular | intracellular | extracellular | intracellular | intracellular | extracellular | intracellular | extracellular |
| Sample fixation? | RNA | - | murine tibiae incubated in RNA | RNA stabilization solution (0.2% SDS/19% ethanol/1% acidic phenol) | cryosections incubated in RNA | - | treated with RNase inhibitor prior to embedding | - | - | - | - | RNA | saturated ammonium sulfate solution | - | - |
| Enrichment of invaded cells? | n.a. | n.a. | n.a. | n.a. | n.a. | - | laser capture microdissection | - | - | n.a. | FACS-based (upon lipopolysaccharide [LPS] staining) | FACS-based (green fluorescent protein [GFP]-expressing bacteria) | n.a. | - | - |
| Lysis technique | tissue homogenized; cells lysed in TRIzol or lysis/binding buffer ( | TRIzol + zirconium beads | lysostaphin treatment followed by the addition of RLT buffer and mechanic disruption | 0.2% SDS/19% ethanol/1% acidic phenol to lyse host cells; TRIzol-based bacterial lysis | Dispomix Drive; glass beads | freeze-thaw + Lysis Solution (MasterPure RNA Purification kit) | Extraction Buffer (PicoPure kit) | Buffer RLT (RN | TRIzol + bead beating | TRIzol | freeze-thaw | lysis/binding buffer ( | bead beating and phenol-chloroform | lysis/binding buffer ( | TRIzol |
| RNA extraction technique | TRIzol or | TRIzol | RN | TRIzol | hot phenol | MasterPure RNA Purification | PicoPure | RN | TRIzol | TRIzol | RNAClean SPRI beads | High Pure RNA Isolation | TRIzol | ||
| Enrichment of bacterial cells/transcripts? | MICROB | MICROB | anti- | selective lysis and differential centrifugation | MICROB | with or without polyA-depletion (Poly(A) Purist Mag) to enrich bacterial transcripts; re-combined both RNA samples prior to sequencing | - | MICROB | with or without differential lysis (with guanidine thiocyanate) | - | - | - | - | with or without polyA-enrichment (Dynabeads Oligo dT25); poly(A)+ and poly(A)- samples analyzed separately | - |
| rRNA depletion? | MICROB | - | Terminator Exonuclease | - | MICROB | RiboZero (gram-negative bacteria; human/mouse/rat) | - | RiboZero (gram-negative bacteria; human/mouse/rat) | RiboZero (epidemiology) | RiboZero (epidemiology) | RiboZero (epidemiology) | RiboZero (epidemiology) | RiboZero (gram-positive bacteria; human/mouse/rat) | RiboZero (gram-negative bacteria; human/mouse/rat) | RiboZero |
| cDNA library preparation | Illumina: strand-specific ds-cDNA; Helicos: ss-cDNA | mRNA Seq library preparation kit (Illumina) | ScriptSeq | Illumina-based protocol | TruSeq | TruSeq | Ovation RNA-Seq System V2 | Digital Gene Expression Tag Profiling kit | TruSeq | ScriptSeq | RNAtag protocol (generation of multiple RNA-seq libraries in a single reaction) | Illumina-based protocol | TruSeq | SuperSAGE libraries for poly(A)+ and poly(A)- fractions | TruSeq |
| Sequencing platform | Illumina (paired-end), Helicos | GA IIx (paired-end) | HiSeq 2500 (single-end) | HiSeq 2000 | HiSeq 2000 (paired-end) | HiSeq 2000 (paired-end) | GA IIx (paired-end) | HiSeq 2000 (paired-end) | HiSeq 1500 (paired-end) | HiSeq 2500 (paired-end) | HiSeq 2500 | HiSeq; NextSeq 500 (single-end) | NextSeq 500 (single-end) | HiSeq 2000 (single-end) | HiSeq 2500 (single-end) |
| Sequencing depth/library | ~50 M (Illumina); 1–5 M (Helicos) | 20 M for bovine samples; 7.5 M for bacterial samples | 53–105 M | ~20 M | ~20–250 M | ~14–353 M | ~22 M | ~15–30 M | ~22–40 M (for infection samples) | ~60–180 M | on average 6 M | varies (~25 M for main time-course experiment) | on average 70 M | ~1–7.5 M | ~40 M |
| Fraction of bacterial reads (of all aligned reads in infection samples) | n.a. | n.a. | 0.7%–16.5% | n.a. | 0.002%–19% | ~0.02% (1 h postinfection); ~30% (24 h postinfection) | ~5% | ~0.03%–58% | 2–4% (nonenriched); 11%–25% (enriched) | ~0.2%–1.5% | on average 0.28% | ~1%–10% | on average 67% | ~0.7% (30 min postinfection); ~2% (24 h postinfection) | ~5% |
| Differential expression analysis tool | DESeq | Cufflinks | edgeR, DESeq, SAMseq | TPM | RPKMO | DESeq | Cuffdiff (Cufflinks) | Cuffdiff (Cufflinks) | edgeR | limma | TPM; DESeq | edgeR | DESeq | TPM | Cufflinks |
| Data availability | n.a. | PRJNA218473 (NCBI) | PRJEB6003 (ENA) | GSM1462575–1462579, GSM1914919 (GEO) | GSE55292 (GEO) | GSE44253 (GEO) | n.a. | PRJNA256028 (NCBI) | PRJEB6552 (ENA) | GSE63900 (GEO) | GSE65528–31 (GEO) | GSE60144 (GEO) | GSE79595 (GEO) | GSE61730 (GEO) | n.a. |
Fig 2A generic dual RNA-seq workflow analyzing total mixed RNA after double rRNA depletion that discovered the role of PinT small regulatory RNA (sRNA) during Salmonella infection of host cells [13].
Salmonella having gfp stably integrated in their chromosome and expressed from a constitutive promoter were used to infect cultures of HeLa cells. RNA-seq of the bacterial input (1) or mock-infected HeLa cells (2) served as reference controls for Salmonella or human gene expression analysis, respectively. Infection was carried out in parallel with wild-type and sRNA mutant Salmonella strains, and samples were taken over a time-course of infection. The resulting cell samples constituted a mixed population consisting of both invaded (GFP-positive) and uninfected bystander (GFP-negative) cells (3). To obtain a homogeneous population of invaded cells, the samples were sorted based on the emitted GFP fluorescence (4). Total RNA was extracted from the thus enriched cells, rRNA from both infection partners was depleted (5), and rRNA-free samples were converted into cDNA libraries and sequenced. The resulting sequencing reads were mapped in parallel against the Salmonella and human (core and mitochondrial) genome. Differential expression analysis of the time course revealed the strong induction over time of a novel Salmonella sRNA, PinT, and comparative analysis unraveled the molecular footprint of this sRNA in the bacterial transcriptome (6). Likewise, comparison of the host transcriptome between wild-type and ΔpinT infections revealed PinT-dependent changes in the immune response, including a differential activation of Janus kinase-Signal Transducer and Activator of Transcription (JAK-STAT) signaling as well as changes with respect to the expression of host long noncoding RNAs (lncRNAs) and microRNAs (miRNAs) (7). In addition, the pinT status of the infecting bacterium influenced mitochondrial gene expression, and infection with ΔpinT Salmonella led to the relocalization of mitochondria in invaded host cells (8).
Fig 3Illustration of biological insights obtained from dual RNA-seq studies in four different bacterial infection models.
HEp-2 cells infected with obligate intracellular Chlamydia trachomatis [10], primary airway epithelial cells with nontypeable Haemophilus influenzae [16], primary murine bone marrow macrophages with uropathogenic E. coli (UPEC) [11], and diverse human, mouse, and porcine cell lines with Salmonella Typhimurium [13]. See main text for details.
Fig 4Bioinformatic analysis pipeline for dual RNA-seq datasets.
Quality-filtered RNA-seq reads are aligned in parallel against the respective host and pathogen replicons. Reads mapping equally well to both reference organisms (“cross-mappings”) are quantified and discarded from downstream analyses. Reads unequivocally mapped to either the bacterial or host reference are used for quantification and functional analyses. Dual RNA-seq enables a wide range of downstream analyses, discussed in detail in the text. “MT,” mitochondrial genome.