Literature DB >> 32517038

YerA41, a Yersinia ruckeri Bacteriophage: Determination of a Non-Sequencable DNA Bacteriophage Genome via RNA-Sequencing.

Katarzyna Leskinen1, Maria I Pajunen1, Miguel Vincente Gomez-Raya Vilanova1, Saija Kiljunen1, Andrew Nelson2, Darren Smith2, Mikael Skurnik1,3.   

Abstract

YerA41 is a Myoviridae bacteriophage that was originally isolated due its ability to infect Yersinia ruckeri bacteria, the causative agent of enteric redmouth disease of salmonid fish. Several attempts to determine its genomic DNA sequence using traditional and next generation sequencing technologies failed, indicating that the phage genome is modified in such a way that it is an unsuitable template for PCR amplification and for conventional sequencing. To determine the YerA41 genome sequence, we performed RNA-sequencing from phage-infected Y. ruckeri cells at different time points post-infection. The host-genome specific reads were subtracted and de novo assembly was performed on the remaining unaligned reads. This resulted in nine phage-specific scaffolds with a total length of 143 kb that shared only low level and scattered identity to known sequences deposited in DNA databases. Annotation of the sequences revealed 201 predicted genes, most of which found no homologs in the databases. Proteome studies identified altogether 63 phage particle-associated proteins. The RNA-sequencing data were used to characterize the transcriptional control of YerA41 and to investigate its impact on the bacterial gene expression. Overall, our results indicate that RNA-sequencing can be successfully used to obtain the genomic sequence of non-sequencable phages, providing simultaneous information about the phage-host interactions during the process of infection.

Entities:  

Keywords:  RNA-sequencing; YerA41; Yersinia ruckeri; bacteriophage; genome assembly; nucleotide modification; transcriptome

Mesh:

Substances:

Year:  2020        PMID: 32517038      PMCID: PMC7354516          DOI: 10.3390/v12060620

Source DB:  PubMed          Journal:  Viruses        ISSN: 1999-4915            Impact factor:   5.048


1. Introduction

YerA41 is a bacteriophage that was originally isolated due its ability to infect Yersinia ruckeri bacteria, the causative agent of enteric redmouth disease of salmonid fish [1]. Bacteriophage YerA41 was first described in 1984 as a tailed icosahedral virus that lysed the vast majority of tested Y. ruckeri serovar I strains, and thus was believed to have a potential value for the diagnosis of redmouth disease. However, it was found that YerA41 has a relatively broad host range among Enterobacteriacae as it could infect some strains of other Yersinia species, Escherichia coli, Shigella flexneri, Enterobacter cloacae, Klebsiella and Erwinia spp. [1]. The YerA41 phage particles are large with heads of about 110 nm in diameter, 10 × 8 nm necks, 250 × 20 nm non-contracted sheaths and 70 nm long tail fibers [2]. Several attempts to sequence the YerA41 genome using next generation sequencing (Illumina) approaches failed previously. Additionally, it was observed that the phage DNA could not be digested by any of the numerous tested restriction enzymes. These results led to the conclusion that the phage DNA is modified in such a way that prevents the function of enzymes such as DNAP and DNA-modifying restriction enzymes. Indeed, preliminary analysis of the nucleosides confirmed the presence of modifications, however, without revealing their exact nature. Naturally occurring modifications of the canonical deoxynucleotides can be found in the DNA of organisms from all domains of life. They may constitute only a small fraction of the bases or even replace the standard, un-modified base entirely [3]. Collectively, the greatest diversity of modified bases can be observed among bacteriophages. These modifications include both the types observed among the bacterial host species, and the more unusual that are not observed among different organisms [4]. A variety of different chemical groups can be attached to the nucleotide, ranging from simple methyl groups through amino acids, polyamines, monosaccharides to more complexed oligosaccharides. They do not lead to alterations in the specificity of base pairing, and primarily appear to be a part of the arms race between the infecting organism and its host. For example, these modifications can provide additional information needed for the control of gene expression, recognition of self and non-self DNA, or protection from enzymatic degradation and host defense mechanisms [3,4]. The hypermodified bases observed among phages include 5-hydroxymethylpyrimidines and their glycosylated derivatives, α-putrescinylated and α-glutamylated thymines, sugar-substituted 5-hydroxypentyl uracil, N6-(1-acetamido)-modified adenine and 7-methylguanine [3,5]. Interestingly, the hydroxymethyl deoxypyrimidines are produced from free nucleotides by phage-encoded deoxypyrimidine hydroxymethylases before their incorporation into DNA. This mechanism is different from the primary base methylations that are commonly catalyzed in situ on DNA. After incorporation, these hydroxymethylpyrimidines often undergo further modifications introduced by phage-encoded enzymes such as glycosyl- or acetyl-transferases [3,6,7]. Here, we have used RNA-sequencing to determine the nucleotide sequence of the phage YerA41 genome using its encoded transcripts. This approach allowed us to determine the nucleotide sequence of the greater part of the phage genome and to characterize its gene expression during infection. Additionally, our approach provided insight into the interactions between YerA41 and its host bacterium. Moreover, our results indicated that YerA41 represents a novel, previously unidentified, group of bacteriophages that at the level of nucleotide sequence shares almost no similarity with those previously reported.

2. Materials and Methods

2.1. Bacterial Strains and Phage Propagation

Bacteriophage YerA41 was propagated in Y. ruckeri strain RS41 [1] as described previously [8]. Y. ruckeri was grown in lysogeny broth (LB) at room temperature (RT, 22 °C). LB agar (LA) plates were used for all solid cultures and prepared by supplementing LB with 1.5% Bacto agar and 0.4% for soft agar. Bacteriophage YerA41 was stored at −70 °C in tryptic soy broth (TSB) supplemented with 7% dimethyl sulfoxide (DMSO). The bacterial strains used in this study are listed in Table S1.

2.2. Purification of Phage Particles

The standard method for the production and purification of bacteriophages [9] was used. Briefly, an overnight culture of host bacteria was diluted 10-fold in TSB in a total volume of 1 L divided into four 2 L erlenmeyer flasks, 250 mL each, and infected with appropriate number of phage to reach the multiplicity of infection (MOI) of 1. The infected cultures were incubated at 25 °C with vigorous aeration (250 rpm), until after 6–7 h the bacterial lysis took place. The lysed cultures were treated with DNase I (1.2 µg/mL; Roche Diagnostic, Mannheim, Germany) and RNase A (1 µg/mL; Sigma Chemicals, St. Louis, MO, USA) and incubated at RT for 30 min. Solid NaCl was added to a final concentration of 1 M and lysates were kept on ice for 1 h, and then the solution was centrifuged at 11,000× g for 10 min at 4 °C to remove the precipitated bacterial debris. The phage was recovered from the supernatant by precipitating with polyethylene glycol (PEG 8000) (10%, w/v; > 60 min; 0 °C) and was resuspended into TM buffer (50 mM Tris-HCl, pH 7.5, 10 mM MgSO4). Alternatively, the phage was purified from semiconfluent soft-agar plates as described [9]. The phage was further purified by chloroform extraction and one to three rounds of discontinuous glycerol density gradient ultracentrifugation at 35,000 rpm at 4 °C for 1 h in a Beckman SW41 rotor. After the centrifugations phages were resuspended in SM buffer (50 mM Tris-HCl pH 7.5, 100 mM NaCl, 8 mM MgSO4, 0.01% gelatin) containing 8% of sucrose.

2.3. One Step Growth Curve

A mid-exponential-phase culture (10 mL) of RS41 (optical density at 600 nm [OD600] 0.4 to 0.5) was harvested by centrifugation at 3,000 rpm for 15 min and resuspended in 1 mL of TSB medium. YerA41 phage was added at MOI of 0.0005 and allowed to adsorb for 2 min at RT. The culture was then centrifuged, the pelleted bacterial cells were resuspended to 10 mL of TSB, and incubation was continued at RT. Samples (100 µL) were taken at 10 min intervals. The first set of samples was immediately diluted and plated on lambda agar plates (per liter of broth: Bacto-agar 15 g; lambda broth per liter: tryptone 10 g, NaCl 2.5 g) for phage titration. In order to determine the eclipse period the second set of samples was treated with 1% chloroform before plating to release the intracellular phage particles [10,11]. The number of plaque forming units (PFU) in the immediately diluted 0 min time point samples was set to 1 to represent the number of infected cells in the experiment, and the PFU of all the other samples were normalized against that number. The burst size was then directly obtained from the normalized value after the rise period.

2.4. Total RNA Extraction

RNA extractions were carried out from three separate infection experiments. In the first experiment, the RS41 bacteria were grown for 16 h at RT and subsequently diluted 1:20 in fresh LB to a total volume of 20 mL. When the OD600 of the culture reached 0.7, duplicate 1 mL samples were withdrawn to represent 0 min uninfected samples, then 9 mL of the culture was infected with 1 mL of phage YerA41 stock (5 × 1010 pfu/mL) to achieve a MOI of ca. 50. Duplicate 1 mL aliquots were withdrawn at 5, 15, and 30 min post-infection (p.i.). In the second experiment, the RS41 bacteria were grown for 16 h at RT and subsequently diluted 1:20 in fresh LB to a total volume of 20 mL. When the OD600 of the culture reached 0.7, 8.5 mL of the culture was infected with 1.5 mL of phage YerA41 stock to achieve a MOI of 10. One mL aliquots were withdrawn at 33, 45, 63, 75 and 92 min p.i. in one replicate each. In the third experiment, three biological replicates were grown for 16 h at RT and diluted 1:20 in 10 mL. When the cultures reached OD600 of 0.6, one mL aliquots were withdrawn from each culture, these representing the uninfected 0 min samples in triplicate. Then, to 5.3 mL of culture 0.7 mL of phage stock was added to reach a MOI of 50, mixed for 3 min followed by 3 min standing and 3 min centrifugation at 4500× g at 22 °C. The supernatant was replaced with 6 mL fresh medium and triplicate 1 mL samples, one from each tube, were withdrawn at 15, 30 and 60 min p.i. In all experiments, total RNA was isolated from the samples using the SV Total RNA Isolation System (Promega, Madison, WI, USA) and the quality assessment was performed using LabChip (PerkinElmer, Waltham, MA, USA), using the DNA 5K/RNA/CZE chip with HT RNA Reagent Kit.

2.5. RNA Sequencing

The RNA-sequencing and data analysis were performed at the Nu-Omics DNA sequencing research facility at University of Northumbria. The rRNA was removed using Ribo-ZeroTM rRNA Removal Kit for Gram-negative Bacteria (Illumina, San Diego, CA, USA). The sequencing library was prepared using the ScriptSeq-v2 RNA-Seq Library Preparation Kit (Illumina). Paired-end sequencing was performed on MiSeq (Illumina) with the read length of 150 nucleotides.

2.6. Transcriptome Assembly

The obtained sequencing reads were quality filtered and aligned against Y. ruckeri PB-H2 chromosome (Acc.no. LN681231.1) and plasmids pYR2 (Acc.no. LN681229.1) and pYR3 (Acc.no. LN681230.1) using Bowtie2 aligner [12]. The reads that failed to align to these reference sequences were merged together and assembled using Velvet [13] and SPAdes [14]. The obtained assembled sequences were blasted against the NCBI nucleotide collection (https://blast.ncbi.nlm.nih.gov/Blast.cgi) and contigs that showed high sequence identity to Yersinia strains were excluded. The phage genomic scaffolds were auto-annotated using Rapid Annotation Using Subsystem Technology (RAST, http://rast.nmpdr.org/) and the obtained annotation was validated manually using the Artemis tool [15]. Presence of suitable ribosomal binding sites in front of start codons of each predicted gene was confirmed manually. The tRNA genes were identified using the ARAGORN (http://130.235.46.10/ARAGORN/) and tRNA-SCAN (http://lowelab.ucsc.edu/tRNAscan-SE/index.html) tools.

2.7. RNA-Sequencing Data Analysis

The quality filtered sequencing reads from different time points were aligned against the obtained YerA41 assembly scaffolds and against the Y. ruckeri PB-H2 reference sequence using Bowtie2 aligner [12]. The reads aligning over each gene were counted using the HTSeq [16]. The differential gene expression of bacterial transcriptome was analyzed using the edgeR [17].

2.8. Proteome Analysis

Purified phages were concentrated by centrifugation for 2 h at 4 °C at 16,000× g. Prior to the digestion of proteins to peptides with trypsin, the proteins in the samples were reduced with tris (2-carboxyethyl)phosphine (TCEP) and alkylated with iodoacetamide. Tryptic peptide digests were purified using C18 reversed-phase chromatography columns [18] and the mass spectrometry (MS) analysis was performed on an Orbitrap Elite Electron-Transfer Dissociation (ETD) mass spectrometer (Thermo Scientific, Waltham, MA, USA), using Xcalibur version 2.2, coupled to an Thermo Scientific nLC1000 nanoflow High Pressure Liquid Chromatography (HPLC) system. Peak extraction and subsequent protein identification was achieved using Proteome Discoverer 1.4 software (Thermo Scientific). Calibrated peak files were searched against the YerA41 and Y. ruckeri PB-H2 chromosome and plasmid proteins by a SEQUEST search engine. Error tolerances on the precursor and fragment ions were ±15 ppm and ±0.8 Da, respectively. For peptide identification, a stringent cut-off (0.05 false discovery rate or 5%) was used. The LC-MS/MS was performed at the Proteomics Unit, Institute of Biotechnology, University of Helsinki.

2.9. DNA Isolation

Phage DNA was obtained from high-titer phage preparations as described [9] using proteinase K plus SDS treatment followed by phenol-chloroform extractions and ethanol precipitation.

2.10. Accession Numbers

The RNA sequence data were deposited to the Gene Expression Omnibus (Acc. no GSE146319).

3. Results

3.1. Characterization of YerA41

The growth curve of YerA41 propagated in Y. ruckeri RS41 is shown in Figure S1. Apparent eclipse and latent periods of 30 and 40 min, respectively, were followed by a rise period of 20 min, and the burst size was 138 PFU per infected cell. The host range of YerA41 was studied on bacteria grown both at RT and at 37 °C, as temperature is known to regulate surface structures in the genus Yersinia [19]. Altogether, 129 strains representing 9 different Yersinia species and 3 other genera were used (Table 1 and Table S1). All Y. enterocolitica strains except for those of serotype O:8 were resistant to YerA41. Most tested Y. intermedia and Y. ruckeri strains were sensitive to YerA41 phage at both tested temperatures. In addition, many Y. kristenseni strains, and the Shigella flexneri strain, were sensitive to YerA41 (Table 1). Such a broad host range indicates that the phage uses a conserved bacterial surface structure as a receptor.
Table 1

Host range of bacteriophage YerA41 (numbers of studied strains for each species and serotype are given in parenthesis). The sensitivity was tested both at RT and at 37 °C.

Bacterial Species YerA41 Sensitive Serotypes YerA41Sensitive Serotypesat 37 °C YerA41 Resistant Serotypes or Strains
Yersinia enterocolitica O:8 (5)O:8 (5)O:1(2),O:1,2,3(1), O:2(2), O:3(2), O:4(1), O:4,32(2), O:5(3), O:5,27(2), O:6(2), O:6,30(2), O:6,31(2), O:7,8(8), O:9(2), O:13,17(1), O:13,7(2), O:13a,13b(2), O:14(1), O:15(2) O:20(2), O:21(2), O:25(1), O:25,26(1), O:26,44(1), O:28,50(1), O:34 (1), O:35,36 (1), O:35,52(1), O:41(27)43(2), O:41(27)42 K1(1), O:50(2), O:41(27)K1(1),O:41,43(1), K1 non-typable (2), non-typable(4)
Yersinia pseudo-tuberculosis --O:1(1), O:1a(1), O:1b(1), O:2 (2), O:2a(1), O:2b(1), O:2c(1), O:3 (2), O:4a (1), O:4b (1), O:5a (1), O:5b(1), O:6 (1), O:7 (1), O:8 (1), O:9(1),
Yersinia frederikseni -O:16(1),O:35(1)O:48 (1), non-typable (4)
Yersinia intermedia O:16,21(1), O:52,54 (1)O:16,21(1)-
Yersinia kristenseni O:12,25(1), O:16(2), non-typable (1)O:12.25(1), O:16(2), non-typable (1)non-typable(1)
Yersinia mollareti --O:59(20,36,7) (1)
Yersinia pestis --(2)
Yersinia bercoveri --O:58,16(1), non-typable(1)
Yersinia ruckeri (2)(2)-
Providencia rettgeri --(1)
Salmonella typhimurium --(1)
Shigella flexneri (1)(1)-

3.2. Genome Analysis

The RNA-sequencing reads not aligned to the Y. ruckeri reference genome were de novo assembled; the obtained unique contigs with no sequence similarity to Yersinia species were scaffolded (Figure 1). This resulted in nine qualified scaffolds comprising altogether 143,296 bp (Table 2, Figure 2). The overall GC ratio of the YerA41 genome was 32.3%. Importantly, the scaffolds did not present any significant overall identity to nucleotide sequences stored in databases.
Figure 1

Workflow of the RNA-seq approach. Freshly diluted Y. ruckeri RS41 bacteria were grown until OD600 = 0.6 and then infected with phage YerA41 at MOI equal to 10 or 50. The culture was washed with LB to remove the unbound phage particles, and thus prevent re-infection of bacterial cells at later stages of the experiment. Samples for RNA isolation were taken at different time points p.i. (0, 5, 15, 33, 45, 63, 75, 92 min). After the removal of bacterial rRNA, the prepared libraries were sequenced. The obtained sequencing reads were quality filtered and aligned against Y. ruckeri PBH2 chromosome (Acc.no. LN681231.1) and plasmids pYR2 (Acc.no. LN681229.1) and pYR3 (Acc.no. LN681230.1). The reads that failed to align to these reference sequences were merged together and assembled using Velvet [13] and SPAdes [14]. The obtained assembled sequences were blasted against the NCBI nucleotide collection and contigs showing high identity rates with Yersinia strains were excluded. The phage genomic scaffolds were auto-annotated using Rapid Annotation Using Subsystem Technology (RAST) [20]. Presence of suitable ribosomal binding sites in front of each predicted start codon was confirmed. Figure was created using BioRender (https://app.biorender.com).

Table 2

The assembled scaffolds of YerA41 genome. The numbering of the scaffolds is based on their length and does not reflect their actual position in the phage genome.

IDSize [bp]GC%
scaffold_142 98734.5
scaffold_227 37731.6
scaffold_326 59131.7
scaffold_415 13432.0
scaffold_511 50232.8
scaffold_68 24029.6
scaffold_75 28728.9
scaffold_83 67229.6
scaffold_92 50629.6
TOTAL/Average: 143 296 32.3
Figure 2

Gene organization of the nine scaffolds of the phage YerA41 genome. The scaffolds are organized based on size. The gene colors indicate predicted functions of their products: Green, hypothetical proteins; Blue, RNA polymerase subunits; Red, DNA polymerase-like proteins; Brown, phage particle associated (structural) proteins. The figure was generated using Geneious 10.2.6 (www.geneious.com).

A total of 201 putative genes were detected by RAST analysis and manual annotation. Predicted functions could be assigned to 60 of the 201 gene products, the others showed no significant similarity to any protein sequences in the databases (Table 3 and Table S2). The predicted YerA41 gene products showed similarity to DNA polymerases (Gp061, Gp097, Gp137, Gp0195) and RNA polymerase β-, β’- and other subunits (Gp019, Gp054, Gp055, Gp056, Gp162), helicases (Gp143, Gp145, Gp193), topoisomerases (Gp135, Gp136, Gp190), and a DNA ligase (Gp199). In addition, the predicted genes encode for enzymes involved in nucleoside metabolism, including 5′-deoxynucleotidase (Gp060), dCTP deaminase (Gp114), dUTP diphosphatase (Gp141), thymidylate synthetase (Gp127), ribonucleoside-diphosphate reductase (Gp110), ribonucleotide reductase (Gp109), and several endo- and exo-nucleases (Gp126, Gp149, Gp167, Gp179, Gp180, Gp189). The other gene products with recognizable function encoded different phage structural proteins. Three tRNA genes encoding tRNA-Arg, tRNA-Met, and tRNA-Leu, were identified by both ARAGORN and tRNA-SCAN, all located in a module on the scaffold_3. Finally, the gene g064-g070 products showed similarity to UDP-GlcNAc 2-epimerase, SDR family oxidoreductases, polysaccharide deacetylase, 2-C-methyl-D-erythritol-4-phosphate cytidylyltransferase, and glycerophosphodiester phosphodiesterase, suggesting them roles in biosynthesis of sugar-modified nucleotides.
Table 3

Temporal expression profiles of the gene products of phage YerA41 having predicted functions. The full list of all gene products, including the hypothetical proteins of unknown function, is presented in Supplementary Table S2. The LC-MS/MS-identified PPAPs are marked with asterisk.

Temporal Expression Gene ID Scaffold Putative Functions of Gene Products Based on Database Similarity
Early g054 2DNA directed RNA polymerase, subunit*
g055 2DNA-directed RNA polymerase, subunit*
g056 2DNA-directed RNA polymerase*
g078 2Lytic transglycosylase
g097 3DNA polymerase III, subunit
g116 3RNA 2’-phosphotransferase
g126 3Endonuclease-like protein
g135 4DNA topoisomerase*
g161 5Tail fiber protein
g162 5DNA directed RNA polymerase, subunit / Putative DNA helicase
g189 7Endonuclease-like protein
g190 7DNA topoisomerase*
g193 7Helicase*
g195 8DNA polymerase
g199 9DNA ligase*
Middle g060 25’-deoxynucleotidase
g061 2DNA polymerase*
g064 2UDP-GlcNAc 2-epimerase
g065 2Oxidoreductase
g066 2SDR family oxidoreductase
g067 2Polysaccharide deacetylase
g068 22-C-methyl-D-erythritol 4-phosphate cytidylyltransferase (EC 2.7.7.60)
g070 2Glycerophosphodiester phosphodiesterase
g109 3Ribonucleotide reductase
g110 3Ribonucleoside-diphosphate reductase subunit alpha*
g112 3Ribosomal protein modification protein*
g114 3dCTP deaminase
g127 3Thymidylate synthetase
g133 4Transglycosylase
g136 4DNA topoisomerase*
g137 4DNA polymerase III, subunit*
g140 4Structural protein
g141 4dUTP diphosphatase*
g143 4ATP-dependent DNA helicase*
g145 4Replicative helicase
g149 4Exodeoxyribonuclease*
g167 5Endonuclease*
g174 5Phage baseplate assembly protein
g179 6Exonuclease
g180 6Exonuclease*
Late g007 1DNA packaging terminase*
g012 1Prohead core protein protease*
g014 1Capsid protein*
g016 1Sugar binding protein*
g019 1RNA polymerase sigma factor*
g024 1Phage tail sheath protein*
g025 1Tail protein
g028 1Tail family protein*
g031 1Tail protein
g032 1Tail protein
g034 1Tail-associated lysozyme
g035 1Tail-associated lysozyme
g037 1Baseplate wedge protein*
g040 1Capsid protein
g041 1Virion structural protein
g042 1Baseplate wedge protein*
g043 1Tail fiber protein*
g044 1Phage tail fiber assembly protein*
g045 1Tail fiber protein*
g049 1Endolysin*

3.3. Proteomic Analysis of the Phage Structural Proteins

To determine the phage particle associated and structural proteins, a proteomic analysis of the purified phage particles using LC-MS/MS was carried out. The structural proteins were identified through the comparative analysis of the obtained tryptic peptide sequence patterns and the sequence-based in silico determined tryptic peptide sequences of phage proteins. Altogether, 63 phage proteins were reliably identified in the LC-MS/MS analysis (Table S2). The analysis revealed among the identified proteins structural proteins such as major capsid, tail sheath, baseplate wedge, tail fiber, and tail fiber assembly proteins. In addition, several DNA- or RNA modifying enzymes, such as DNA polymerases, helicases, recombinases, topoisomerases, endo- and exonucleases, and RNA polymerase subunits and a sigma factor were phage particle associated proteins (PPAPs, Table S2). These are likely to be injected into the bacteria along with the genomic DNA to take over the host metabolism as soon as possible after infection (Table S2).

3.4. Temporal Expression of Phage Genes

The heatmap of YerA41 genes expressed at different time points revealed evident temporal gene expression (Figure 3). The mean values of expression of early (5–15 min p.i.), middle (33–45 min), and late (63–92 min) were calculated and compared between each other. A gene was assigned to certain class based on the post infection time when its expression peaked (Table 3 and Table S2). In total 47 genes (23.4% of phage putative genes) had their highest expression in the first min post infection. The function of the majority of the gene products remained unknown, yet six of the gene products were predicted to be involved in nucleic acid processing. Interestingly, the genes of the early phase were scattered across several scaffolds. The heatmap (Figure 3) shows that the YerA41 gene expression progresses rapidly and indistinctly to the middle phase. This phase was characterized by the expression of 101 (50.2%) genes, including numerous genes presumably involved in nucleic acid metabolism. During the latest state of infection, 51 (25.4%) genes were induced. According to the in silico analysis, the majority of these genes encode for the structural proteins of phage particles. The remaining two putative genes (the g200 and g201 genes) showed minimal expression with no changes throughout the course of infection; thus, it is highly probable that they constitute bioinformatics artifacts.
Figure 3

Heatmap of YerA41 genes expressed at different time points post infection. Gene expression values were normalized to the highest expression to show the timing of expression; therefore, the intensity of color on the heatmap reflects the difference of expression of one gene at different time points, yet not the difference of expression between different genes.

3.5. Bacterial Response to Lytic Infection

It is crucial to get insight into different phases of phage infection in order to understand the bacterial host response. Based on the one-step growth curve and the results of short time interval RNA-sequencing, three different time points: 15, 30, and 60 min p.i. were chosen as the representative timepoints for early, middle, and late stages of infection. Analysis of the RNA-sequencing data from three independent biological replicates revealed that Y. ruckeri differentially regulated the expression of 167 genes during the early stage of infection (15 min p.i.). Among the differentially expressed genes of the bacterial host (4.6%), 38% of the genes showed upregulation, and the remaining 62% showed downregulation (Table 4). Most of the upregulated genes encoded products implicated in the protection of bacterial cells from oxidative damage. These included genes encoding catalase (CSF007_12285), thioredoxin (CSF007_14055), glutaredoxin (CSF007_6685), peroxiredoxin family protein (CSF007_17480), glutathione reductase (CSF007_0665), and thioredoxin reductase (CSF007_6870). Additionally, RNA-sequencing showed the induction of genes involved in glycolysis, namely the genes encoding dihydrolipoamide dehydrogenase (CSF007_17485), glucose-6-phosphate isomerase (CSF007_16360), and pyruvate kinase (CSF007_9590), as well as three genes involved in the biosynthesis of siderophores (CSF007_15200, CSF007_15205, CSF007_15210). Among the host bacterium genes presenting the strongest upregulation p.i. was dps (CSF007_6300), which encodes a non-specific DNA-binding protein involved in DNA protection during exposure to severe environmental insult. An increase in expression was also observed for host bacterium genes involved in the biosynthesis of antibacterial agents like polymyxin and enterobactin (CSF007_15260, CSF007_15255, CSF007_15265). In contrast, the bacteria downregulated genes that are involved in the metabolism of different carbohydrates, as well as several genes encoding the succinate dehydrogenase complex (CSF007_5810, CSF007_5800, CSF007_5795, CSF007_5805).
Table 4

Transcriptional response of Y. ruckeri to infection with YerA41. The list of bacterial genes showing significant (p-value < 0.001) differential expression at both 15 min and 30 min time points compared to non-infected bacteria. The lists of genes differentially expressed at different time points (15, 30 and 60 min p.i.) are presented in Supplementary Table S3. LogFC; log-ratio of a transcript’s expression values in two different conditions. FDR; False Discovery Rate.

Gene ID Function 15 min 30 min
logFC FDR logFC FDR
CSF007_17485Dihydrolipoamide dehydrogenase7.071.97 × 10−1461.282.16 × 10−07
CSF007_17480Peroxiredoxin family protein/glutaredoxin6.954.94 × 10−1281.127.36 × 10−05
CSF007_6300Non-specific DNA-binding protein Dps / Iron-binding ferritin-like antioxidant protein / Ferroxidase4.394.13 × 10−661.905.48 × 10−11
CSF007_12285Catalase4.103.26 × 10−591.549.40 × 10−09
CSF007_9590Pyruvate kinase1.957.27 × 10−152.119.00 × 10−13
CSF007_11760Putrescine importer1.431.44 × 10−071.714.96 × 10−05
CSF007_5840Cytochrome d ubiquinol oxidase subunit I1.211.89 × 10−061.490.00011
CSF007_5845Cytochrome d ubiquinol oxidase subunit II1.141.98 × 10−061.482.78 × 10−05
CSF007_13405Inosine-5-monophosphate dehydrogenase 0.910.000341.190.00017
CSF007_12920hypothetical protein0.880.000141.110.00015
CSF007_5505hypothetical protein−0.781.77 × 10−05−1.181.44 × 10−05
CSF007_14715Glycine cleavage system H protein−0.810.00044−1.883.38 × 10−07
CSF007_9025Alkyl sulfatase−0.932.94 × 10−06−1.321.03 × 10−05
CSF007_13885D-ribulokinase−0.946.94 × 10−07−2.049.33 × 10−14
CSF007_13880Phosphosugar isomerase/binding protein−1.012.19 × 10−06−1.986.52 × 10−09
CSF007_1760Aspartate ammonia-lyase−1.026.88 × 10−05−1.983.86 × 10−12
CSF007_0675Oligopeptidase A−1.030.00043−1.290.00043
CSF007_9680Hemin transport protein HmuS−1.061.77 × 10−05−1.380.00097
CSF007_17975Glutamine synthetase type I−1.090.000141.924.07 × 10−05
CSF007_14720Aminomethyltransferase (glycine cleavage system T protein)−1.113.59 × 10−09−1.697.18 × 10−10
CSF007_11035Transcriptional repressor of PutA and PutP / Proline dehydrogenase (Proline oxidase) / Delta-1-pyrroline-5-carboxylate dehydrogenase−1.129.24 × 10−06−2.145.30 × 10−14
CSF007_13080NADP-dependent malic enzyme−1.129.98 × 10−08−1.682.82 × 10−09
CSF007_6400Galactose/methyl galactoside ABC transport system ATP-binding protein MglA−1.131.92 × 10−07−1.726.99 × 10−09
CSF007_0690Universal stress protein A−1.201.55 × 10−07−1.565.98 × 10−06
CSF007_0605Aerobic C4-dicarboxylate transporter for fumarate/L-malate/D-malate/succunate−1.231.07 × 10−09−1.090.00046
CSF007_1210Cyclic AMP receptor protein−1.325.98 × 10−08−1.421.30 × 10−05
CSF007_024516 kDa heat shock protein A−1.370.00033−1.806.55 × 10−06
CSF007_5820Dihydrolipoamide succinyltransferase component (E2) of 2-oxoglutarate dehydrogenase complex−1.417.97 × 10−08−2.871.10 × 10−12
CSF007_18075Ribose ABC transport system periplasmic ribose-binding protein RbsB−1.411.96 × 10−11−1.578.42 × 10−07
CSF007_16000hypothetical protein−1.422.65 × 10−06−1.996.12 × 10−06
CSF007_11865Mannonate dehydratase−1.468.78 × 10−10−2.136.04 × 10−12
CSF007_13895Ribose ABC transport system permease protein RbsC−1.481.29 × 10−12−2.077.16 × 10−12
CSF007_0935Transcriptional activator of maltose regulon MalT−1.497.07 × 10−14−1.488.42 × 10−07
CSF007_16315Maltose operon periplasmic protein MalM−1.516.85 × 10-06−2.030.00043
CSF007_18085Ribose ABC transport system ATP-binding protein RbsA−1.517.72 × 10−10−1.852.56 × 10−06
CSF007_9675TonB-dependent hemin ferrichrome receptor−1.559.46 × 10−16−1.130.00018
CSF007_5825Succinyl-CoA ligase [ADP-forming] beta chain−1.607.19 × 10−08−2.865.80 × 10−10
CSF007_5830Succinyl-CoA ligase [ADP-forming] alpha chain−1.631.56 × 10−09−3.014.78 × 10−14
CSF007_18090Ribose ABC transport system high affinity permease RbsD−1.661.10 × 10−11−2.164.82 × 10−08
CSF007_16340Maltose/maltodextrin ABC transporter substrate binding periplasmic protein MalE−1.685.91 × 10−08−2.031.45 × 10−06
CSF007_3355Aconitate hydratase 2−1.693.40 × 10−12−1.661.56 × 10−06
CSF007_58152-oxoglutarate dehydrogenase E1 component−1.802.19 × 10−13−2.963.74 × 10−18
CSF007_9550Putative transport protein−1.804.34 × 10−12−1.512.19 × 10−06
CSF007_16325Maltose/maltodextrin transport ATP-binding protein MalK−1.816.80 × 10−06−2.933.65 × 10−08
CSF007_9650Phosphoenolpyruvate synthase−1.826.56 × 10−15−2.541.16 × 10−09
CSF007_11875D-mannonate oxidoreductase−1.873.62 × 10−13−2.574.08 × 10−14
CSF007_12965Sialic acid transporter (permease) NanT−1.922.90 × 10−12−2.724.08 × 10−14
CSF007_13900Ribose/xylose/arabinose/galactoside ABC-type transport system ATP-binding protein−2.082.79 × 10−28−2.067.97 × 10−05
CSF007_6395Galactose/methyl galactoside ABC transport system galactose-binding periplasmic protein MglB−2.082.72 × 10−14−2.791.40 × 10−17
CSF007_11455hypothetical protein−2.128.23 × 10−28−1.574.48 × 10−05
CSF007_0865Gluconokinase−2.129.37 × 10−17−1.807.28 × 10−06
CSF007_12460membrane protein−2.213.34 × 10−18−2.357.75 × 10−09
CSF007_15720Hexuronate transporter−2.387.15 × 10−28−2.401.25 × 10−10
CSF007_17650Glycerol uptake facilitator protein−2.391.55 × 10−28−2.085.48 × 10−11
CSF007_16005Trehalose-6-phosphate hydrolase−2.463.37 × 10−14−3.863.31 × 10−24
CSF007_5810Succinate dehydrogenase iron-sulfur protein−2.471.33 × 10−16−3.066.67 × 10−15
CSF007_13910Ribose/xylose/arabinose/galactoside ABC-type transport system periplasmic sugar binding protein−2.481.26 × 10−19−3.701.55 × 10−33
CSF007_5800Succinate dehydrogenase hydrophobic membrane anchor protein−2.494.39 × 10−19−2.641.15 × 10−10
CSF007_17655Glycerol kinase−2.561.20 × 10−19−3.312.81 × 10−15
CSF007_5795Succinate dehydrogenase cytochrome b-556 subunit−2.644.76 × 10−26−2.049.68 × 10−13
CSF007_15715hypothetical protein−2.642.61 × 10−10−3.741.21 × 10−17
CSF007_5805Succinate dehydrogenase flavoprotein subunit−2.652.01 × 10−22−3.183.94 × 10−18
CSF007_12450Ascorbate-specific PTS system, EIIA component−3.076.74 × 10−23−3.176.76 × 10−10
CSF007_12455Putative sugar phosphotransferase component II B−3.211.62 × 10−22−3.521.76 × 10−09
CSF007_5790Citrate synthase −3.238.51 × 10−22−3.128.36 × 10−10
CSF007_13905hypothetical protein−3.282.14 × 10−13−5.246.87 × 10−10
CSF007_16010PTS system, trehalose-specific IIB component-PTS system−3.393.53 × 10−41−3.851.29 × 10−27
At the mid time point, 30 min p.i., altogether 98 host bacterium genes presented significantly differential pattern of expression. Of the genes, 78% were significantly downregulated in response to the ongoing YerA41 infection. Further decrease in the expression of host bacterium genes encoding for succinate dehydrogenase complex (CSF007_5805, CSF007_5810, CSF007_5830, CSF007_5800, CSF007_5825, CSF007_5795) and those involved in carbohydrate metabolism and transport of sugars was observed when compared to the 15 min (early) time point (Table 4). A substantially smaller fraction (22%) of host bacterium genes were upregulated when compared to uninfected bacteria. That includes increase in expression of genes encoding cytochrome d ubiquinol oxidase subunits I and II (CSF007_5840 and CSF007_5845) and DNA-binding protein Fis (CSF007_16180)—a prominent factor implicated in bacterial gene regulation. Yet, the strongest overexpression was observed for the cold shock protein CspG (CSF007_10385). Similar to the initial phase of infection, bacteria responded by positive induction of expression of genes encoding the non-specific DNA-binding protein Dps, and catalase. Interestingly, during the late phase, (60 min p.i.), only 15 host bacterium genes showed significantly different expression level when compared to the uninfected bacteria. All but one (phosphoenolpyruvate synthase, CSF007_9650) showed downregulation of expression. The strongest decrease in expression was observed for genes implicated in the metabolism of fructose, including the fructose-specific phosphocarrier protein HPr, fructose-specific PTS system component and 1-phosphofructokinase (CSF007_11820, CSF007_11810, CSF007_11815). Moreover, during this phase, bacteria downregulated the expression of genes encoding for factors involved in ribonucleotide reduction, such as protein NrdH and reductases of class Ib (CSF007_13980, CSF007_13985, CSF007_13990).

4. Discussion

In this study, we show that a transcriptomic approach can be used to obtain the genomic sequence of infectious organisms that cannot be sequenced using traditional DNA based sequencing approaches as they may possess hypermodified deoxyribonucleotides. The major advantage of the method is that it allows us to obtain the sequence of the viral transcriptome and the insight into the phage-host interaction simultaneously during the infection process. Since the phage genomes are characteristically compactly packed with minimal non-coding regions, their transcriptomes constitute the vast majority of genomic sequences, leaving very little of the unresolved genomic sequence. We performed our RNA-sequencing study on bacterial cells infected at a relatively high MOI (MOI = 50) compared to other phage-host analyses [21,22]. At this MOI value, nearly all the bacterial cells in the culture are infected by the phage; therefore, the pattern of gene expression displayed in this study should reflect the bacterial response to lytic phage infection. One interesting phenomenon observed in this study was that the number of differentially expressed bacterial host genes decreased throughout the course of infection. We believe that it is caused by some degree of degradation of host transcripts, in combination with differences in ratio between the infected and uninfected bacteria in later phases of infection. In this situation, the degraded RNA transcripts from the infected cells would be lost. Based upon active gene expression during infection, the YerA41 genome contains >201 putative genes; however, of these, only 60 could be assigned a predictive function. The remaining gene products exibited very limited amino acid sequence similarity to other proteins deposited in the databases, further illustrating the novelty of the YerA41 bacteriophage. Among the known gene products, there were several DNAP, RNAP β- and β’-subunits, topoisomerases, DNA ligase, helicases, as well as endo- and exonucleases. The functional analysis of these genes also revealed the presence of a putative endolysin (g49). Due to the ability to lyse bacterial cell walls, endolysins are considered to be of special interest as potential novel antimicrobials [23,24]. A very interesting group of genes were identified from scaffold 2, where the gene g064-g070 products showed similarities sugar biosynthesis related enzymes, suggesting that they might play a role in biosynthesis of sugar-modified nucleotides. The obtained genomic sequence indicates that YerA41 is a member of a novel, previously unidentified, group of bacteriophages. At the nucleotide level it shares no significant similarity with genomic sequences of any known organisms deposited in the databases. Conversely, the in silico analysis revealed the presence of multiple unique proteins that are predicted to be involved in nucleic acid processing and metabolism. Taking this into consideration, it is logical that it is equipped with its own machinery for transcription and amplification of the genetic information. At the moment, the exact nature of the nucleotide modification present in the genome of YerA41 is still unknown, but the research tackling this question is ongoing.
  23 in total

Review 1.  The biosynthesis and biological role of lipopolysaccharide O-antigens of pathogenic Yersiniae.

Authors:  Mikael Skurnik; José A Bengoechea
Journal:  Carbohydr Res       Date:  2003-11-14       Impact factor: 2.104

2.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

Review 3.  Biosynthesis and Function of Modified Bases in Bacteria and Their Viruses.

Authors:  Peter Weigele; Elisabeth A Raleigh
Journal:  Chem Rev       Date:  2016-06-20       Impact factor: 60.622

4.  Yersiniophage phiR1-37 is a tailed bacteriophage having a 270 kb DNA genome with thymidine replaced by deoxyuridine.

Authors:  Saija Kiljunen; Kristo Hakala; Elise Pinta; Suvi Huttunen; Patrycja Pluta; Aneta Gador; Harri Lönnberg; Mikael Skurnik
Journal:  Microbiology       Date:  2005-12       Impact factor: 2.777

5.  Crystal structure of deoxycytidylate hydroxymethylase from bacteriophage T4, a component of the deoxyribonucleoside triphosphate-synthesizing complex.

Authors:  H K Song; S H Sohn; S W Suh
Journal:  EMBO J       Date:  1999-03-01       Impact factor: 11.598

6.  THE GROWTH OF BACTERIOPHAGE.

Authors:  E L Ellis; M Delbrück
Journal:  J Gen Physiol       Date:  1939-01-20       Impact factor: 4.086

7.  HTSeq--a Python framework to work with high-throughput sequencing data.

Authors:  Simon Anders; Paul Theodor Pyl; Wolfgang Huber
Journal:  Bioinformatics       Date:  2014-09-25       Impact factor: 6.937

8.  Comparative transcriptomics analyses reveal the conservation of an ancestral infectious strategy in two bacteriophage genera.

Authors:  Bob G Blasdel; Anne Chevallereau; Marc Monot; Rob Lavigne; Laurent Debarbieux
Journal:  ISME J       Date:  2017-05-12       Impact factor: 10.302

9.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

10.  The RAST Server: rapid annotations using subsystems technology.

Authors:  Ramy K Aziz; Daniela Bartels; Aaron A Best; Matthew DeJongh; Terrence Disz; Robert A Edwards; Kevin Formsma; Svetlana Gerdes; Elizabeth M Glass; Michael Kubal; Folker Meyer; Gary J Olsen; Robert Olson; Andrei L Osterman; Ross A Overbeek; Leslie K McNeil; Daniel Paarmann; Tobias Paczian; Bruce Parrello; Gordon D Pusch; Claudia Reich; Rick Stevens; Olga Vassieva; Veronika Vonstein; Andreas Wilke; Olga Zagnitko
Journal:  BMC Genomics       Date:  2008-02-08       Impact factor: 3.969

View more
  3 in total

1.  Phage Genome Annotation: Where to Begin and End.

Authors:  Anastasiya Shen; Andrew Millard
Journal:  Phage (New Rochelle)       Date:  2021-12-16

2.  Temporal Transcriptional Responses of a Vibrio alginolyticus Strain to Podoviridae Phage HH109 Revealed by RNA-Seq.

Authors:  Xixi Li; Ce Zhang; Xingkun Jin; Fucheng Wei; Fei Yu; Douglas R Call; Zhe Zhao
Journal:  mSystems       Date:  2022-04-11       Impact factor: 7.324

3.  The DNA polymerase of bacteriophage YerA41 replicates its T-modified DNA in a primer-independent manner.

Authors:  Miguel V Gomez-Raya-Vilanova; Katarzyna Leskinen; Arnab Bhattacharjee; Pasi Virta; Petja Rosenqvist; Jake L R Smith; Oliver W Bayfield; Christina Homberger; Tobias Kerrinnes; Jörg Vogel; Maria I Pajunen; Mikael Skurnik
Journal:  Nucleic Acids Res       Date:  2022-04-22       Impact factor: 19.160

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.