| Literature DB >> 32517038 |
Katarzyna Leskinen1, Maria I Pajunen1, Miguel Vincente Gomez-Raya Vilanova1, Saija Kiljunen1, Andrew Nelson2, Darren Smith2, Mikael Skurnik1,3.
Abstract
YerA41 is a Myoviridae bacteriophage that was originally isolated due its ability to infect Yersinia ruckeri bacteria, the causative agent of enteric redmouth disease of salmonid fish. Several attempts to determine its genomic DNA sequence using traditional and next generation sequencing technologies failed, indicating that the phage genome is modified in such a way that it is an unsuitable template for PCR amplification and for conventional sequencing. To determine the YerA41 genome sequence, we performed RNA-sequencing from phage-infected Y. ruckeri cells at different time points post-infection. The host-genome specific reads were subtracted and de novo assembly was performed on the remaining unaligned reads. This resulted in nine phage-specific scaffolds with a total length of 143 kb that shared only low level and scattered identity to known sequences deposited in DNA databases. Annotation of the sequences revealed 201 predicted genes, most of which found no homologs in the databases. Proteome studies identified altogether 63 phage particle-associated proteins. The RNA-sequencing data were used to characterize the transcriptional control of YerA41 and to investigate its impact on the bacterial gene expression. Overall, our results indicate that RNA-sequencing can be successfully used to obtain the genomic sequence of non-sequencable phages, providing simultaneous information about the phage-host interactions during the process of infection.Entities:
Keywords: RNA-sequencing; YerA41; Yersinia ruckeri; bacteriophage; genome assembly; nucleotide modification; transcriptome
Mesh:
Substances:
Year: 2020 PMID: 32517038 PMCID: PMC7354516 DOI: 10.3390/v12060620
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Host range of bacteriophage YerA41 (numbers of studied strains for each species and serotype are given in parenthesis). The sensitivity was tested both at RT and at 37 °C.
| Bacterial Species | YerA41 Sensitive Serotypes | YerA41 | YerA41 |
|---|---|---|---|
|
| O:8 (5) | O:8 (5) | O:1(2),O:1,2,3(1), O:2(2), O:3(2), O:4(1), O:4,32(2), O:5(3), O:5,27(2), O:6(2), O:6,30(2), O:6,31(2), O:7,8(8), O:9(2), O:13,17(1), O:13,7(2), O:13a,13b(2), O:14(1), O:15(2) O:20(2), O:21(2), O:25(1), O:25,26(1), O:26,44(1), O:28,50(1), O:34 (1), O:35,36 (1), O:35,52(1), O:41(27)43(2), O:41(27)42 K1(1), O:50(2), O:41(27)K1(1),O:41,43(1), K1 non-typable (2), non-typable(4) |
|
| - | - | O:1(1), O:1a(1), O:1b(1), O:2 (2), O:2a(1), O:2b(1), O:2c(1), O:3 (2), O:4a (1), O:4b (1), O:5a (1), O:5b(1), O:6 (1), O:7 (1), O:8 (1), O:9(1), |
|
| - | O:16(1),O:35(1) | O:48 (1), non-typable (4) |
|
| O:16,21(1), O:52,54 (1) | O:16,21(1) | - |
|
| O:12,25(1), O:16(2), non-typable (1) | O:12.25(1), O:16(2), non-typable (1) | non-typable(1) |
|
| - | - | O:59(20,36,7) (1) |
|
| - | - | (2) |
|
| - | - | O:58,16(1), non-typable(1) |
|
| (2) | (2) | - |
|
| - | - | (1) |
|
| - | - | (1) |
|
| (1) | (1) | - |
Figure 1Workflow of the RNA-seq approach. Freshly diluted Y. ruckeri RS41 bacteria were grown until OD600 = 0.6 and then infected with phage YerA41 at MOI equal to 10 or 50. The culture was washed with LB to remove the unbound phage particles, and thus prevent re-infection of bacterial cells at later stages of the experiment. Samples for RNA isolation were taken at different time points p.i. (0, 5, 15, 33, 45, 63, 75, 92 min). After the removal of bacterial rRNA, the prepared libraries were sequenced. The obtained sequencing reads were quality filtered and aligned against Y. ruckeri PBH2 chromosome (Acc.no. LN681231.1) and plasmids pYR2 (Acc.no. LN681229.1) and pYR3 (Acc.no. LN681230.1). The reads that failed to align to these reference sequences were merged together and assembled using Velvet [13] and SPAdes [14]. The obtained assembled sequences were blasted against the NCBI nucleotide collection and contigs showing high identity rates with Yersinia strains were excluded. The phage genomic scaffolds were auto-annotated using Rapid Annotation Using Subsystem Technology (RAST) [20]. Presence of suitable ribosomal binding sites in front of each predicted start codon was confirmed. Figure was created using BioRender (https://app.biorender.com).
The assembled scaffolds of YerA41 genome. The numbering of the scaffolds is based on their length and does not reflect their actual position in the phage genome.
| ID | Size [bp] | GC% |
|---|---|---|
| scaffold_1 | 42 987 | 34.5 |
| scaffold_2 | 27 377 | 31.6 |
| scaffold_3 | 26 591 | 31.7 |
| scaffold_4 | 15 134 | 32.0 |
| scaffold_5 | 11 502 | 32.8 |
| scaffold_6 | 8 240 | 29.6 |
| scaffold_7 | 5 287 | 28.9 |
| scaffold_8 | 3 672 | 29.6 |
| scaffold_9 | 2 506 | 29.6 |
|
|
|
|
Figure 2Gene organization of the nine scaffolds of the phage YerA41 genome. The scaffolds are organized based on size. The gene colors indicate predicted functions of their products: Green, hypothetical proteins; Blue, RNA polymerase subunits; Red, DNA polymerase-like proteins; Brown, phage particle associated (structural) proteins. The figure was generated using Geneious 10.2.6 (www.geneious.com).
Temporal expression profiles of the gene products of phage YerA41 having predicted functions. The full list of all gene products, including the hypothetical proteins of unknown function, is presented in Supplementary Table S2. The LC-MS/MS-identified PPAPs are marked with asterisk.
|
|
|
|
|
| Early |
| 2 | DNA directed RNA polymerase, subunit* |
|
| 2 | DNA-directed RNA polymerase, subunit* | |
|
| 2 | DNA-directed RNA polymerase* | |
|
| 2 | Lytic transglycosylase | |
|
| 3 | DNA polymerase III, subunit | |
|
| 3 | RNA 2’-phosphotransferase | |
|
| 3 | Endonuclease-like protein | |
|
| 4 | DNA topoisomerase* | |
|
| 5 | Tail fiber protein | |
|
| 5 | DNA directed RNA polymerase, subunit / Putative DNA helicase | |
|
| 7 | Endonuclease-like protein | |
|
| 7 | DNA topoisomerase* | |
|
| 7 | Helicase* | |
|
| 8 | DNA polymerase | |
|
| 9 | DNA ligase* | |
| Middle |
| 2 | 5’-deoxynucleotidase |
|
| 2 | DNA polymerase* | |
|
| 2 | UDP-GlcNAc 2-epimerase | |
|
| 2 | Oxidoreductase | |
|
| 2 | SDR family oxidoreductase | |
|
| 2 | Polysaccharide deacetylase | |
|
| 2 | 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase (EC 2.7.7.60) | |
|
| 2 | Glycerophosphodiester phosphodiesterase | |
|
| 3 | Ribonucleotide reductase | |
|
| 3 | Ribonucleoside-diphosphate reductase subunit alpha* | |
|
| 3 | Ribosomal protein modification protein* | |
|
| 3 | dCTP deaminase | |
|
| 3 | Thymidylate synthetase | |
|
| 4 | Transglycosylase | |
|
| 4 | DNA topoisomerase* | |
|
| 4 | DNA polymerase III, subunit* | |
|
| 4 | Structural protein | |
|
| 4 | dUTP diphosphatase* | |
|
| 4 | ATP-dependent DNA helicase* | |
|
| 4 | Replicative helicase | |
|
| 4 | Exodeoxyribonuclease* | |
|
| 5 | Endonuclease* | |
|
| 5 | Phage baseplate assembly protein | |
|
| 6 | Exonuclease | |
|
| 6 | Exonuclease* | |
| Late |
| 1 | DNA packaging terminase* |
|
| 1 | Prohead core protein protease* | |
|
| 1 | Capsid protein* | |
|
| 1 | Sugar binding protein* | |
|
| 1 | RNA polymerase sigma factor* | |
|
| 1 | Phage tail sheath protein* | |
|
| 1 | Tail protein | |
|
| 1 | Tail family protein* | |
|
| 1 | Tail protein | |
|
| 1 | Tail protein | |
|
| 1 | Tail-associated lysozyme | |
|
| 1 | Tail-associated lysozyme | |
|
| 1 | Baseplate wedge protein* | |
|
| 1 | Capsid protein | |
|
| 1 | Virion structural protein | |
|
| 1 | Baseplate wedge protein* | |
|
| 1 | Tail fiber protein* | |
|
| 1 | Phage tail fiber assembly protein* | |
|
| 1 | Tail fiber protein* | |
|
| 1 | Endolysin* |
Figure 3Heatmap of YerA41 genes expressed at different time points post infection. Gene expression values were normalized to the highest expression to show the timing of expression; therefore, the intensity of color on the heatmap reflects the difference of expression of one gene at different time points, yet not the difference of expression between different genes.
Transcriptional response of Y. ruckeri to infection with YerA41. The list of bacterial genes showing significant (p-value < 0.001) differential expression at both 15 min and 30 min time points compared to non-infected bacteria. The lists of genes differentially expressed at different time points (15, 30 and 60 min p.i.) are presented in Supplementary Table S3. LogFC; log-ratio of a transcript’s expression values in two different conditions. FDR; False Discovery Rate.
|
|
|
|
| ||
|
|
|
|
| ||
| CSF007_17485 | Dihydrolipoamide dehydrogenase | 7.07 | 1.97 × 10−146 | 1.28 | 2.16 × 10−07 |
| CSF007_17480 | Peroxiredoxin family protein/glutaredoxin | 6.95 | 4.94 × 10−128 | 1.12 | 7.36 × 10−05 |
| CSF007_6300 | Non-specific DNA-binding protein Dps / Iron-binding ferritin-like antioxidant protein / Ferroxidase | 4.39 | 4.13 × 10−66 | 1.90 | 5.48 × 10−11 |
| CSF007_12285 | Catalase | 4.10 | 3.26 × 10−59 | 1.54 | 9.40 × 10−09 |
| CSF007_9590 | Pyruvate kinase | 1.95 | 7.27 × 10−15 | 2.11 | 9.00 × 10−13 |
| CSF007_11760 | Putrescine importer | 1.43 | 1.44 × 10−07 | 1.71 | 4.96 × 10−05 |
| CSF007_5840 | Cytochrome d ubiquinol oxidase subunit I | 1.21 | 1.89 × 10−06 | 1.49 | 0.00011 |
| CSF007_5845 | Cytochrome d ubiquinol oxidase subunit II | 1.14 | 1.98 × 10−06 | 1.48 | 2.78 × 10−05 |
| CSF007_13405 | Inosine-5-monophosphate dehydrogenase | 0.91 | 0.00034 | 1.19 | 0.00017 |
| CSF007_12920 | hypothetical protein | 0.88 | 0.00014 | 1.11 | 0.00015 |
| CSF007_5505 | hypothetical protein | −0.78 | 1.77 × 10−05 | −1.18 | 1.44 × 10−05 |
| CSF007_14715 | Glycine cleavage system H protein | −0.81 | 0.00044 | −1.88 | 3.38 × 10−07 |
| CSF007_9025 | Alkyl sulfatase | −0.93 | 2.94 × 10−06 | −1.32 | 1.03 × 10−05 |
| CSF007_13885 | D-ribulokinase | −0.94 | 6.94 × 10−07 | −2.04 | 9.33 × 10−14 |
| CSF007_13880 | Phosphosugar isomerase/binding protein | −1.01 | 2.19 × 10−06 | −1.98 | 6.52 × 10−09 |
| CSF007_1760 | Aspartate ammonia-lyase | −1.02 | 6.88 × 10−05 | −1.98 | 3.86 × 10−12 |
| CSF007_0675 | Oligopeptidase A | −1.03 | 0.00043 | −1.29 | 0.00043 |
| CSF007_9680 | Hemin transport protein HmuS | −1.06 | 1.77 × 10−05 | −1.38 | 0.00097 |
| CSF007_17975 | Glutamine synthetase type I | −1.09 | 0.00014 | 1.92 | 4.07 × 10−05 |
| CSF007_14720 | Aminomethyltransferase (glycine cleavage system T protein) | −1.11 | 3.59 × 10−09 | −1.69 | 7.18 × 10−10 |
| CSF007_11035 | Transcriptional repressor of PutA and PutP / Proline dehydrogenase (Proline oxidase) / Delta-1-pyrroline-5-carboxylate dehydrogenase | −1.12 | 9.24 × 10−06 | −2.14 | 5.30 × 10−14 |
| CSF007_13080 | NADP-dependent malic enzyme | −1.12 | 9.98 × 10−08 | −1.68 | 2.82 × 10−09 |
| CSF007_6400 | Galactose/methyl galactoside ABC transport system ATP-binding protein MglA | −1.13 | 1.92 × 10−07 | −1.72 | 6.99 × 10−09 |
| CSF007_0690 | Universal stress protein A | −1.20 | 1.55 × 10−07 | −1.56 | 5.98 × 10−06 |
| CSF007_0605 | Aerobic C4-dicarboxylate transporter for fumarate/L-malate/D-malate/succunate | −1.23 | 1.07 × 10−09 | −1.09 | 0.00046 |
| CSF007_1210 | Cyclic AMP receptor protein | −1.32 | 5.98 × 10−08 | −1.42 | 1.30 × 10−05 |
| CSF007_0245 | 16 kDa heat shock protein A | −1.37 | 0.00033 | −1.80 | 6.55 × 10−06 |
| CSF007_5820 | Dihydrolipoamide succinyltransferase component (E2) of 2-oxoglutarate dehydrogenase complex | −1.41 | 7.97 × 10−08 | −2.87 | 1.10 × 10−12 |
| CSF007_18075 | Ribose ABC transport system periplasmic ribose-binding protein RbsB | −1.41 | 1.96 × 10−11 | −1.57 | 8.42 × 10−07 |
| CSF007_16000 | hypothetical protein | −1.42 | 2.65 × 10−06 | −1.99 | 6.12 × 10−06 |
| CSF007_11865 | Mannonate dehydratase | −1.46 | 8.78 × 10−10 | −2.13 | 6.04 × 10−12 |
| CSF007_13895 | Ribose ABC transport system permease protein RbsC | −1.48 | 1.29 × 10−12 | −2.07 | 7.16 × 10−12 |
| CSF007_0935 | Transcriptional activator of maltose regulon MalT | −1.49 | 7.07 × 10−14 | −1.48 | 8.42 × 10−07 |
| CSF007_16315 | Maltose operon periplasmic protein MalM | −1.51 | 6.85 × 10-06 | −2.03 | 0.00043 |
| CSF007_18085 | Ribose ABC transport system ATP-binding protein RbsA | −1.51 | 7.72 × 10−10 | −1.85 | 2.56 × 10−06 |
| CSF007_9675 | TonB-dependent hemin ferrichrome receptor | −1.55 | 9.46 × 10−16 | −1.13 | 0.00018 |
| CSF007_5825 | Succinyl-CoA ligase [ADP-forming] beta chain | −1.60 | 7.19 × 10−08 | −2.86 | 5.80 × 10−10 |
| CSF007_5830 | Succinyl-CoA ligase [ADP-forming] alpha chain | −1.63 | 1.56 × 10−09 | −3.01 | 4.78 × 10−14 |
| CSF007_18090 | Ribose ABC transport system high affinity permease RbsD | −1.66 | 1.10 × 10−11 | −2.16 | 4.82 × 10−08 |
| CSF007_16340 | Maltose/maltodextrin ABC transporter substrate binding periplasmic protein MalE | −1.68 | 5.91 × 10−08 | −2.03 | 1.45 × 10−06 |
| CSF007_3355 | Aconitate hydratase 2 | −1.69 | 3.40 × 10−12 | −1.66 | 1.56 × 10−06 |
| CSF007_5815 | 2-oxoglutarate dehydrogenase E1 component | −1.80 | 2.19 × 10−13 | −2.96 | 3.74 × 10−18 |
| CSF007_9550 | Putative transport protein | −1.80 | 4.34 × 10−12 | −1.51 | 2.19 × 10−06 |
| CSF007_16325 | Maltose/maltodextrin transport ATP-binding protein MalK | −1.81 | 6.80 × 10−06 | −2.93 | 3.65 × 10−08 |
| CSF007_9650 | Phosphoenolpyruvate synthase | −1.82 | 6.56 × 10−15 | −2.54 | 1.16 × 10−09 |
| CSF007_11875 | D-mannonate oxidoreductase | −1.87 | 3.62 × 10−13 | −2.57 | 4.08 × 10−14 |
| CSF007_12965 | Sialic acid transporter (permease) NanT | −1.92 | 2.90 × 10−12 | −2.72 | 4.08 × 10−14 |
| CSF007_13900 | Ribose/xylose/arabinose/galactoside ABC-type transport system ATP-binding protein | −2.08 | 2.79 × 10−28 | −2.06 | 7.97 × 10−05 |
| CSF007_6395 | Galactose/methyl galactoside ABC transport system galactose-binding periplasmic protein MglB | −2.08 | 2.72 × 10−14 | −2.79 | 1.40 × 10−17 |
| CSF007_11455 | hypothetical protein | −2.12 | 8.23 × 10−28 | −1.57 | 4.48 × 10−05 |
| CSF007_0865 | Gluconokinase | −2.12 | 9.37 × 10−17 | −1.80 | 7.28 × 10−06 |
| CSF007_12460 | membrane protein | −2.21 | 3.34 × 10−18 | −2.35 | 7.75 × 10−09 |
| CSF007_15720 | Hexuronate transporter | −2.38 | 7.15 × 10−28 | −2.40 | 1.25 × 10−10 |
| CSF007_17650 | Glycerol uptake facilitator protein | −2.39 | 1.55 × 10−28 | −2.08 | 5.48 × 10−11 |
| CSF007_16005 | Trehalose-6-phosphate hydrolase | −2.46 | 3.37 × 10−14 | −3.86 | 3.31 × 10−24 |
| CSF007_5810 | Succinate dehydrogenase iron-sulfur protein | −2.47 | 1.33 × 10−16 | −3.06 | 6.67 × 10−15 |
| CSF007_13910 | Ribose/xylose/arabinose/galactoside ABC-type transport system periplasmic sugar binding protein | −2.48 | 1.26 × 10−19 | −3.70 | 1.55 × 10−33 |
| CSF007_5800 | Succinate dehydrogenase hydrophobic membrane anchor protein | −2.49 | 4.39 × 10−19 | −2.64 | 1.15 × 10−10 |
| CSF007_17655 | Glycerol kinase | −2.56 | 1.20 × 10−19 | −3.31 | 2.81 × 10−15 |
| CSF007_5795 | Succinate dehydrogenase cytochrome b-556 subunit | −2.64 | 4.76 × 10−26 | −2.04 | 9.68 × 10−13 |
| CSF007_15715 | hypothetical protein | −2.64 | 2.61 × 10−10 | −3.74 | 1.21 × 10−17 |
| CSF007_5805 | Succinate dehydrogenase flavoprotein subunit | −2.65 | 2.01 × 10−22 | −3.18 | 3.94 × 10−18 |
| CSF007_12450 | Ascorbate-specific PTS system, EIIA component | −3.07 | 6.74 × 10−23 | −3.17 | 6.76 × 10−10 |
| CSF007_12455 | Putative sugar phosphotransferase component II B | −3.21 | 1.62 × 10−22 | −3.52 | 1.76 × 10−09 |
| CSF007_5790 | Citrate synthase | −3.23 | 8.51 × 10−22 | −3.12 | 8.36 × 10−10 |
| CSF007_13905 | hypothetical protein | −3.28 | 2.14 × 10−13 | −5.24 | 6.87 × 10−10 |
| CSF007_16010 | PTS system, trehalose-specific IIB component-PTS system | −3.39 | 3.53 × 10−41 | −3.85 | 1.29 × 10−27 |