Literature DB >> 31410079

Complete genome sequence analysis of a lytic Shigella flexneri vB-SflS-ISF001 bacteriophage.

Khashayar Shahin1,2, Majid Bouzari1, Ran Wang2.   

Abstract

Shigellosis is one of the most important acute enteric infections caused by different species of Shigella, such as Shigella flexneri. Despite the use of antibiotic therapy to reduce disease duration, this approach is becoming less effective due to the emergence of antibiotic resistance among Shigella spp. Bacteriophages have been introduced as an alternative for controlling shigellosis. However, the bacteriophages must be without any lysogenic or virulence factors, toxin coding, or antibiotic-resistant genes. In this study, the whole genome sequence of vB-SflS-ISF001, a virulent Siphoviridae bacteriophage specific for Shigella flexneri, was obtained, and a comparative genomic analysis was carried out to identify its properties and safety. vB-SflS-ISF001 genomic DNA was measured at 50,552 bp with 78 deduced open reading frames (ORFs), with 24 ORFs (30.77%) sharing similarities with proteins from the genomes of homologous phages that had been reported earlier. Genetic analysis classifies it under the genus T1virus of the subfamily Tunavirinae . Moreover, comparative genomic analysis revealed no undesirable genes in the genome of vB-SflS-ISF001, such as antibiotic resistance, virulence, lysogeny, or toxin-coding genes. The results of this investigation indicate that vB-SflS-ISF001 is a new species, and confirm its safety for the biocontrol of S. flexneri.

Entities:  

Keywords:  Bacteriophage; Shigella flexneri; Siphoviridae; T1virus; whole genome sequence

Year:  2019        PMID: 31410079      PMCID: PMC6667099          DOI: 10.3906/biy-1808-97

Source DB:  PubMed          Journal:  Turk J Biol        ISSN: 1300-0152


1. Introduction

Shigella flexneri is a gram-negative, rod-shaped, invasive pathogen for humans and primates that causes inflammation in colonic mucosa (Jennison and Verma, 2004), a causative agent of diarrhea that is frequently bloody. It has been reported as the main cause of endemic shigellosis in developing countries and has resulted in the annual infection of more than 2 million individuals worldwide (Niu et al., 2017). The first line of drugs to treat shigellosis is antibiotics, but due to the occurrence of antibiotic resistance among Shigella spp., it seems that these drugs are getting less effective over time (Ye et al., 2010) . To tackle such an important issue, it is very important to come up with effective new alternatives. Bacteriophage therapy is a promising approach. Bacteriophages are the most common biological entities in the world (Olszak et al., 2017); previous studies have indicated that lytic bacteriophages can control a bacterial population (Wommack and Colwell, 2000). On the other hand, phages that are known as temperate bacteriophages can transfer undesirable genes within a bacterial population, including adhesion and invasion, exotoxin production, and other types of virulence genes (Wagner and Waldor, 2002; Shahin et al., 2018). Previous studies have reported a number of Shigella species and Escherichia coli strains susceptible to lysogenic phages (James et al., 2001). Additionally, antigen conversion by phage in S. flexneri has been reported (Gemski et al., 1975). S. flexneri harbors various bacteriophage-mediated virulence genes on its plasmids and chromosomes (Walker and Verma, 2002). Thus, to avoid transmission of such virulence genes to the bacterial host in a lytic bacteriophage product for the biocontrol of S. flexneri, analyzing the genome sequence for such genes is absolutely essential. vB-SF1S-ISF001, a specific phage for S. flexneri, belongs to the Siphoviridae family. It has been isolated from wastewater; its biological characteristics such as host range, host range, absorption rate, burst size, lytic activity, pH, and thermal and saline stability were reported in our previous study (Shahin and Bouzari, 2018). In the current study, we aimed to sequence the entire genome of the S. flexneri vB-SflS-ISF001 phage and perform a comparative genomic analysis and phylogenic analysis. Additionally, we have evaluated the safety of vB-SflS-ISF001 phage for use as a biocontrol agent by looking for any undesirable genes such as antibiotic resistance, virulence factors, or lysogeny genes.

2. Materials and methods

2.1. Bacterial culture

S. flexneri [Persian Type Culture Collection (PTCC 1234)] was obtained from the Iranian Research Organization for Science and Technology (IROST), Tehran, Iran, and stored at −80 °C. An overnight culture was prepared by adding 50 μL of the thawed stock suspension of the bacterium to 5 mL of brain heart infusion (BHI) broth (Merck, Darmstadt, Germany), and then incubated at 37 °C for 18 h with constant shaking (220 rpm).

2.2. Bacteriophage propagation and concentration

Bacteriophage vB-SflS-ISF001 (Shahin and Bouzari, 2018) was used in this study at a primary titer of 1010 PFU/mL. vB-SflS-ISF001 was propagated using S. flexneri (PTCC 1234) as host according to the method of Sambrook and Russell (2001). One hundred milliliters of sterile BHI broth was inoculated with 1 mL of the overnight culture of the host bacterium and incubated at 37 °C with constant shaking (220 rpm). The biomass production of the host bacterium was routinely checked until it reached an earlylog phase (OD600nm ≈ 0.2), when it was supplemented with 200 μL of the bacteriophage suspension (1010 PFU/mL). The mixture was incubated again at 37 °C for 24 h with constant shaking at 100 rpm. The media was then centrifuged at 10,000 × g for 10 min at 4 °C, and the phagecontaining supernatant was filtered through 0.22 µm syringe filters (Sartorius, Bangalore, India). The phage titer was then determined using the double-layer agar method (Kropinski et al., 2009). A high-titer stock of the phage was prepared using ultracentrifugation in an ultracentrifuge at 105,000 × g, 3 h, and 4 °C (Beckman Optima L-80 XP, TYPE 45 Ti rotor; Beckman Coulter, Brea, CA, USA). The pellet was then resuspended in 1 mL of sterilized SM buffer (100 mM NaCl, 8 mM MgSO4, 2% gelatin, 50 mM Tris-HCl, pH 7.5). This high-titer phage suspension was stored at 4 °C until further use.

2.3. Phage genome extraction and the whole genome sequencing

The genomic DNA of the phage was extracted according to Sambrook and Russell (2001). To remove nonphagerelated DNA and RNA, 10 μg/mL DNase I and RNase I (Sigma, Hong Kong, China) were added to the high-titer phage suspension (750 μL) and incubated for 1 h at 37 °C. Then, 78 μL of 20% SDS and proteinase K (20 mg/mL) (Sigma, Hong Kong, China) were added to the mixture, followed by an overnight incubation at 56 °C. DNA was then precipitated by adding 150 μL of 5 M sodium chloride. Subsequently, an equal volume of phenol/ chloroform/isoamyl alcohol solution was added before centrifugation at 13,000 × g for 10 min. The aqueous phase was collected carefully and remixed with an equal volume of phenol/chloroform/isoamyl alcohol solution before centrifugation at 13,000 × g for 10 min. The aqueous phase was then transferred to a new sterile tube. The phage DNA was precipitated by adding 3 M sodium acetate (one-tenth volume of the aqueous phase) and cold pure ethanol (twice volume of the aqueous phase). The sample was mixed well and incubated overnight at –20 °C before centrifugation at 20,000 × g for 20 min. Finally, the DNA pellet was washed twice with ethanol (70%) and then resuspended in RNaseand DNase-free water (Takara, Shiga, Japan). The phage genome DNA was stored at –20 °C until sequencing. DNA libraries were prepared by DNA fragmentation, adapter ligation, and amplification, and then subjected to the whole-genome DNA sequencing with 2 × 300 bp pairedend reads, carried out by the TGS Company (Shenzhen, China) on an Illumina HiSeq. The sequencing data were assembled using default parameters with SOAPdenovo (v2.04), and the sequence was deposited in DDBJ/EMBL/ GenBank under accession number MG049919.

2.4. Bioinformatic analysis

Open reading frames (ORFs) were predicted with Prokaryotic GeneMark.hmm version 3.25 (http://opal. biology.gatech.edu/genemark/gmhmmp.cgi) (Besemer et al., 2001) , and then were checked manually using the NCBI ORF Finder to confirm the predictions (https://www.ncbi.nlm.nih.gov/orfinder/). Isoelectric pH and molecular weight of translated ORFs and tRNA sequences were predicted using the ExPASy compute pI/Mw tool (http://web.expasy.org/compute_pi/) (Gasteiger et al., 2005) and tRNAscan-SE (Schattner et al., 2005) , respectively. ORF regions were translated to protein sequences using online ExPASy translate tool (http://web.expasy.org/translate/). Basic Local Alignment Search Tool (BLASTp), (https://blast.ncbi.nlm.nih.gov/Blast.cgi), HHpred (https://toolkit.tuebingen.mpg.de/#/tools/hhpred), Pfam (http://pfam.xfam.org/search#tabview=tab1) (Finn et al., 2015) , and InterProScan (http://www.ebi.ac.uk/interpro/search/sequence-search) (Altschul et al., 1997) programs with various protein domain databases were used for comparative analyses of the putative functions and conserved domains of the translated products.

2.5. Comparative genomics

CoreGenes 3.5 (http://gateway.binf.gmu.edu:8080/CoreGenes3.5/) (Turner et al., 2013) was used to find the proteins of vB_SflS-ISF001 that are similar to those of related phages. Mauve was used for the whole genome comparison at a DNA level with other related phages (Darling et al., 2004).

2.6. Phage protein analysis

Phage proteins were analyzed using sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) as previously described (Ghasemi et al., 2014). The high-titer phage suspension (prepared using ultracentrifugation as described above) was mixed with the loading buffer (YEASEN, China) and heated in a boiling water bath for 10 min. Phage suspension (25–30 μL) was introduced to 12% (w/v) SDS-PAGE gel (YEASEN, China), and the separated protein bands were visualized by staining the gel with Coomassie blue G-250. A PageRuler Prestained Protein Ladder (Thermo Scientific, Waltham, MA, USA) was used as the size standard (10 to 180 kDa).

2.7. Phylogenetic analysis

The amino acid sequences of 1 structural ORF (ORF29, the major tail protein) and 1 nonstructural ORF (ORF14, the DNA primase) were selected to construct the phylogenic tree of the vB-SflS-ISF001 phage. The gene sequences of other phages belonging to different genera of Siphoviridae were obtained from GenBank. All sequences were aligned in MEGA 7.0 using MUSCLE, and then the phylogenetic tree was generated using UPGMA (unweighted pair group method with arithmetic mean) with 2000 bootstrap replications (Kumar et al., 2016) . Salmonella phage vB-SPuM-SP116 (accession number: KP010413) was used as the outgroup for both analyses.

3. Results

3.1. Genome characterizations

The whole genome sequencing was performed with 12,290,282 total reads (184,354,300 total bases). The sequencing data assembled using default parameters with SOAPdenovo (v2.04) showed that the dsDNA genome of vB-SflS-ISF001 phage had a 50,552 bp size (coverage > 1000×), a G + C content of 45.58%, and included LTRs of 52 bp in both ends of the genome. Bioinformatic analysis revealed that phage vB-SflS-ISF001 genome contained 78 putative ORFs (19 on the forward strand and 59 on the reverse strand) which are fairly similar to other T1virus members (Table 1). ATG was identified as the only start codon for all ORFs (Table 2). According to BLASTP searches in the GenBank database, the function of 24 ORFs (30.77%) were predicted, and the remaining ORFs (54 ORFs, 69.23%) were considered as hypothetical proteins due to their shared similarities with uncharacterized database entries (Table 2). A different range of identified ORFs from 25% (Shfl1) to 31.8% (SH6) was reported in the phages belonging to the T1 virus genus (Table 1). Among the identified ORFs and detected conserved domains of the vB-SflSISF001 genome, no sequences related to undesirable genes including antibiotic resistance, virulence, lysogenic mediated, or toxin coding genes were found. In addition, no tRNA-encoding sequences were found in the genome (Figure 1 and Table 2). The predicted ORFs of phage vB-SflS-ISF001 were divided into 4 groups according to their function (Figure 1).
Table 1

Comparison of the basic genomic properties of phage vB_SflS-ISF001 and other similar phages.

Shigella phagesEscherichia phages
PropertiesvB_SflS-ISF001SH6Shfl1pSf-2ADB-2JMPW2T1JMPW1
% identity-89899091898988
GC-content45.5845.8345.4145.4445.5545.3845.5545.56
Total/identified ORF78/2482/2680/2083/2479/2580/2477/237823
No. of tRNA00000000
Isolation countryIranCanadaBrazilSouth KoreaIndiaChinaCanadaChina
Accession no.MG049919KX828710HM035024KP085586JX912252KU194205AY216660KU194206
Table 2

Analysis of the predicted ORFs of vB_SflS-ISF001 and their putative functions.

Best match (NCBI database)
ORFsStrandLeftRightStart codonSize (aa)PIMw (Kda)Predicted product [organism]E valueIdentityAccession no
1+209616ATG1359.216166.87Hypothetical protein T1p10 [Escherichia virus T1]7E-9095%YP_003935.1
2+8061021ATG714.788176.19Hypothetical protein B508_00390 [Escherichia phage ADB-2]3E-3583%YP_007112743.1
3-10351436ATG1338.8913972.21Hypothetical protein JMPW1_065 [Escherichia phage JMPW1]1E-7389%ALT58269.1
4-14361924ATG1629.3518135.87Endolysin [Shigella phage SH6]2E-10190%APC44908.1
5-19242139ATG716.067645.94Putative holin [Escherichia virus T1]6E-3690%YP_003932.1
6-24983649ATG3836.5443031.97Hypothetical protein JMPW1_061 [Escherichia phage JMPW1]092%ALT58265.1
7-37284015ATG957.8410788.54Hypothetical protein B508_00365 [Escherichia phage ADB-2]4E-5588%YP_007112738.1
8-42104434ATG749.738432.87Hypothetical protein ISF001_007 [Shigella phage vB_SsoS-ISF002]9E-3988%ASD50891.1
9-44864734ATG824.679713.19Hypothetical protein ISF001_008 [Shigella phage vB_SsoS-ISF002]8E-4590%ASD50892.1
10-47425455ATG2736.8426966.43DNA adenine methyltransferase [Escherichia phage vB_EcoS_SH2]3E-14787%ARW57245.1
11-55235939ATG1388.5915797.96Hypothetical protein ISF001_0010 [Shigella phage vB_SsoS-ISF002]4E-98100%ASD50894.1
12-59367948ATG6706.5975636.28ATP-dependent helicase [Escherichia phage JMPW2]095%ALT58178.2
13+80488500ATG15010.4716913.61Hypothetical protein Shfl1p58 [Shigella virus Shfl1]9E-9390%YP_004414874.1
14+85779497ATG3066.0434833.12DNA primase/helicase [Escherichia phage JMPW1]091%ALT58257.1
15+959811382ATG5944.864226.31Putative tail fiber [Shigella virus Shfl1]087%YP_004414872.1
16-1141111824ATG1377.8715667.35Single-stranded DNA-binding protein [Shigella phage SH6]1E-6278%APC44921.1
17-1187012517ATG2158.5223707.2Putative recombination protein [Shigella phage vB_SsoS-ISF002]8E-12891%ASD50900.1
18-1259213656ATG3545.0239954.22Exodeoxyribonuclease VIII [Shigella phage SH6]093%APC44928.1
19+1418414414ATG768.718399.77Phage lipoprotein [Shigella phage SH6]6E-3884%APC44941.1
20+1441715373ATG3188.0934098.65Hypothetical protein pSf2_021 [Shigella phage pSf-2]094%YP_009112959.1
21-1546818851ATG11274.88125022.94Tail fiber protein [Shigella phage SH6]094%APC44985.1
22-1892919528ATG1999.120875.01Putative tail assembly protein [Escherichia phage ADB-2]9E-13596%YP_007112720.1
23-1952520259ATG2445.7428258.09Putative minor tail protein [Escherichia virus T1]099%YP_003910.1
24-2025621038ATG2608.5228774.74Putative minor tail protein [Escherichia virus T1]2E-17390%YP_003909.1
25-2111821471ATG1174.6413011.49Tail fiber protein [Escherichia phage JMPW1]6E-7287%ALT58245.1
26-2147424347ATG9576.05103770.01Tail length tape measure protein [Escherichia phage JMPW2]094%ALT58162.1
27-2438724656ATG894.2510131.71Hypothetical protein pSf2_028 [Shigella phage pSf-2]7E-4989%YP_009112966.1
28-2470425021ATG1056.7211815.39Hypothetical protein pSf2_029 [ Shigella phage pSf-2]4E-5388%YP_009112967.1
29-2513625804ATG2225.0724090.32Putative major tail protein [Shigella virus Shfl1]2E-14088%YP_004414858.1
30-2580726205ATG1329.0715330.60Hypothetical protein pSf2_031 [Shigella phage pSf-2]3E-7987%YP_009112969.1
31-2619526638ATG1476.9116523.76Hypothetical protein Shfl1p40 [Shigella virus Shfl1]9E-9691%YP_004414856.1
32-2663127002ATG1235.6913904.7Hypothetical protein pSf2_033 [Shigella phage pSf-2]2E-7692%YP_009112971.1
33-2699927412ATG1379.2515542.96Hypothetical protein JMPW2_033 [Escherichia phage JMPW2]8E-7985%ALT58155.2
34-2745527745ATG964.7610348.09Hypothetical protein T1p46 [Escherichia virus T1]2E-4682%YP_003899.1
35-2779528754ATG3196.6135068.32Hypothetical protein Shfl1p36 [Shigella virus Shfl1]093%YP_004414852.1
36-2884729614ATG2554.6526691.97Hypothetical protein Shfl1p35 [Shigella virus Shfl1]1E-14481%YP_004414851.1
37-2967430150ATG1585.5417268.48Hypothetical protein pSf2_038 [Shigella phage pSf-2]9E-8983%YP_009112976.1
38-3016231274ATG3705.3340269.73Major head subunit precursor [Escherichia virus T1]092%YP_003895.1
39-3127732038ATG2539.0928826.61Minor capsid protein [Escherichia phage JMPW1]7E-16390%ALT58231.1
40-3202833311ATG4274.7147760.74Putative portal protein [Shigella virus Shfl1]093%YP_004414847.1
41-3336834936ATG5226.9159967.82Putative terminase large subunit [Shigella virus Shfl1]094%YP_004414846.1
42-3497535499ATG1744.9319287.68Putative terminase small subunit [Escherichia virus T1]3E-11493%YP_003891.1
43-3558435811ATG759.398557.22Hypothetical protein Shfl1p28 [Shigella virus Shfl1]4E-3388%YP_004414844.1
44-3581335998ATG619.577039.22Hypothetical protein JMPW1_022 [Escherichia phage JMPW1]2E-2580%ALT58226.1
45-3597936140ATG539.225894.76Hypothetical protein pSf2_046 [Shigella phage pSf-2]3E-2083%YP_009112984.1
46-3630536508ATG675.077230.12Hypothetical protein B508_00150 [Escherichia phage ADB-2]4E-3484%YP_007112695.1
47-3650836738ATG769.758737.25Hypothetical protein JMPW1_019 [Escherichia phage JMPW1]7E-3988%ALT58223.1
48-3673837082ATG1149.1612972Hypothetical protein B508_00140 [Escherichia phage ADB-2]1E-6587%YP_007112693.1
49-3707937288ATG6948032.77Hypothetical protein B508_00135 [Escherichia phage ADB-2]1E-3383%YP_007112692.1
50-3736137933ATG1905.5521575.59Hypothetical protein T1p62 [Escherichia virus T1]2E-12391%YP_003883.1
51-3804238575ATG1775.9520038.7Putative morphogenetic protein [Escherichia phage ADB-2]6E-11290%YP_007112690.1
52-3865939105ATG1488.5117383.97Hypothetical protein SH6_0017 [Shigella phage SH6]2E-8195%APC44930.1
53-3916339381ATG724.757840.97Hypothetical protein T1p66 [Escherichia virus T1]5E-3383%YP_003878.1
54-3953040168ATG2129.3823844.42Hypothetical protein pSf2_055 [Shigella phage pSf-2]9E-13891%YP_009112993.1
55-4017340460ATG957.8411139.68Hypothetical protein B508_00110 [Escherichia phage ADB-2]7E-5081%YP_007112687.1
56-4053940688ATG497.825667.66Hypothetical protein [Escherichia phage vB_EcoS_SH2]5E-2588%ARW57197.1
57-4068841194ATG1686.9618812.91Hypothetical protein pSf2_059 [Shigella phage pSf-2]3E-8877%YP_009112997.1
58-4126641754ATG1624.4318232.68Hypothetical protein B508_00095 [Escherichia phage ADB-2]2E-9889%YP_007112684.1
59-4182642017ATG634.057383.14Hypothetical protein JMPW2_006 [Escherichia phage JMPW2]3E-2984%ALT58128.1
60-4202742200ATG576.526161.42Hypothetical protein ISF001_0059 [Shigella phage vB_SsoS-ISF002]5E-2486%ASD50943.1
61-4230442531ATG7510.078613.07Hypothetical protein JMPW2_004 [Escherichia phage JMPW2]2E-3068%ALT58126.1
62-4253842768ATG766.548632.82Hypothetical protein Shfl1p05 [Shigella virus Shfl1]5E-3988%YP_004414824.1
63-4284743317ATG1565.5117590.09Hypothetical protein B508_00070 [Escherichia phage ADB-2]4E-7774%YP_007112679.1
64-4332043514ATG645.17308.31Hypothetical protein Shfl1p02 [Shigella virus Shfl1]2E-2986%YP_004414821.1
65-4358643915ATG1095.712370.27Hypothetical protein ISF001_0064 [Shigella phage vB_SsoS-ISF002]2E-6087%ASD50948.1
66-4392844503ATG1914.9921342.21Hypothetical protein JMPW2_001 [Escherichia phage JMPW2]5E-10481%ALT58123.1
67+4520645910ATG2349.1426394.97DNA methylase [Shigella phage SH6]6E-14689%APC44923.1
68+4597146156ATG619.167030.22Hypothetical protein B508_00040 [Escherichia phage ADB-2]2E-2479%YP_007112675.1
69+4617246357ATG616.146920.14Hypothetical protein ISF001_0068 [Shigella phage vB_SsoS-ISF002]9E-2582%ASD50952.1
70+4643346813ATG1264.4914559.40Hypothetical protein ISF001_0069 [Shigella phage vB_SsoS-ISF002]2E-6683%ASD50953.1
71+4681046983ATG578.016695.60Hypothetical protein ISF001_0070 [Shigella phage vB_SsoS-ISF002]4E-2075%ASD50954.1
72+4705547426ATG1234.5113557.28Hypothetical protein JMPW1_074 [Escherichia phage JMPW1]4E-6180%ALT58278.1
73+4741947619ATG666.187624.69Hypothetical protein T1p02 [Escherichia virus T1]2E-2882%YP_003943.1
74+4763747957ATG1069.7112105.10Hypothetical protein pSf2_078 [Shigella phage pSf-2]9E-5681%YP_009113016.1
75+4817448398ATG7410.167966.25Hypothetical protein B508_00015 [Escherichia phage ADB-2]185%YP_007112670.1
76+4840248614ATG703.938105.68Hypothetical protein T1p06 [Escherichia virus T1]1E-2571%YP_003939.1
77+4869549057ATG1209.6213975.26Hypothetical protein B508_00005 [Escherichia phage ADB-2]1E-7090%YP_007112668.1
78+4918850009ATG2735.8930123.19Hypothetical protein pSf2_083 [Shigella phage pSf-2]098%YP_009113021.1
Figure 1

The linear genome map of Shigella flexneri bacteriophage vB-SflS-ISF001 drawn in a circularized format using DNAPlotter (Carver et al., 2009). The 4 circular tracks describe (from inner to outer layers): GC skew [(G – C) / (G + C)], G + C content, ORFs located in negative strand, and ORFs located in positive strand.

Comparison of the basic genomic properties of phage vB_SflS-ISF001 and other similar phages. Analysis of the predicted ORFs of vB_SflS-ISF001 and their putative functions. The linear genome map of Shigella flexneri bacteriophage vB-SflS-ISF001 drawn in a circularized format using DNAPlotter (Carver et al., 2009). The 4 circular tracks describe (from inner to outer layers): GC skew [(G – C) / (G + C)], G + C content, ORFs located in negative strand, and ORFs located in positive strand.

3.1.1. DNA replication, modification, regulation

In this group, ORF12 was the longest ORF (2013 bp, 670 aa), and its predicted protein product shared high similarity with the ATP-dependent helicase from Escherichia phage JMPW2 (95% identity). ORF10 product was predicted as DNA adenine methyltransferase due to 87% similarity (E value: 3E-147) to the DNA adenine methyltransferase of Escherichia phage vB-EcoS-SH2 (accession number: KY985004). ORF14 showed 91% identity to the DNA primase/helicase of Escherichia phage JMPW1. The deduced product of ORF16 displayed 78% similarity (E value: 1E-62) with the single-stranded DNA-binding protein from Shigella phage SH6. The proteins encoded by ORF17, ORF18, and ORF67 matched the putative recombination protein of Shigella phage vB-SsoS-ISF002 (accession number: MF093736), exodeoxyribonuclease VIII of Shigella phage SH6, and DNA methylase of Shigella phage SH6 with 91% (E value: 8E-128), 93%, and 89% (E value: 6E-146) similarity, respectively.

3.1.2. Structure, morphogenesis

ORF21, which was the largest ORF in this group (3384 bp, 1127 aa), encoded a protein similar to the tail fiber protein from Shigella phage SH6 (94%). The protein sequences of products of ORFs 15 and 25 also showed similarity to the tail fiber proteins of Shigella virus Shfl1 (accession number: HM035024) and Escherichia phage JMPW1, with 87% (E value: 0) and 87% (E value: 6E-72) identity, respectively. The predicted proteins of ORFs 23 and 24 showed 100% identity (E value: 2E-173) to the putative minor tail protein of Escherichia virus T1. Moreover, the major tail protein was found to be encoded by ORF29 with 88% identity (E value: 2E-140) to the major tail protein of Shigella virus Shfl1. The predicted proteins of ORFs 22 and 26 were identified as the putative tail assembly protein and tail length tape measure protein due to 96% (E value: 9E-135) and 94% similarity with the putative tail assembly protein of Escherichia phage ADB-2 and tail length tape measure protein of Escherichia phage JMPW2, respectively. ORF38 was predicted to encode the major head subunit precursor, with 92% sequence similarity to the major head subunit precursor of Escherichia virus T1. The predicted protein of ORF39 was identified as the minor capsid protein, displaying 90% similarity (E value: 7E-163) with the minor capsid protein from Escherichia phage JMPW1. The portal protein and morphogenetic protein were found to be encoded by ORFs 40 and 51, respectively. The product of ORF40 showed 93% similarity with the portal protein from Shigella virus Shfl1, and the protein sequence of ORF51 showed 90% similarity (E value: 6E-112) with the putative morphogenetic protein of Escherichia phage ADB-2. Furthermore, the product encoded by ORF19 had 84% similarity (E value: 6E-38) with the phage lipoprotein of Shigella phage SH6.

3.1.3. DNA packaging

Terminase complex is composed of 2 separate gene products of ORFs 41 and 42. The product of ORF41 showed 94% similarity to the putative terminase large subunit from Shigella virus Shfl1 and the protein sequence of ORF42 product shared 93% similarity (E value: 3E-114) to the putative terminase small subunit from Shigella virus Shfl1.

3.1.4. Bacterial cell wall lysis

The product of ORF5 showed 90% similarity (E value: 6E36) to the putative holin of Escherichia virus T1, and the predicted protein of ORF4 showed 90% similarity (E value: 2E-101) to endolysin from Shigella phage SH6.

3.2. Comparative genomics analysis

A MegaBLAST search of the phage genome indicated that vB-SflS-ISF001 had 88%–91% sequence similarity with Shigella and Escherichia phages (Table 1). CoreGene analysis demonstrated that vB-SflS-ISF001 shared similarity to 50 proteins of other related phages (score >70), including 22 known (2 bacterial cell wall lysis, 7 DNA replication, modification, regulation protein, 11 structural, and 2 DNA packaging proteins) and 38 hypothetical proteins (Table 3). These amino acid coding sequences were not restricted to any particular region or functional group of genes and were distributed over the phage genome. Moreover, comparison of the genome sequence of phage vB-SflS-ISF001 with other members of the T1virus genus demonstrated that vB-SflS-ISF001 genome sequence, organization, and ORF orientations were generally similar to other members of the genus T1virus (Figure 2).
Table 3

Conserved proteins of vB_SflS-ISF001 phage shared with related phages (SH6, Shfl1, ADB-2, JMPW2) as determined by CoreGenes.

ProductRelated phages*
vB_SflS-ISF001JMPW2ADB-2Shfl1SH6
1Hypothetical proteinATN94079.1ALT58192.1AFV50974.1AEA72948.1APC44945.1
2Hypothetical proteinATN94081.1ALT58190.1AFV50972.1AEA72947.1APC44951.1
3EndolysinATN94082.1ALT58189.1AFV50971.1AEA72946.1APC44908.1
4HolinATN94083.1ALT58188.1AFV50970.1AEA72945.1APC44968.1
5Hypothetical proteinATN94084.1ALT58185.1AFV50969.1AEA72943.1APC44977.1
6Hypothetical proteinATN94085.1ALT58184.2AFV50968.1AEA72942.1APC44932.1
7Hypothetical proteinATN94086.1ALT58183.2AFV50967.1AEA72941.1APC44907.1
8Hypothetical proteinATN94087.1ALT58182.1AFV50966.1AEA72940.1APC44946.1
9DNA methylaseATN94088.1ALT58180.1AFV50965.1AEA72939.1APC44914.1
10Hypothetical proteinATN94089.1ALT58179.1AFV50964.1AEA72938.1APC44943.1
11ATP-dependent helicaseATN94090.1ALT58178.2AFV50962.1AEA72937.1APC44976.1
12Hypothetical proteinATN94091.1ALT58177.1AFV50961.1AEA72936.1APC44936.1
13Putative DNA primaseATN94092.1ALT58176.1AFV50960.1AEA72935.1APC44959.1
14Tail fiber proteinATN94093.1ALT58175.1AFV50959.1AEA72934.1APC44917.1
15Single-stranded DNA-binding proteinATN94094.1ALT58174.1AFV50958.1AEA72933.1APC44921.1
16RecombinationATN94095.1ALT58173.1AFV50957.1AEA72932.1APC44939.1
17Tail fiber proteinATN94099.1ALT58167.1AFV50951.1AEA72928.1APC44985.1
18Tail assembly proteinATN94100.1ALT58166.1AFV50950.1AEA72927.1APC44963.1
19Minor tail proteinATN94101.1ALT58165.1AFV50949.1AEA72926.1APC44919.1
20Minor tail proteinATN94102.1ALT58164.2AFV50948.1AEA72925.1APC44909.1
21Minor tail proteinATN94103.1ALT58163.1AFV50947.1AEA72924.1APC44974.1
22Tail tape measure proteinATN94104.1ALT58162.1AFV50946.1AEA72923.1APC44947.1
23Tape measure chaperoneATN94105.1ALT58161.2AFV50945.1AEA72922.1APC44924.1:
24Hypothetical proteinATN94106.1ALT58160.1AFV50944.1AEA72921.1APC44958.1
25Major tail proteinATN94107.1ALT58159.1AFV50942.1AEA72920.1APC44938.1
26Hypothetical proteinATN94108.1ALT58158.1AFV50941.1AEA72919.1APC44925.1
27Hypothetical proteinATN94109.1ALT58157.2AFV50940.1AEA72918.1APC44961.1
28Hypothetical proteinATN94111.1ALT58155.2AFV50939.1AEA72916.1APC44912.1
29Hypothetical proteinATN94112.1ALT58154.1AFV50938.1AEA72915.1APC44965.1
30Hypothetical proteinATN94113.1ALT58153.1AFV50937.1AEA72914.1APC44931.1
31Hypothetical proteinATN94114.1ALT58152.1AFV50936.1AEA72913.1APC44983.1
32Hypothetical proteinATN94115.1ALT58151.1AFV50935.1AEA72912.1APC44955.1
33Major capsid proteinATN94116.1ALT58150.1AFV50934.1AEA72911.1APC44972.1
34Minor capsid proteinATN94117.1ALT58149.1AFV50933.1AEA72910.1APC44922.1
35Portal proteinATN94118.1ALT58148.1AFV50931.1AEA72909.1APC44942.1
36Terminase large subunitATN94119.1ALT58147.1AFV50930.1AEA72908.1APC44953.1
37Terminase small subunitATN94120.1ALT58146.2AFV50928.1AEA72907.1APC44944.1
38Hypothetical proteinATN94121.1ALT58145.1AFV50927.1AEA72906.1APC44934.1
39Hypothetical proteinATN94122.1ALT58144.1AFV50926.1AEA72905.1APC44962.1
40Hypothetical proteinATN94124.1ALT58142.1AFV50925.1AEA72903.1APC44940.1
41Hypothetical proteinATN94125.1ALT58141.1AFV50924.1AEA72902.1APC44948.1
42Hypothetical proteinATN94126.1ALT58140.1AFV50923.1AEA72901.1APC44950.1
43Hypothetical proteinATN94127.1ALT58139.1AFV50922.1AEA72900.1APC44980.1
44KinaseATN94128.1ALT58138.1AFV50921.1AEA72899.1APC44910.1
45Hypothetical proteinATN94129.1ALT58136.1AFV50920.1AEA72898.1APC44960.1
46Hypothetical proteinATN94130.1ALT58135.2AFV50919.1AEA72896.1APC44930.1
47Hypothetical proteinATN94132.1ALT58133.1AFV50918.1AEA72895.1APC44988.1
48Hypothetical proteinATN94133.1ALT58132.1AFV50917.1AEA72894.1APC44913.1
49Hypothetical proteinATN94135.1ALT58130.1AFV50915.1AEA72892.1APC44984.1
40Hypothetical proteinATN94136.1ALT58129.1AFV50914.1AEA72891.1APC44973.1
41Hypothetical proteinATN94137.1ALT58128.1AFV50913.1AEA72889.1APC44981.1
42Hypothetical proteinATN94139.1ALT58126.1AFV50912.1AEA72887.1APC44911.1
43Hypothetical proteinATN94140.1ALT58125.1AFV50911.1AEA72885.1APC44957.1
44Hypothetical proteinATN94144.1ALT58123.1AFV50906.1AEA72882.1APC44978.1
45Hypothetical proteinATN94151.1ALT58200.1AFV50902.1AEA72955.1APC44926.1
46Hypothetical proteinATN94152.1ALT58199.2AFV50901.1AEA72954.1APC44935.1
47Hypothetical proteinATN94153.1ALT58197.1AFV50900.1AEA72952.1APC44933.1
48Hypothetical proteinATN94154.1ALT58196.1AFV50899.1AEA72951.1APC44952.1
49Hypothetical proteinATN94155.1ALT58195.1AFV50898.1AEA72950.1APC44956.1
50Hypothetical proteinATN94156.1ALT58193.1AFV50975.1AEA72949.1APC44969.1

*Data presented in these columns are accession numbers for each individual protein of each phage.

Figure 2

Alignment of the genome of S. flexneri bacteriophage vB-SflS-ISF001 with others of the genus T1virus using Mauve. Names of the bacteriophages are mentioned under their maps line. Colored blocks indicate corresponding regions of nucleotide similarity, while colorless blocks correspond to dissimilar regions.

Conserved proteins of vB_SflS-ISF001 phage shared with related phages (SH6, Shfl1, ADB-2, JMPW2) as determined by CoreGenes. *Data presented in these columns are accession numbers for each individual protein of each phage. Alignment of the genome of S. flexneri bacteriophage vB-SflS-ISF001 with others of the genus T1virus using Mauve. Names of the bacteriophages are mentioned under their maps line. Colored blocks indicate corresponding regions of nucleotide similarity, while colorless blocks correspond to dissimilar regions.

3.3. Phylogenetic position of vB-SflS-ISF001

The constructed phylogenetic tree using the major tail protein and the DNA primase revealed that vB-SflSISF001 had homology to genus T1virus phages (Shigella phage SH6, Shigella phage Shfl1, Shigella phage pSf-2, Escherichia phage ADB-2, Escherichia phage JMPW2, Enterobacteria phage T1, and Escherichia phage JMPW1) (Figure 3). Based on the UPGMA dendrograms, vB-SflSISF001, a Shigella flexneri phage, can be classified as a new species in the genus T1virus of the subfamily Tunavirinae (Figure ).
Figure 3

Phylogenetic relationship of S. flexneri bacteriophage vB-SflS-ISF001. Phylogenetic trees were constructed based on the amino acid sequence of the major tail (A) and the DNA primase (B) using the UPGMA method with 2000 bootstrap replications. The numbers on the lines show the supporting rates.

Phylogenetic relationship of S. flexneri bacteriophage vB-SflS-ISF001. Phylogenetic trees were constructed based on the amino acid sequence of the major tail (A) and the DNA primase (B) using the UPGMA method with 2000 bootstrap replications. The numbers on the lines show the supporting rates.

3.4. Analysis of vB-SflS-ISF001 structural proteins

To further characterize vB-SflS-ISF001, the high-titer phage suspension was subjected to 12% (w/v) SDS-PAGE gel. As shown in Figure 4, at least 11 individual protein bands with molecular masses ranging from 13 to 103.7 kDa were detected. In addition, each of the bands was attributed to one of the predicted structural proteins of phage vB-SflS-ISF001 based on their molecular weights (Figure 4).
Figure 4

SDS-PAGE analysis of the S. flexneri bacteriophage vB-SflS-ISF001. Lane M, Page Ruler TM Prestained Protein Ladder 26616 (Thermo Scientific, Waltham, MA, USA). The predicted ORFs products related to each band are presented on the left side.

SDS-PAGE analysis of the S. flexneri bacteriophage vB-SflS-ISF001. Lane M, Page Ruler TM Prestained Protein Ladder 26616 (Thermo Scientific, Waltham, MA, USA). The predicted ORFs products related to each band are presented on the left side.

4. Discussion

Shigella is one of the most important groups of Enterobacteriaceae which cause enteric infections (Zhang et al., 2013) . With the emergence of resistant strains, phage therapy has been introduced as an alternative method and a new generation of antibacterial agents. A candidate phage must be analyzed thoroughly before its use in phage therapy (Shahin et al., 2018) . Therefore, the current study aimed to perform a comparative genomic analysis and phylogenic analysis, and look for any sequences related to antibiotic resistance, bacterial virulence factor, or phage lysogeny genes. According to whole genome sequencing and bioinformatic analysis, the most and the least similarity between the ORFs of vB-SflS-ISF001 and other T1virus phages were observed in SH6 and SH2, respectively. Six out of 24 ORFs (ORFs 4, 16, 18, 19, 21, and 67), and 1 out of 24 ORFs (ORF10) of vB-SflS-ISF001 had similarity to ORFs of SH6 and SH2, respectively. In the DNA replication, modification, and regulation group of genes, the function of 7 ORFs were predicted due to their similarity to JMPW2 (1 ORF), vB-EcoS-SH2 (1 ORF), JMPW1 (1 ORF), SH6 (3 ORF), and vB-SsoS-ISF002 (1 ORF). DNA primase/ helicase, which plays a regulatory role in the bacteriophage DNA replication process, is encoded by ORF 14 (Shen et al., 2016) . In the structure and morphogenesis group of genes, the function of 13 ORFs were predicted due to their similarity to JMPW2 (1 ORF), vB-EcoS-SH2 (1 ORF), JMPW1 (2 ORF), SH6 (1 ORF), T1 (3 ORF), Shfl1 (3 ORF), and ADB-2 (2 ORF). Terminases are phage-encoded endonuclease enzymes with ATPase activity that act in the headful DNA packaging process during phage assembly (Hamdi et al., 2017). This enzyme, which was classified in the DNA packaging group, is composed of 2 separate units: the small subunit (ORF41) and the large subunit (ORF42). Double-strand DNA (dsDNA) phages employ the holin–endolysin complex to destroy bacterial host cells. In the genome of vB-SflS-ISF001, ORFs 4 (endolysin) and 5 (holin) were predicted to encode this complex. Holins are hydrophobic proteins that produce holes in the bacterial cytoplasmic membrane by oligomerization and ease the access of endolysins to the cell wall (Fernandes and São‐ José, 2016) . In contrast, endolysins have a crucial role in cleaving the peptidoglycan (murein), the main part of the bacterial cell wall structure (Fernandes and São‐José, 2016). Furthermore, the position of predicted ORFs of the lysis group was similar with those of other Siphoviridae phages (Escherichia virus T1, Escherichia phage JMPW1, Shigella phage SH6, Escherichia phage ADB-2, Shigella phage pSf-2, and Shigella virus Shfl1), which were located at the right or left end of the genome (Roberts et al., 2004; Bhensdadia et al., 2013; Jun et al., 2016; Shen et al., 2016; Hamdi et al., 2017) . Among the identified ORFs and detected conserved domains of the vB-SflS-ISF001 genome, no sequences related to undesirable genes including antibiotic resistance, virulence, or lysogenic mediated or toxin-coding genes were found. Therefore, vB-SflS-ISF001 can be considered a safe agent for biocontrol applications. Additionally, as with other T1virus phages, no tRNA-encoding sequences were identified in the genome of vB-SflS-ISF001. Genomic comparison showed that the organization, orientations, and distribution of the ORFs were generally similar to those of other members of the genus T1virus. Moreover, MegaBLAST analysis and UPGMA dendrograms revealed that vB-SflS-ISF001 can be classified as a new member of the genus T1virus, subfamily Tunavirinae. In conclusion, in the current study, genomic characteristics of Shigella flexneri phage vB-SflS-ISF001 were comparatively analyzed. Phage vB-SflS-ISF001 genome is a dsDNA (50,552 bp) with 45.58% G + C content. Seventy-eight distinct ORFs and no tRNA were predicted in the vB-SflS-ISF001 genome. Comparative genomic analysis of vB-SflS-ISF001 demonstrated that this phage could be classified as a new species in the genus T1virus of the subfamily Tunavirinae. Moreover, no undesirable genes, e.g., antibiotic resistance, virulence, lysogenic mediated genes, or toxin-coding genes, were found in the vB-SflS-ISF001 genome sequence. Phylogenetic analysis (based on major tail and DNA primase) of vB-SflS-ISF001 showed a high similarity to other T1virus species, and was further validated through genome and comparative genomic analyses, which not only constitute a much more accurate classification approach, but also a powerful methodology to investigate and certify the safety of phages for potential application as biocontrol agents. Therefore, the data suggest that vB-SflS-ISF001 can be used as a safe agent for phage therapy.

Acknowledgments

This research was funded by an operating grant of the Dean of Research and Graduate Studies at the University of Isfahan (No: A/94/32650) and Jiangsu Agricultural Science and Technology Foundation (No. CX[16]1060).
  28 in total

Review 1.  Virioplankton: viruses in aquatic ecosystems.

Authors:  K E Wommack; R R Colwell
Journal:  Microbiol Mol Biol Rev       Date:  2000-03       Impact factor: 11.056

2.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors:  J Besemer; A Lomsadze; M Borodovsky
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

Review 3.  Bacteriophage control of bacterial virulence.

Authors:  Patrick L Wagner; Matthew K Waldor
Journal:  Infect Immun       Date:  2002-08       Impact factor: 3.441

4.  Mauve: multiple alignment of conserved genomic sequence with rearrangements.

Authors:  Aaron C E Darling; Bob Mau; Frederick R Blattner; Nicole T Perna
Journal:  Genome Res       Date:  2004-07       Impact factor: 9.043

5.  The genome and proteome of coliphage T1.

Authors:  Mary D Roberts; Nancy L Martin; Andrew M Kropinski
Journal:  Virology       Date:  2004-01-05       Impact factor: 3.616

6.  Phage conversion of Shigella flexneri group antigens.

Authors:  P Gemski; D E Koeltzow; S B Formal
Journal:  Infect Immun       Date:  1975-04       Impact factor: 3.441

7.  Lytic and lysogenic infection of diverse Escherichia coli and Shigella strains with a verocytotoxigenic bacteriophage.

Authors:  C E James; K N Stanley; H E Allison; H J Flint; C S Stewart; R J Sharp; J R Saunders; A J McCarthy
Journal:  Appl Environ Microbiol       Date:  2001-09       Impact factor: 4.792

8.  Identification of a putative pathogenicity island in Shigella flexneri using subtractive hybridisation of the S. flexneri and Escherichia coli genomes.

Authors:  John C Walker; Naresh K Verma
Journal:  FEMS Microbiol Lett       Date:  2002-08-06       Impact factor: 2.742

Review 9.  Shigella flexneri infection: pathogenesis and vaccine development.

Authors:  Amy V Jennison; Naresh K Verma
Journal:  FEMS Microbiol Rev       Date:  2004-02       Impact factor: 16.408

10.  The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs.

Authors:  Peter Schattner; Angela N Brooks; Todd M Lowe
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

View more
  2 in total

1.  Isolation, phenotypic characterization and comparative genomic analysis of 2019SD1, a polyvalent enterobacteria phage.

Authors:  Prince Kumar; Mukesh K Meghvansi; D V Kamboj
Journal:  Sci Rep       Date:  2021-11-12       Impact factor: 4.379

2.  A Novel Bacteriocin Against Shigella flexneri From Lactiplantibacillus plantarum Isolated From Tilapia Intestine: Purification, Antibacterial Properties and Antibiofilm Activity.

Authors:  Yu-Hang Jiang; Wei-Gang Xin; Qi-Lin Zhang; Lian-Bing Lin; Xian-Yu Deng
Journal:  Front Microbiol       Date:  2022-01-05       Impact factor: 5.640

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.