| Literature DB >> 32899836 |
Nikoline S Olsen1,2, Laura Forero-Junco2, Witold Kot2, Lars H Hansen2.
Abstract
Phages drive bacterial diversity, profoundly influencing microbial communities, from microbiomes to the drivers of global biogeochemical cycling. Aiming to broaden our understanding of Escherichia coli (MG1655, K-12) phages, we screened 188 Danish wastewater samples and isolated 136 phages. Ninety-two of these have genomic sequences with less than 95% similarity to known phages, while most map to existing genera several represent novel lineages. The isolated phages are highly diverse, estimated to represent roughly one-third of the true diversity of culturable virulent dsDNA Escherichia phages in Danish wastewater, yet almost half (40%) are not represented in metagenomic databases, emphasising the importance of isolating phages to uncover diversity. Seven viral families, Myoviridae, Siphoviridae, Podoviridae, Drexlerviridae, Chaseviridae, Autographviridae, and Microviridae, are represented in the dataset. Their genomes vary drastically in length from 5.3 kb to 170.8 kb, with a guanine and cytosine (GC) content ranging from 35.3% to 60.0%. Hence, even for a model host bacterium, substantial diversity remains to be uncovered. These results expand and underline the range of coliphage diversity and demonstrate how far we are from fully disclosing phage diversity and ecology.Entities:
Keywords: Escherichia coli; bacteriophage; coliphage; diversity; genomics; taxonomy; wastewater
Mesh:
Substances:
Year: 2020 PMID: 32899836 PMCID: PMC7552041 DOI: 10.3390/v12090986
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
List of 104 unique Escherichia phage species identified in 94 Danish wastewater samples. Phages (n) denotes the number of phages isolated with more than 95% nucleotide sequence similarity. Similarity is sequence identity (%) times sequence coverage (%) to closest relative (Blastn). Taxonomy is based on similarity (BLASTn) to closest related.
| Phage | Accession | Phages ( | Genome (bp) | ORFs | tRNAs | GC (%) | Family; Genus | Similarity (%) | Closest Relative | Accession |
|---|---|---|---|---|---|---|---|---|---|---|
| tootiki | MN850647 | 1 | 88,257 | 128 | 22 | 39 |
| 90.2 | Escherichia phage vB_EcoM_Alf5 | NC_031082.1 |
| mio | MN850631 | 1 | 83,431 | 121 | 18 | 39.1 |
| 89.7 | Salmonella virus VSe11 | MG251391.1 |
| allfine | MN850633 | 1 | 86,963 | 125 | 20 | 39 |
| 91.2 | Escherichia phage vB_EcoM-AYO145A | NC_028825.1 |
| bumzen | MN850635 | 3 | 87,360 | 126 | 20 | 39.1 |
| 92.5 | Escherichia phage vB_EcoM_Alf5 | NC_031082.1 |
| dune | MN850636 | 1 | 88,511 | 129 | 20 | 39 |
| 91.5 | Escherichia phage vB_EcoM_VpaE1 | NC_027337.1 |
| warpig | MN850637 | 1 | 86,106 | 127 | 17 | 39 |
| 93 | Escherichia phage vB_EcoM_VpaE1 | NC_027337.1 |
| radambza | MN850639 | 1 | 86,702 | 127 | 19 | 38.9 |
| 91.6 | Escherichia phage vB_EcoM_VpaE1 | NC_027337.1 |
| ekra | MN850644 | 1 | 87,282 | 128 | 20 | 38.9 |
| 92.9 | Escherichia phage vB_EcoM_Alf5 | NC_031082.1 |
| humlepung | MN850564 | 3 | 85,311 | 119 | 19 | 39.1 |
| 92.1 | Escherichia phage vB_EcoM_VpaE1 | NC_027337.1 |
| finno | MN850619 | 1 | 87,554 | 129 | 20 | 38.9 |
| 89.7 | Escherichia phage vB_EcoM-AYO145A | NC_028825.1 |
| garuso | MN850566 | 2 | 85,798 | 130 | 20 | 38.9 |
| 90.9 | Escherichia phage vB_EcoM-AYO145A | NC_028825.1 |
| momo | MN850580 | 1 | 88,168 | 130 | 20 | 39 |
| 90.7 | Escherichia phage vB_EcoM-AYO145A | NC_028825.1 |
| heid | MN850577 |
| 87,590 | 126 | 20 | 39 |
| 91.2 | Escherichia phage vB_EcoM_Alf5 | NC_031082.1 |
| skuden | MN850585 | 1 | 87,263 | 131 | 20 | 38.9 |
| 91.1 | Escherichia phage vB_EcoM_VpaE1 | NC_027337.1 |
| pinkbiff | MN850603 | 1 | 88,814 | 129 | 20 | 39 |
| 93.9 | Escherichia phage vB_EcoM_Alf5 | NC_031082.1 |
| fjerdesal | MN850605 | 3 | 87,715 | 128 | 21 | 39 |
| 90.6 | Escherichia phage vB_EcoM_AYO145A | NC_028825.1 |
| andreotti | MN850610 | 1 | 83,391 | 117 | 20 | 39.2 |
| 91.9 | Escherichia phage vB_EcoM_VpaE1 | NC_027337.1 |
| nataliec | MN850611 | 2 | 89,137 | 134 | 20 | 39 |
| 90.3 | Escherichia phage vB_EcoM_AYO145A | NC_028825.1 |
| adrianh | MN850614 | 2 | 88,226 | 128 | 19 | 38.9 |
| 91.1 | Escherichia phage vB_EcoM_Alf5 | NC_031082.1 |
| mistaenkt | MN850587 | 1 | 86,664 | 128 | 22 | 47.2 |
| 91.1 | Escherichia phage SUSP2 | NC_028935.2 |
| nimi | MN850626 | 1 | 137,039 | 213 | 5 | 43.7 |
| 93.3 | Escherichia phage LL12 | MH491969.1 |
| navn | MN850642 | 1 | 141,707 | 224 | 4 | 43.6 |
| 91.1 | Escherichia coli O157 typing phage 4 | KP869102.1 |
| nomine | MN850649 | 1 | 137,991 | 220 | 5 | 43.6 |
| 91.5 | Escherichia phage LL12 | MH491969.1 |
| naswa | MN850595 | 1 | 138,583 | 222 | 5 | 43.6 |
| 93.1 | Escherichia phage LL12 | MH491969.1 |
| naam | MN850630 | 1 | 137,129 | 215 | 5 | 43.7 |
| 94.5 | Escherichia coli O157 typing phage 4 | KP869102.1 |
| ime | MN850576 | 2 | 137,114 | 217 | 5 | 43.6 |
| 93.1 | Escherichia phage LL12 | MH491969.1 |
| magaca | MN850612 | 1 | 135,826 | 217 | 5 | 43.6 |
| 96 | Escherichia phage slur12 | LN881735.1 |
| nom | MN850646 | 1 | 136,114 | 213 | 5 | 43.6 |
| 92.6 | Escherichia phage LL12 | MH491969.1 |
| isim | MN850597 | 1 | 138,289 | 219 | 5 | 43.6 |
| 93.8 | Escherichia phage LL12 | MH491969.1 |
| nomo | MN850578 | 1 | 137,702 | 218 | 5 | 43.7 |
| 93.3 | Escherichia phage APCEc02 | NC_041869.1 |
| inoa | MN850593 | 1 | 138,710 | 220 | 5 | 43,6 |
| 92 | Escherichia phage APCEc02 | NC_041869.1 |
| pangalan | MN850627 | 3 | 136,944 | 215 | 5 | 43.7 |
| 94.8 | Escherichia phage vB_EcoM_FFH2 | NC_024134.1 |
| tuntematon | MN850618 | 2 | 150,473 | 279 | 11 | 39.1 |
| 89.6 | Escherichia phage phAPEC8 | NC_020079.1 |
| anhysbys | MN850648 | 1 | 149,335 | 271 | 11 | 39.1 |
| 91.5 | Escherichia phage phAPEC8 | NC_020079.1 |
| ukendt | MN850565 | 1 | 150,947 | 266 | 11 | 39 |
| 88.7 | Escherichia phage phAPEC8 | NC_020079.1 |
| nepoznato | MN850571 | 4 | 151,514 | 265 | 10 | 38.9 |
| 85.6 | Escherichia phage phAPEC8 | NC_020079.1 |
| nieznany | MN850598 | 1 | 144,998 | 254 | 11 | 39.1 |
| 88.9 | Escherichia phage phAPEC8 | NC_020079.1 |
| muut | MN850573 | 1 | 146,307 | 243 | 13 | 37.4 |
| 92 | Escherichia phage vB_EcoM_PHB05 | MF805809.1 |
| alia | MN850632 | 1 | 147,009 | 246 | 13 | 37.5 |
| 93.1 | Enterobacteria phage ECGD1 | KU522583.1 |
| outra | MN850645 | 1 | 145,482 | 246 | 13 | 37.4 |
| 93.8 | Enterobacteria phage ECGD1 | KU522583.1 |
| inny | MN850601 | 1 | 147,483 | 247 | 13 | 37.4 |
| 92.4 | Enterobacteria phage ECGD1 | KU522583.1 |
| arall | MN850584 | 1 | 145,715 | 242 | 13 | 37.4 |
| 94.6 | Escherichia phage vB_vPM_PD06 | MH816848.1 |
| kvi | MN850615 | 1 | 163,673 | 266 | - | 40.5 |
| 94.2 | Escherichia phage ECD7 | NC_041936.1 |
| kaaroe | MN850574 | 1 | 163,719 | 267 | - | 40.5 |
| 94.7 | Enterobacteria phage RB49 | NC_005066.1 |
| dhabil | MN850621 | 1 | 165,644 | 266 | 3 | 39.5 |
| 87.5 | Enterobacteria phage JS10 | NC_012741.1 |
| dhaeg | MN850609 | 2 | 170,817 | 278 | 3 | 39.4 |
| 87.4 | Enterobacteria phage JS10 | NC_012741.1 |
| mogra | MN850579 | 1 | 168,724 | 263 | 2 | 37.7 |
| 91.1 | Escherichia phage vB_EcoM_PhAPEC2 | NC_024794.1 |
| mobillu | MN850622 | 1 | 163,063 | 255 | 2 | 37.7 |
| 94.5 | Escherichia phage p000y | MK047718.1 |
| moha | MN850590 | 1 | 168,676 | 267 | 2 | 37.6 |
| 94.8 | Escherichia phage APCEc01 | NC_029091.1 |
| moskry | MN850651 | 1 | 169,410 | 269 | 2 | 37.6 |
| 93.2 | Escherichia virus vB_Eco_mar005P1 | LR027383.1 |
| teqskov | MN895437 | 1 | 165,017 | 257 | 6 | 35.4 |
| 91.7 | Yersinia phage phiD1 | NC_027353.1 |
| teqdroes | MN895438 | 1 | 166,833 | 269 | 10 | 35.4 |
| 88.6 | Escherichia phage T2 | LC348380.1 |
| teqhad | MN895434 | 1 | 167,892 | 270 | 10 | 35.3 |
| 90.1 | Escherichia phage T2 | LC348380.1 |
| teqhal | MN895435 | 2 | 168,070 | 266 | 11 | 35.4 |
| 93.9 | Escherichia phage slur13 | LN881737.1 |
| teqsoen | MN895436 | 1 | 166,468 | 268 | 10 | 35.5 |
| 91.7 | Yersinia phage phiD1 | NC_027353.1 |
| flopper | MN850594 | 1 | 52,092 | 78 | 1 | 44.2 |
| 87 | Escherichia phage ST32 | NC_047830.1 |
| damhaus | MN850602 | 1 | 51,154 | 89 | - | 44.1 |
| 85.8 | Shigella phage pSf-1 | NC_021331.1 |
| herni | MN850640 | 2 | 50,971 | 89 | - | 44.1 |
| 87.6 | Shigella phage pSf-1 | NC_021331.1 |
| grams | MN850567 | 1 | 49,530 | 83 | - | 44.1 |
| 87.1 | Shigella phage pSf-1 | NC_021331.1 |
| aaroes | MN850572 | 1 | 51,662 | 92 | - | 44.1 |
| 83 | Shigella phage pSf-1 | NC_021331.1 |
| aalborv | MN850591 | 1 | 46,660 | 79 | - | 43.9 |
| 86.9 | Shigella phage pSf-1 | NC_021331.1 |
| haarsle | MN850600 | 2 | 48,613 | 85 | - | 44 |
| 87.1 | Shigella phage pSf-1 | NC_021331.1 |
| egaa | MN850607 | 1 | 51,643 | 87 | - | 44.1 |
| 89.7 | Shigella phage pSf-1 | NC_021331.1 |
| vojen | MN850569 | 1 | 50,709 | 86 | - | 44.1 |
| 89.7 | Shigella phage pSf-1 | NC_021331.1 |
| tiwna | MN850643 | 1 | 51,014 | 85 | - | 44.6 |
| 87.2 | Escherichia phage vB_Eco_Swan01 | NC_048202.1 |
| tonijn | MN850641 | 2 | 51,627 | 86 | - | 44.6 |
| 88.4 | Escherichia phage vB_Eco_Swan01 | NC_048202.1 |
| tonnikala | MN850613 | 1 | 51,277 | 86 | - | 44.8 |
| 86.4 | Escherichia phage vB_Eco_Swan01 | NC_048202.1 |
| atuna | MN850620 | 1 | 50,732 | 88 | - | 44.6 |
| 84.9 | Escherichia virus vB_Eco_mar001J1 | NC_048204 |
| tunus | MN850638 | 1 | 51,111 | 87 | - | 44.8 |
| 93.7 | Escherichia phage SECphi27 | NC_047938.1 |
| orkinos | MN850586 | 2 | 49,798 | 81 | - | 44.6 |
| 91.3 | Escherichia phage SECphi27 | NC_047938.1 |
| ityhuna | MN850582 | 1 | 50,768 | 86 | - | 44.7 |
| 93.3 | Escherichia phage SECphi27 | NC_047938.1 |
| tonn | MN850596 | 2 | 51,012 | 87 | - | 44.5 |
| 94 | Escherichia phage vB_Eco_Swan01 | NC_048202.1 |
| tinuso | MN850634 | 1 | 50,856 | 86 | - | 44.8 |
| 97.3 | Escherichia phage vB_Eco_Swan01 | NC_048202.1 |
| tunzivis | MN850604 | 1 | 50,596 | 84 | - | 44.6 |
| 94.5 | Escherichia phage SECphi27 | NC_047938.1 |
| tuinn | MN850606 | 1 | 50,505 | 86 | - | 44.7 |
| 94.8 | Escherichia phage vB_Eco_Swan01 | NC_048202.1 |
| Jahat | MK552105 | 1 | 51,101 | 87 | - | 45.7 |
| 68.5 | Escherichia phage vB_Eco_Swan01 | NC_048202.1 |
| bob | MN850628 | 1 | 45,252 | 63 | - | 54.5 |
| 88.6 | Escherichia phage SECphi18 | LT960609.1 |
| mckay | MN850629 | 1 | 44,443 | 63 | - | 54.5 |
| 83.8 | Escherichia phage slur05 | NC_028901.1 |
| jat | MN850650 | 1 | 44,417 | 63 | - | 54.5 |
| 89.4 | Escherichia phage Gluttony | NC_031113.1 |
| rolling | MN850575 | 1 | 46,017 | 64 | - | 54.2 |
| 80.2 | Escherichia phage Sloth | KX534339.1 |
| welsh | MN850589 | 2 | 45,207 | 62 | - | 54.6 |
| 83.8 | Escherichia phage B2 | KX534339.1 |
| buks | MN850616 | 1 | 40,308 | 62 | - | 49.7 |
| 91.3 | Salmonella phage vB_SenS-Ent1 | NC_019539.1 |
| Skure | MK672798 | 1 | 59,474 | 92 | - | 44.6 |
| 90.4 | Escherichia phage vB_Eco_SLUR25 | LT907986.1 |
| Halfdan | MH362766 | 1 | 42,858 | 57 | - | 53.7 |
| 28.8 | Pseudomonas phage vB_PaeS_SCUT-S3 | MK165657.1 |
| Lilleen | MK629526 | 1 | 5342 | 6 | - | 46.9 |
| 93.8 | Escherichia phage SECphi17 | LT960607.1 |
| Lilleput | MK629525 | 1 | 5490 | 6 | - | 47 |
| 93.4 | Escherichia phage SECphi17 | LT960607.1 |
| Lilleto | MK629529 | 3 | 5492 | 6 | - | 46.8 |
| 92.7 | Escherichia phage SECphi17 | LT960607.1 |
| Lilledu | MK791318 | 1 | 5483 | 6 | - | 47.2 |
| 92.6 | Escherichia phage SECphi17 | LT960607.1 |
| lillemer | MN850599 | 1 | 5492 | 6 | - | 47.1 |
| 94.5 | Escherichia phage SECphi17 | LT960607.1 |
| Lilleven | MK629527 | 1 | 6090 | 9 | - | 44.4 |
| 93.9 | Enterobacteria phage St-1 | NC_012868.1 |
| sortsyn | MN850623 | 1 | 42,116 | 61 | - | 59 |
| 92.3 | Enterobacteria phage IME_EC2 | KF591601.1 |
| sortregn | MN850588 | 1 | 38,200 | 53 | - | 59.3 |
| 97.3 | Salmonella phage Lumpael | NC_048113.1 |
| Skarpretter | MK105855 | 1 | 42,042 | 63 | - | 55.8 |
| 37.9 | Escherichia phage C130_2 | MH363708.1 |
| sortkaff | MN850581 | 1 | 42,538 | 61 | - | 59.5 |
| 89.8 | Klebsiella phage vB_KpnS_IME279 | MF614100.1 |
| Sortsne | MK651787 | 1 | 41,912 | 62 | - | 60 |
| 67.6 | Klebsiella phage vB_KpnS_IME279 | MF614100.1 |
| aldrigsur | MN850592 | 1 | 42,379 | 55 | - | 55.7 |
| 71.9 | Enterobacteria phage J8-65 | NC_025445.1 |
| altidsur | MN850568 | 1 | 42,197 | 53 | - | 55.7 |
| 71.8 | Enterobacteria phage J8-65 | NC_025445.1 |
| forsur | MN850617 | 1 | 42,476 | 56 | - | 55.4 |
| 72 | Enterobacteria phage J8-65 | NC_025445.1 |
| glasur | MN850583 | 1 | 42,507 | 56 | - | 55.4 |
| 72.3 | Enterobacteria phage J8-65 | NC_025445.1 |
| Lidtsur | MK629528 | 1 | 42,291 | 56 | - | 54.6 |
| 69 | Enterobacteria phage J8-65 | NC_025445.1 |
| megetsur | MN850608 | 1 | 42,132 | 54 | - | 55.8 |
| 73.1 | Enterobacteria phage J8-65 | NC_025445.1 |
| mellemsur | MN850570 | 1 | 40,770 | 50 | - | 55.8 |
| 76.4 | Enterobacteria phage J8-65 | NC_025445.1 |
| smaasur | MN850625 | 1 | 41,110 | 50 | - | 55.4 |
| 93.3 | Enterobacteria phage J8-65 | NC_025445.1 |
| usur | MN850624 | 2 | 41,906 | 51 | - | 55.4 |
| 73.3 | Enterobacteria phage J8-65 | NC_025445.1 |
Figure 1Phylogenetic trees, bootstrap: 100, scalebar: substitutions per site. (A) Caudovirales based on large terminase subunit terL (Maximum log Likelihood: −1801.27). (B) Microviridae based on the DNA replication protein gene gpA (Maximum log Likelihood: −3922.55). (C) Taxonomic distribution of phages identified in 94 Danish wastewater samples, based on similarity to closest related and the ICTV Master Species list. 1 ≤95% similarity to other phages in the dataset. 2 ≤95% similarity to other phages in the dataset and the NCBI GenBank. 3 The Murrayvirus genus has by mistake been classified as Siphoviridae, but a proposed move to Podoviridae will be included in the 2021 ICTV ratification.
Figure 2(A) Number of phages and taxonomic family distribution per WWTP, as well as approximate geographical location thereof. (B) Bubble-diagram of the 104 unique coliphage species displaying genome size and GC content distribution; the area of the bubbles indicates the number of phages. (C) Sample completeness curve with confidence intervals (0.95). (D) Sample-size-based rarefaction and extrapolation curve with confidence intervals (0.95). Sampling units in figures C and D denote isolated phages (n = 136).
Figure 3Pairwise alignment of the unclassified Myoviridae phage species from this study (blue text) and their closest relatives (black text), the color bars between genomes indicate percent pairwise similarity (Easyfig, BlASTn). The genes marked with yellow code for a tailspike (gene 143 in phi92).
Figure 4Pairwise alignment of the new Carltongylesvirus phage flopper (blue) and the Carltongylesvirus phages GJ1 and ST32; the colour bars between genomes indicate percent pairwise similarity (Easyfig, BlASTn). Genomes have been modified to have similar starting points. Genes coding for the putative tail tape measure protein (red) and a tail fiber (orange) are colorised.
Figure 5Pairwise alignment of the new Hanrivervirus phages (green text) and type species pSf-1 (black text), the new Warwickvirus phages (blue text), and type species swan01 (black text) and Jahat (brown text); the color bars between genomes indicate percent pairwise similarity (Easyfig, BlASTn). Genomes have been modified to have similar starting points. Coloured genes (purple) code for a putative tail fiber.
Figure 6Pairwise alignment of the new Dhillonvirus phage species (blue text) and the type species HK578 (black text); the colour bars between genomes indicate percent pairwise similarity (Easyfig, BlASTn). Genomes have been modified to have similar starting points. Highlighted genes (green) code for a putative tail fiber.
Figure 7Comparisons of the new phage lineage representative Halfdan (blue text) and closest relatives (black text). (A) Phylogenetic tree, Maximum log Likelihood: −7678.71, bootstrap 100, large terminase subunit TerL, scalebar: substitutions per site. (B) Pairwise alignment of phage genomes; color bars between genomes indicate percent pairwise similarity (Easyfig, BlASTn). Genomes have been modified to have similar starting points.
Figure 8Comparisons of the Podoviridae phage species from this study (blue text) with closest relatives (black text). (A) Phylogenetic tree, Maximum log Likelihood: −8023.43, bootstrap 100, large terminase subunit TerL, scalebar: substitutions per site. (B) Pairwise alignment of phage genomes; color bars between genomes indicate percent pairwise similarity (Easyfig, BlASTn). Genomes have been modified to have similar starting points.
Figure 9Significant hits for finding close relatives of the phages isolated in this study in three databases. Hits are defined as hits ≥80% genome coverage when blasting the coliphage genomes against the IMG/VR and GVD databases, and as mapped reads covering ≥70% of individual coliphage genomes when mapping reads from the Wang study. The coliphages are grouped according to genera, and numbers in parentheses denote the number of coliphage species representing each genus; only genera with significant hits are shown. Color-codes of genera denote taxonomic family.