Ryan Cook1, Nathan Brown2, Tamsin Redgwell3, Branko Rihtman4, Megan Barnes2, Martha Clokie2, Dov J Stekel5, Jon Hobman5, Michael A Jones1, Andrew Millard2. 1. School of Veterinary Medicine and Science, University of Nottingham, Loughborough, United Kingdom. 2. Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom. 3. COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark. 4. School of Life Sciences, University of Warwick, Coventry, United Kingdom. 5. School of Biosciences, University of Nottingham, Loughborough, United Kingdom.
Abstract
Background: With advances in sequencing technology and decreasing costs, the number of phage genomes that have been sequenced has increased markedly in the past decade. Materials and Methods: We developed an automated retrieval and analysis system for phage genomes (https://github.com/RyanCook94/inphared) to produce the INfrastructure for a PHAge REference Database (INPHARED) of phage genomes and associated metadata. Results: As of January 2021, 14,244 complete phage genomes have been sequenced. The INPHARED data set is dominated by phages that infect a small number of bacterial genera, with 75% of phages isolated on only 30 bacterial genera. There is further bias, with significantly more lytic phage genomes (∼70%) than temperate (∼30%) within our database. Collectively, this results in ∼54% of temperate phage genomes originating from just three host genera. With much debate on the carriage of antibiotic resistance genes and their potential safety in phage therapy, we searched for putative antibiotic resistance genes. Frequency of antibiotic resistance gene carriage was found to be higher in temperate phages than in lytic phages and again varied with host. Conclusions: Given the bias of currently sequenced phage genomes, we suggest to fully understand phage diversity, efforts should be made to isolate and sequence a larger number of phages, in particular temperate phages, from a greater diversity of hosts. Copyright 2021, Mary Ann Liebert, Inc., publishers.
Background: With advances in sequencing technology and decreasing costs, the number of phage genomes that have been sequenced has increased markedly in the past decade. Materials and Methods: We developed an automated retrieval and analysis system for phage genomes (https://github.com/RyanCook94/inphared) to produce the INfrastructure for a PHAge REference Database (INPHARED) of phage genomes and associated metadata. Results: As of January 2021, 14,244 complete phage genomes have been sequenced. The INPHARED data set is dominated by phages that infect a small number of bacterial genera, with 75% of phages isolated on only 30 bacterial genera. There is further bias, with significantly more lytic phage genomes (∼70%) than temperate (∼30%) within our database. Collectively, this results in ∼54% of temperate phage genomes originating from just three host genera. With much debate on the carriage of antibiotic resistance genes and their potential safety in phage therapy, we searched for putative antibiotic resistance genes. Frequency of antibiotic resistance gene carriage was found to be higher in temperate phages than in lytic phages and again varied with host. Conclusions: Given the bias of currently sequenced phage genomes, we suggest to fully understand phage diversity, efforts should be made to isolate and sequence a larger number of phages, in particular temperate phages, from a greater diversity of hosts. Copyright 2021, Mary Ann Liebert, Inc., publishers.
Authors: Yu-Fan Tsao; Véronique L Taylor; Smriti Kala; Joseph Bondy-Denomy; Alima N Khan; Diane Bona; Vincent Cattoir; Stephen Lory; Alan R Davidson; Karen L Maxwell Journal: J Bacteriol Date: 2018-10-23 Impact factor: 3.490
Authors: Simon Roux; David Páez-Espino; I-Min A Chen; Krishna Palaniappan; Anna Ratner; Ken Chu; T B K Reddy; Stephen Nayfach; Frederik Schulz; Lee Call; Russell Y Neches; Tanja Woyke; Natalia N Ivanova; Emiley A Eloe-Fadrosh; Nikos C Kyrpides Journal: Nucleic Acids Res Date: 2021-01-08 Impact factor: 16.971
Authors: Cynthia L Monaco; David B Gootenberg; Guoyan Zhao; Scott A Handley; Musie S Ghebremichael; Efrem S Lim; Alex Lankowski; Megan T Baldridge; Craig B Wilen; Meaghan Flagg; Jason M Norman; Brian C Keller; Jesús Mario Luévano; David Wang; Yap Boum; Jeffrey N Martin; Peter W Hunt; David R Bangsberg; Mark J Siedner; Douglas S Kwon; Herbert W Virgin Journal: Cell Host Microbe Date: 2016-03-09 Impact factor: 21.023
Authors: Adam G Clooney; Thomas D S Sutton; Andrey N Shkoporov; Ross K Holohan; Karen M Daly; Orla O'Regan; Feargal J Ryan; Lorraine A Draper; Scott E Plevy; R Paul Ross; Colin Hill Journal: Cell Host Microbe Date: 2019-11-19 Impact factor: 21.023
Authors: Eric W Sayers; Mark Cavanaugh; Karen Clark; James Ostell; Kim D Pruitt; Ilene Karsch-Mizrachi Journal: Nucleic Acids Res Date: 2020-01-08 Impact factor: 16.971
Authors: Graham F Hatfull; Marisa L Pedulla; Deborah Jacobs-Sera; Pauline M Cichon; Amy Foley; Michael E Ford; Rebecca M Gonda; Jennifer M Houtz; Andrew J Hryckowian; Vanessa A Kelchner; Swathi Namburi; Kostandin V Pajcini; Mark G Popovich; Donald T Schleicher; Brian Z Simanek; Alexis L Smith; Gina M Zdanowicz; Vanaja Kumar; Craig L Peebles; William R Jacobs; Jeffrey G Lawrence; Roger W Hendrix Journal: PLoS Genet Date: 2006-06-09 Impact factor: 5.917
Authors: Audra E Devoto; Joanne M Santini; Matthew R Olm; Karthik Anantharaman; Patrick Munk; Jenny Tung; Elizabeth A Archie; Peter J Turnbaugh; Kimberley D Seed; Ran Blekhman; Frank M Aarestrup; Brian C Thomas; Jillian F Banfield Journal: Nat Microbiol Date: 2019-01-28 Impact factor: 30.964
Authors: Nuala A O'Leary; Mathew W Wright; J Rodney Brister; Stacy Ciufo; Diana Haddad; Rich McVeigh; Bhanu Rajput; Barbara Robbertse; Brian Smith-White; Danso Ako-Adjei; Alexander Astashyn; Azat Badretdin; Yiming Bao; Olga Blinkova; Vyacheslav Brover; Vyacheslav Chetvernin; Jinna Choi; Eric Cox; Olga Ermolaeva; Catherine M Farrell; Tamara Goldfarb; Tripti Gupta; Daniel Haft; Eneida Hatcher; Wratko Hlavina; Vinita S Joardar; Vamsi K Kodali; Wenjun Li; Donna Maglott; Patrick Masterson; Kelly M McGarvey; Michael R Murphy; Kathleen O'Neill; Shashikant Pujar; Sanjida H Rangwala; Daniel Rausch; Lillian D Riddick; Conrad Schoch; Andrei Shkeda; Susan S Storz; Hanzhen Sun; Francoise Thibaud-Nissen; Igor Tolstoy; Raymond E Tully; Anjana R Vatsan; Craig Wallin; David Webb; Wendy Wu; Melissa J Landrum; Avi Kimchi; Tatiana Tatusova; Michael DiCuccio; Paul Kitts; Terence D Murphy; Kim D Pruitt Journal: Nucleic Acids Res Date: 2015-11-08 Impact factor: 16.971