Literature DB >> 33239371

A Bioinformatic Pipeline for Improved Genome Analysis and Clustering of Isolates during Outbreaks of Legionnaires' Disease.

Wolfgang Haas1, Pascal Lapierre2, Kimberlee A Musser2.   

Abstract

Legionnaires' disease, a severe lung infection caused by the bacterium Legionella pneumophila, occurs as single cases or in outbreaks that are actively tracked by public health departments. To determine the point source of an outbreak, clinical isolates need to be compared to environmental samples to find matching isolates. One confounding factor is the genome plasticity of L. pneumophila, making an exact sequence comparison by whole-genome sequencing (WGS) challenging. Here, we present a WGS analysis pipeline, LegioCluster, that is designed to circumvent this problem by automatically selecting the best matching reference genome prior to mapping and variant calling. This approach reduces the number of false-positive variant calls, maximizes the fraction of all genomes that are being compared, and naturally clusters the isolates according to their reference strain. Isolates that are too distant from any genome in the database are added to the list of candidate references, thereby creating a new cluster. Short insertions or deletions are considered in addition to single-nucleotide polymorphisms for increased discriminatory power. This manuscript describes the use of this automated and "locked down" bioinformatic pipeline deployed at the New York State Department of Health's Wadsworth Center for investigating relatedness between clinical and environmental isolates. A similar pipeline has not been widely available for use to support these critically important public health investigations.
Copyright © 2021 American Society for Microbiology.

Entities:  

Keywords:  Legionella pneumophila; next-generation sequencing

Year:  2021        PMID: 33239371      PMCID: PMC8111141          DOI: 10.1128/JCM.00967-20

Source DB:  PubMed          Journal:  J Clin Microbiol        ISSN: 0095-1137            Impact factor:   5.948


  28 in total

1.  A Large Community Outbreak of Legionnaires' Disease Associated With a Cooling Tower in New York City, 2015.

Authors:  Don Weiss; Christopher Boyd; Jennifer L Rakeman; Sharon K Greene; Robert Fitzhenry; Trevor McProud; Kimberlee Musser; Li Huang; John Kornblum; Elizabeth J Nazarian; Annie D Fine; Sarah L Braunstein; Daniel Kass; Keren Landman; Pascal Lapierre; Scott Hughes; Anthony Tran; Jill Taylor; Deborah Baker; Lucretia Jones; Laura Kornstein; Boning Liu; Rodolfo Perez; David E Lucero; Eric Peterson; Isaac Benowitz; Kristen F Lee; Stephanie Ngai; Mitch Stripling; Jay K Varma
Journal:  Public Health Rep       Date:  2017-01-31       Impact factor: 2.792

2.  Qualimap: evaluating next-generation sequencing alignment data.

Authors:  Fernando García-Alcalde; Konstantin Okonechnikov; José Carbonell; Luis M Cruz; Stefan Götz; Sonia Tarazona; Joaquín Dopazo; Thomas F Meyer; Ana Conesa
Journal:  Bioinformatics       Date:  2012-08-22       Impact factor: 6.937

3.  ART: a next-generation sequencing read simulator.

Authors:  Weichun Huang; Leping Li; Jason R Myers; Gabor T Marth
Journal:  Bioinformatics       Date:  2011-12-23       Impact factor: 6.937

4.  Insights into the long-term persistence of Legionella in facilities from whole-genome sequencing.

Authors:  Megan Wells; Erica Lasek-Nesselquist; Dianna Schoonmaker-Bopp; Deborah Baker; Lisa Thompson; Danielle Wroblewski; Elizabeth Nazarian; Pascal Lapierre; Kimberlee A Musser
Journal:  Infect Genet Evol       Date:  2018-07-31       Impact factor: 3.342

5.  The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes.

Authors:  Todd J Treangen; Brian D Ondov; Sergey Koren; Adam M Phillippy
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

6.  Legionnaires' Disease Outbreak Caused by Endemic Strain of Legionella pneumophila, New York, New York, USA, 2015.

Authors:  Pascal Lapierre; Elizabeth Nazarian; Yan Zhu; Danielle Wroblewski; Amy Saylors; Teresa Passaretti; Scott Hughes; Anthony Tran; Ying Lin; John Kornblum; Shatavia S Morrison; Jeffrey W Mercante; Robert Fitzhenry; Don Weiss; Brian H Raphael; Jay K Varma; Howard A Zucker; Jennifer L Rakeman; Kimberlee A Musser
Journal:  Emerg Infect Dis       Date:  2017-11       Impact factor: 6.883

7.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

8.  Kraken: ultrafast metagenomic sequence classification using exact alignments.

Authors:  Derrick E Wood; Steven L Salzberg
Journal:  Genome Biol       Date:  2014-03-03       Impact factor: 13.583

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  SNVPhyl: a single nucleotide variant phylogenomics pipeline for microbial genomic epidemiology.

Authors:  Aaron Petkau; Philip Mabon; Cameron Sieffert; Natalie C Knox; Jennifer Cabral; Mariam Iskander; Mark Iskander; Kelly Weedmark; Rahat Zaheer; Lee S Katz; Celine Nadon; Aleisha Reimer; Eduardo Taboada; Robert G Beiko; William Hsiao; Fiona Brinkman; Morag Graham; Gary Van Domselaar
Journal:  Microb Genom       Date:  2017-06-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.