Literature DB >> 28821546

A Supervised Statistical Learning Approach for Accurate Legionella pneumophila Source Attribution during Outbreaks.

Andrew H Buultjens1,2, Kyra Y L Chua3, Sarah L Baines1, Jason Kwong1,2,3, Wei Gao1, Zoe Cutcher4,5, Stuart Adcock4, Susan Ballard3, Mark B Schultz3, Takehiro Tomita3, Nela Subasinghe3, Glen P Carter1,2, Sacha J Pidot1, Lucinda Franklin4, Torsten Seemann3,6, Anders Gonçalves Da Silva2,3, Benjamin P Howden7,2,3, Timothy P Stinear7,2.   

Abstract

Public health agencies are increasingly relying on genomics during Legionnaires' disease investigations. However, the causative bacterium (Legionella pneumophila) has an unusual population structure, with extreme temporal and spatial genome sequence conservation. Furthermore, Legionnaires' disease outbreaks can be caused by multiple L. pneumophila genotypes in a single source. These factors can confound cluster identification using standard phylogenomic methods. Here, we show that a statistical learning approach based on L. pneumophila core genome single nucleotide polymorphism (SNP) comparisons eliminates ambiguity for defining outbreak clusters and accurately predicts exposure sources for clinical cases. We illustrate the performance of our method by genome comparisons of 234 L. pneumophila isolates obtained from patients and cooling towers in Melbourne, Australia, between 1994 and 2014. This collection included one of the largest reported Legionnaires' disease outbreaks, which involved 125 cases at an aquarium. Using only sequence data from L. pneumophila cooling tower isolates and including all core genome variation, we built a multivariate model using discriminant analysis of principal components (DAPC) to find cooling tower-specific genomic signatures and then used it to predict the origin of clinical isolates. Model assignments were 93% congruent with epidemiological data, including the aquarium Legionnaires' disease outbreak and three other unrelated outbreak investigations. We applied the same approach to a recently described investigation of Legionnaires' disease within a UK hospital and observed a model predictive ability of 86%. We have developed a promising means to breach L. pneumophila genetic diversity extremes and provide objective source attribution data for outbreak investigations.IMPORTANCE Microbial outbreak investigations are moving to a paradigm where whole-genome sequencing and phylogenetic trees are used to support epidemiological investigations. It is critical that outbreak source predictions are accurate, particularly for pathogens, like Legionella pneumophila, which can spread widely and rapidly via cooling system aerosols, causing Legionnaires' disease. Here, by studying hundreds of Legionella pneumophila genomes collected over 21 years around a major Australian city, we uncovered limitations with the phylogenetic approach that could lead to a misidentification of outbreak sources. We implement instead a statistical learning technique that eliminates the ambiguity of inferring disease transmission from phylogenies. Our approach takes geolocation information and core genome variation from environmental L. pneumophila isolates to build statistical models that predict with high confidence the environmental source of clinical L. pneumophila during disease outbreaks. We show the versatility of the technique by applying it to unrelated Legionnaires' disease outbreaks in Australia and the UK.
Copyright © 2017 American Society for Microbiology.

Entities:  

Keywords:  Legionella pneumophila; comparative studies; genomics; microbial source tracking; phylogeography

Mesh:

Year:  2017        PMID: 28821546      PMCID: PMC5648911          DOI: 10.1128/AEM.01482-17

Source DB:  PubMed          Journal:  Appl Environ Microbiol        ISSN: 0099-2240            Impact factor:   4.792


  41 in total

1.  A Large Community Outbreak of Legionnaires' Disease Associated With a Cooling Tower in New York City, 2015.

Authors:  Don Weiss; Christopher Boyd; Jennifer L Rakeman; Sharon K Greene; Robert Fitzhenry; Trevor McProud; Kimberlee Musser; Li Huang; John Kornblum; Elizabeth J Nazarian; Annie D Fine; Sarah L Braunstein; Daniel Kass; Keren Landman; Pascal Lapierre; Scott Hughes; Anthony Tran; Jill Taylor; Deborah Baker; Lucretia Jones; Laura Kornstein; Boning Liu; Rodolfo Perez; David E Lucero; Eric Peterson; Isaac Benowitz; Kristen F Lee; Stephanie Ngai; Mitch Stripling; Jay K Varma
Journal:  Public Health Rep       Date:  2017-01-31       Impact factor: 2.792

2.  FastTree 2--approximately maximum-likelihood trees for large alignments.

Authors:  Morgan N Price; Paramvir S Dehal; Adam P Arkin
Journal:  PLoS One       Date:  2010-03-10       Impact factor: 3.240

3.  Prokka: rapid prokaryotic genome annotation.

Authors:  Torsten Seemann
Journal:  Bioinformatics       Date:  2014-03-18       Impact factor: 6.937

4.  Phylogenetic analysis of environmental Legionella pneumophila isolates from an endemic area (Alcoy, Spain).

Authors:  Leonor Sánchez-Busó; María Piedad Olmos; María Luisa Camaró; Francisco Adrián; Juan Miguel Calafat; Fernando González-Candelas
Journal:  Infect Genet Evol       Date:  2014-12-13       Impact factor: 3.342

5.  Legionella pneumophila pangenome reveals strain-specific virulence factors.

Authors:  Giuseppe D'Auria; Nuria Jiménez-Hernández; Francesc Peris-Bondia; Andrés Moya; Amparo Latorre
Journal:  BMC Genomics       Date:  2010-03-17       Impact factor: 3.969

Review 6.  Legionella and Legionnaires' disease: 25 years of investigation.

Authors:  Barry S Fields; Robert F Benson; Richard E Besser
Journal:  Clin Microbiol Rev       Date:  2002-07       Impact factor: 26.132

7.  An outbreak of Legionnaires' disease at the Melbourne Aquarium, April 2000: investigation and case-control studies.

Authors:  Jane E Greig; John A Carnie; Graham F Tallis; Norbert J Ryan; Agnes G Tan; Ian R Gordon; Bernard Zwolak; Jennie A Leydon; Charles S Guest; William G Hart
Journal:  Med J Aust       Date:  2004-06-07       Impact factor: 7.738

8.  Evidence in the Legionella pneumophila genome for exploitation of host cell functions and high genome plasticity.

Authors:  Christel Cazalet; Christophe Rusniok; Holger Brüggemann; Nora Zidane; Arnaud Magnier; Laurence Ma; Magalie Tichit; Sophie Jarraud; Christiane Bouchier; François Vandenesch; Frank Kunst; Jérôme Etienne; Philippe Glaser; Carmen Buchrieser
Journal:  Nat Genet       Date:  2004-10-03       Impact factor: 38.330

9.  Gene flow in environmental Legionella pneumophila leads to genetic and pathogenic heterogeneity within a Legionnaires' disease outbreak.

Authors:  Paul R McAdam; Charles W Vander Broek; Diane S J Lindsay; Melissa J Ward; Mary F Hanson; Michael Gillies; Mick Watson; Joanne M Stevens; Giles F Edwards; J Ross Fitzgerald
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

10.  Evaluation of an Optimal Epidemiological Typing Scheme for Legionella pneumophila with Whole-Genome Sequence Data Using Validation Guidelines.

Authors:  Sophia David; Massimo Mentasti; Rediat Tewolde; Martin Aslett; Simon R Harris; Baharak Afshar; Anthony Underwood; Norman K Fry; Julian Parkhill; Timothy G Harrison
Journal:  J Clin Microbiol       Date:  2016-06-08       Impact factor: 5.948

View more
  1 in total

Review 1.  Machine Learning Approaches for Epidemiological Investigations of Food-Borne Disease Outbreaks.

Authors:  Baiba Vilne; Irēna Meistere; Lelde Grantiņa-Ieviņa; Juris Ķibilds
Journal:  Front Microbiol       Date:  2019-08-06       Impact factor: 5.640

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.