
Avoiding Pandemic Fears in the Subway and Conquering the Platypus.

A Gonzalez, Y Vázquez-Baeza, J B Pettengill, A Ottesen, D McDonald, R Knight.

Abstract

Metagenomics is increasingly used not just to show patterns of microbial diversity but also as a culture-independent method to detect individual organisms of intense clinical, epidemiological, conservation, forensic, or regulatory interest. A widely reported metagenomic study of the New York subway suggested that the pathogens Yersinia pestis and Bacillus anthracis were part of the "normal subway microbiome." In their article in mSystems, Hsu and collaborators (mSystems 1(3):e00018-16, 2016, http://dx.doi.org/10.1128/mSystems.00018-16) showed that microbial communities on transit surfaces in the Boston subway system are maintained from a metapopulation of human skin commensals and environmental generalists and that reanalysis of the New York subway data with appropriate methods did not detect the pathogens. We note that commonly used software pipelines can produce results that lack prima facie validity (e.g., reporting widespread distribution of notorious endemic species such as the platypus or the presence of pathogens) but that appropriate use of inclusion and exclusion sets can avoid this issue.

Year: 2016; PMID: 27832215; PMCID: PMC5069772; DOI: 10.1128/mSystems.00050-16

Source DB: PubMed; Journal: mSystems; ISSN: 2379-5077; Impact factor: 6.496


COMMENTARY

The development and validation of novel methods that use next-generation DNA sequence data to detect pathogens in complex ecosystems are important areas of research. These methods matter particularly in studies of the built environment and of agricultural systems, where correct detection of pathogens offers enormous public benefit and incorrect detection creates fear. For example, in a recent study of the New York subway (1), incorrect taxonomic classifications led the authors to report Yersinia pestis (the causative agent of plague) and Bacillus anthracis (the causative agent of anthrax) as part of the “normal subway microbiome.” These observations led to high-visibility news reports. However, an improved reanalysis of the same data by Hsu et al. (2) demonstrated that these results were illusory. Hsu et al. found that these pathogens were not part of the normal subway microbiome, either in New York or in an independent sample set from the Boston subway. They drew the more plausible conclusion that the surfaces were dominated by inputs of normal human skin bacteria, consistent with other studies, and found that the subway was not a reservoir of bacterially encoded toxins or antimicrobial resistance elements. That carefully conducted study added fundamentally to our knowledge of the transmission and expression of microbes in high-traffic built environments.

Another example of the importance of accurate pathogen identification from next-generation sequencing data is the detection of Salmonella on fresh produce. In a study by Ottesen et al. (3), the authors could not confirm the presence of Salmonella on tomato crops by 16S amplicon sequencing. However, an analysis of shotgun data from samples collected from the roots, leaves, and fruits of the tomato plants, performed using the MG-RAST server, reported hits corresponding to Salmonella. Furthermore, this analysis also showed the surprising presence of Gallus gallus (red jungle fowl), Mus musculus (house mouse), and even the elusive Ornithorhynchus anatinus (duck-billed platypus).

Detecting the presence of specific taxa from MG-RAST public datasets.

To illustrate the pervasiveness of false positives in MG-RAST, we downloaded all public samples (25,943 samples; accessed 22 April 2015), searched each report for Salmonella, Raphus (dodo bird), Thylacinus (Tasmanian tiger), and Ornithorhynchus (duck-billed platypus), and summarized the findings by the country in which each organism was reportedly observed, on the basis of the latitude and longitude fields in the associated metadata (Table 1). A Jupyter (8) Notebook reproducing this report is available at http://goo.gl/UIhBjf.
TABLE 1

Number of hits to specific taxa, living and extinct, and locations as reported by MG-RAST

| Taxonomy | Extinct | Total no. of hits reported by MG-RAST | Main country locations (no. of hits [sorted by abundance]) |
| --- | --- | --- | --- |
| Ornithorhynchus | No | 17,140,078 | Brazil (4,338,217), Australia (3,905,173), United States (2,669,553), Italy (2,665,186), Malawi (1,335,746), undefined (585,412), Kyrgyzstan (558,786), Russian Federation (333,978), South Africa (289,642), Belgium (198,052), Finland (168,848), China (50,542), Israel (27,366), Philippines (13,577) |
| Raphus | Yes | 11 | Brazil (8), Australia (3) |
| Salmonella | No | 146,842,227 | Italy (76,730,072), Brazil (33,956,417), United States (14,178,170), Malawi (3,808,783), China (3,383,261), Australia (3,354,697), undefined (3,257,862), Russian Federation (2,750,106), Finland (1,886,515), Belgium (1,373,668), South Africa (1,105,658), Israel (783,766), Philippines (232,026), Kyrgyzstan (41,226) |
| Thylacinus | Yes | 1,344 | Brazil (920), Australia (125), United States (80), Malawi (63), undefined (46), South Africa (32), Finland (23), Belgium (21), Russian Federation (15), Italy (13), Israel (4), China (2) |
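
The kind of per-country tally reported in Table 1 can be sketched in a few lines of Python. This is a minimal illustration only and is not the code from the notebook cited above: the record layout (`genus`, `country`, `hits`), the function name `summarize_hits`, and the toy per-sample values are all assumptions made for this example. Records without a resolvable country are grouped under "undefined", mirroring the label used in Table 1.

```python
from collections import defaultdict


def summarize_hits(records, taxa_of_interest):
    """Tally hits per genus and per country for the genera of interest.

    `records` is a hypothetical list of dicts with the keys "genus",
    "country" (may be None), and "hits"; it is not an MG-RAST data format.
    """
    totals = defaultdict(int)
    by_country = defaultdict(lambda: defaultdict(int))
    for rec in records:
        genus = rec["genus"]
        if genus not in taxa_of_interest:
            continue
        # Samples with no usable location are reported as "undefined".
        country = rec.get("country") or "undefined"
        totals[genus] += rec["hits"]
        by_country[genus][country] += rec["hits"]

    summary = {}
    for genus, total in totals.items():
        # Sort countries by descending hit count, then by name for stability.
        ranked = sorted(by_country[genus].items(), key=lambda kv: (-kv[1], kv[0]))
        summary[genus] = {"total": total, "countries": ranked}
    return summary


# Toy records; the per-sample values are invented for illustration.
records = [
    {"sample": "s1", "genus": "Raphus", "country": "Brazil", "hits": 5},
    {"sample": "s2", "genus": "Raphus", "country": "Brazil", "hits": 3},
    {"sample": "s3", "genus": "Raphus", "country": "Australia", "hits": 3},
    {"sample": "s4", "genus": "Thylacinus", "country": None, "hits": 46},
    {"sample": "s5", "genus": "Escherichia", "country": "Italy", "hits": 100},
]

summary = summarize_hits(records, {"Raphus", "Thylacinus"})
for genus, info in sorted(summary.items()):
    print(genus, info["total"], info["countries"])
```

A summary of this form makes implausible results easy to spot: any nonzero total for an extinct genus is, by construction, a false positive.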

Conquering the platypus.

To demonstrate how the presence of specific taxa in metagenomic samples can be confirmed, we created Platypus Conquistador (https://github.com/biocore/Platypus-Conquistador), a BSD-licensed Python package based on BLAST (4) and SortMeRNA (5). Platypus Conquistador confirms the presence or absence of a taxon of interest within shotgun metagenomic datasets by relying on two reference sequence databases: an inclusion database, which contains the sequences of interest (e.g., Salmonella), and an exclusion database, which contains any known sequence background (e.g., platypus). The two reference databases are expected to be mutually exclusive. In general, they can be created by partitioning an existing database, such as the gene data provided by the Integrated Microbial Genomes (IMG) (6) system, and the partitions can be customized to include taxa of specific interest. This method was used by Ottesen et al. (7) to describe the efficacy of enrichment steps in the effort to culture Salmonella from tomatoes. For that analysis, the authors ran Platypus Conquistador on shotgun metagenomic data using the IMG database split into an inclusion database, containing only those sequences assigned to Salmonella, and an exclusion database, containing all remaining sequences, and thereby demonstrated the absence of this pathogen.
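
The inclusion/exclusion idea can be illustrated with a short Python sketch. It is not Platypus Conquistador's implementation or interface: the function names, the toy hit rows, and the decision rule (a read supports the taxon of interest only when its best bit score against the inclusion database is strictly higher than its best bit score against the exclusion database) are assumptions chosen for this example. The input is assumed to be BLAST tabular output (`-outfmt 6`), whose default twelve columns end with the bit score.

```python
def best_bitscores(tabular_lines):
    """Return {read_id: best bit score} from BLAST tabular (outfmt 6) lines."""
    best = {}
    for line in tabular_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        # Default outfmt 6 columns: qseqid is first, bitscore is twelfth.
        read_id, bitscore = fields[0], float(fields[11])
        if bitscore > best.get(read_id, float("-inf")):
            best[read_id] = bitscore
    return best


def classify_reads(inclusion_lines, exclusion_lines):
    """Label each read by which database explains it better (assumed rule)."""
    inc = best_bitscores(inclusion_lines)
    exc = best_bitscores(exclusion_lines)
    calls = {}
    for read_id in sorted(set(inc) | set(exc)):
        i, e = inc.get(read_id), exc.get(read_id)
        if i is not None and (e is None or i > e):
            calls[read_id] = "inclusion"
        elif e is not None and (i is None or e > i):
            calls[read_id] = "exclusion"
        else:
            calls[read_id] = "ambiguous"  # equal best scores in both databases
    return calls


def row(read_id, subject, bitscore):
    """Build one toy outfmt 6 row; alignment columns are placeholder values."""
    cols = [read_id, subject, "99.0", "100", "1", "0",
            "1", "100", "1", "100", "1e-50", str(bitscore)]
    return "\t".join(cols)


# Hypothetical hits against an inclusion (Salmonella) and an exclusion database.
inclusion_hits = [row("read1", "salmonella_gene_A", 180.0),
                  row("read2", "salmonella_gene_B", 120.0),
                  row("read3", "salmonella_gene_C", 150.0)]
exclusion_hits = [row("read2", "background_gene_X", 175.0),
                  row("read3", "background_gene_Y", 150.0),
                  row("read4", "background_gene_Z", 90.0)]

calls = classify_reads(inclusion_hits, exclusion_hits)
print(calls)
```

Under this rule, a read such as `read2`, which matches Salmonella but matches a background sequence better, does not count as evidence for the pathogen. This is the situation in which a single-database search would report a false positive.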

Conclusions.

Simple bioinformatics solutions exist to detect taxa of interest and to resolve incorrect taxonomic classifications for shotgun sequencing data. Incorrect but pervasive taxonomic classifications can lead to conclusions that lack prima facie validity (for example, the platypus was reportedly found in settings ranging from the built environment to the human gut). Worse, these incorrect assignments have great potential to spark unwarranted public concern, as was seen in the case of the NYC subway microbiome paper noted above. These examples should also serve as a reminder that, although analytical software pipelines and computational methods can be thoroughly tested and validated, their output depends on user-specified parameters, which change the results and, as a consequence, their validity. Researchers must always question whether the parameters are reasonable and what the results mean, to reduce the possibility of incorrect conclusions. Moving toward standardized and reproducible analysis pipelines that can be scrutinized by our peers will greatly help avoid similar problems in the future. For pathogen detection, it is also critical to define taxon inclusion and exclusion criteria based on the studied environment in order to discard misleading results. This is especially important in cases of intense public interest, such as exposure, in systems used by millions of people every day, to apparent pathogens that are as illusory as the benthic platypus.
References

1.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

2.  SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data.

Authors:  Evguenia Kopylova; Laurent Noé; Hélène Touzet
Journal:  Bioinformatics       Date:  2012-10-15       Impact factor: 6.937

3.  IMG: the Integrated Microbial Genomes database and comparative analysis system.

Authors:  Victor M Markowitz; I-Min A Chen; Krishna Palaniappan; Ken Chu; Ernest Szeto; Yuri Grechkin; Anna Ratner; Biju Jacob; Jinghua Huang; Peter Williams; Marcel Huntemann; Iain Anderson; Konstantinos Mavromatis; Natalia N Ivanova; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2012-01       Impact factor: 16.971

4.  Baseline survey of the anatomical microbial ecology of an important food plant: Solanum lycopersicum (tomato).

Authors:  Andrea R Ottesen; Antonio González Peña; James R White; James B Pettengill; Cong Li; Sarah Allard; Steven Rideout; Marc Allard; Thomas Hill; Peter Evans; Errol Strain; Steven Musser; Rob Knight; Eric Brown
Journal:  BMC Microbiol       Date:  2013-05-24       Impact factor: 3.605

5.  Co-enriching microflora associated with culture based methods to detect Salmonella from tomato phyllosphere.

Authors:  Andrea R Ottesen; Antonio Gonzalez; Rebecca Bell; Caroline Arce; Steven Rideout; Marc Allard; Peter Evans; Errol Strain; Steven Musser; Rob Knight; Eric Brown; James B Pettengill
Journal:  PLoS One       Date:  2013-09-09       Impact factor: 3.240

6.  Geospatial Resolution of Human and Bacterial Diversity with City-Scale Metagenomics.

Authors:  Ebrahim Afshinnekoo; Cem Meydan; Shanin Chowdhury; Dyala Jaroudi; Collin Boyer; Nick Bernstein; Julia M Maritz; Darryl Reeves; Jorge Gandara; Sagar Chhangawala; Sofia Ahsanuddin; Amber Simmons; Timothy Nessel; Bharathi Sundaresh; Elizabeth Pereira; Ellen Jorgensen; Sergios-Orestis Kolokotronis; Nell Kirchberger; Isaac Garcia; David Gandara; Sean Dhanraj; Tanzina Nawrin; Yogesh Saletore; Noah Alexander; Priyanka Vijay; Elizabeth M Hénaff; Paul Zumbo; Michael Walsh; Gregory D O'Mullan; Scott Tighe; Joel T Dudley; Anya Dunaif; Sean Ennis; Eoghan O'Halloran; Tiago R Magalhaes; Braden Boone; Angela L Jones; Theodore R Muth; Katie Schneider Paolantonio; Elizabeth Alter; Eric E Schadt; Jeanne Garbarino; Robert J Prill; Jane M Carlton; Shawn Levy; Christopher E Mason
Journal:  Cell Syst       Date:  2015-03-03       Impact factor: 10.304
