| Literature DB >> 25437808 |
Kelly L Wyres1, Thomas C Conway2, Saurabh Garg3, Carlos Queiroz4, Matthias Reumann5, Kathryn Holt6, Laura I Rusu7.
Abstract
Recent advances in DNA sequencing technologies have the potential to transform the field of clinical and public health microbiology, and in the last few years numerous case studies have demonstrated successful applications in this context. Among other considerations, a lack of user-friendly data analysis and interpretation tools has been frequently cited as a major barrier to routine use of these techniques. Here we consider the requirements of microbiology laboratories for the analysis, clinical interpretation and management of bacterial whole-genome sequence (WGS) data. Then we discuss relevant, existing WGS analysis tools. We highlight many essential and useful features that are represented among existing tools, but find that no single tool fulfils all of the necessary requirements. We conclude that to fully realise the potential of WGS analyses for clinical and public health microbiology laboratories of all scales, we will need to develop tools specifically with the needs of these laboratories in mind.Entities:
Year: 2014 PMID: 25437808 PMCID: PMC4243455 DOI: 10.3390/pathogens3020437
Source DB: PubMed Journal: Pathogens ISSN: 2076-0817
Figure 1Publications describing clinical/public microbiology whole-genome sequence (WGS) analyses. Publications as listed in the PubMED database were identified by use of search terms “whole-genome” AND “outbreak”. Abstracts were manually screened to include only those describing bacterial outbreak investigations or evaluation of WGS analyses for outbreak investigations. * Publications as of 26 May 2014 only.
Analytical capability of targeted Next-Generation (DNA) Sequencing (NGS) analysis solutions with respect to clinical and public health microbiology investigative methods.
| Solution | Date of Publication/First Release | Upload/Analyse Raw Sequence Data | Reference-Based Mapping | Variant Calling | Typing analyses (e.g., MLST) | Comparative Typing Analyses | Multiple Sequence Alignment | Phylogenetic Tree/Network Construction | |
|---|---|---|---|---|---|---|---|---|---|
| BioNumerics a | 1992 | Yes | Yes | Yes | Yes | No c | Yes | Yes | Yes |
| CLC Genomics Workbench a | 2008 | Yes | Yes | Yes | Yes | No | No | Yes | Yes |
| Galaxy b | 2007 | Yes | Yes | Yes | Yes | No | No | Yes | Yes |
| BIGSdb | 2010 | No | No | No | Yes | Yes | Yes | Yes | Yes |
| Center for Genomic Epidemiology Web Portal | 2011 | Yes | No | Yes | Yes | Yes | Yes | No d | Yes |
| Ridom SeqSphere+ a | 2013 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| snp-search | 2013 | No | No | No | No | No | Yes | No e | Yes |
a Commercial product;
b Features/analyses described for the default installation and tools available in the Galaxy Tool Shed as of September 2013 only;
c There is a function to automate typing analyses from first generation but not next-generation sequence data;
d The Center for Genomic Epidemiology snpTree tool deals only with variant call information and thus multiple sequence alignment sensu stricto is not applicable; however the tool will generate a pseudo-alignment of concatenated single nucleotide polymorphisms;
e as described for snpTree, snp-search deals only with variant call information but will output a pseudo-alignment of concatenated single nucleotide polymorphisms in FASTA format.
Methodologies and analysis execution techniques of recently published bacterial outbreak investigations.
| Species | Focus a | Type(s) of Methods | Tools b | Ref. |
|---|---|---|---|---|
| Mapping, variant calling, phylogeny | Independent and/or custom | [ | ||
| Independent and/or custom | ||||
| Independent and/or custom | ||||
| Mapping, variant calling, phylogeny | Independent and/or custom | [ | ||
| Independent and/or custom | ||||
| Independent and/or custom | ||||
| Mapping, variant calling | CLC Genomics Workbench | [ | ||
| CLC Genomics Workbench | ||||
| CLC Genomics Workbench/custom | ||||
| Mapping, variant calling, network/ | CLC Genomics Workbench/custom | |||
| Mapping, variant calling, minimum spanning tree | Independent and/or custom | [ | ||
| Mapping, variant calling, phylogeny | Independent and/or custom | [ | ||
| Mapping, variant calling, phylogeny | Independent and/or custom | [ | ||
| Independent/BIGSdb | [ | |||
| Independent/BIGSdb | ||||
| Mapping, variant calling, phylogeny | Independent and/or custom | [ | ||
| Mapping, variant calling, multiple sequence alignment (references), phylogeny | Independent and/or custom | [ | ||
| Mapping, variant calling, phylogeny | Independent and/or custom | [ | ||
| Mapping, read depth analysis, variant calling | Independent and/or custom | |||
| Mapping, read depth analysis, variant calling | Independent and/or custom | |||
| Mapping, read depth analysis, variant calling | Independent and/or custom |
a Differentiation—the aim was to differentiate outbreak from non-outbreak strains; Diversity/resolution—the aim was to assess the diversity and/or resolve the phylogenetic relationships among outbreak strains; Transmission—the aim of the study was to resolve the relationships between strains specifically to enable transmission tracing;
b Independent and/or custom—refers to the use of independent bioinformatics tools for different analysis stages (e.g., separate open source tools for mapping and variant calling) and/or custom analyses/scripts.
Workflow management systems: general features.
| Application | Native App/Web-based | Security/Authentication Available? | Workflow Design | Analysis Progress Monitoring | Ability to Stop/Pause/Resume Workflows? | Rerun Workflows with Different Parameters? |
|---|---|---|---|---|---|---|
| Anduril | Native | Unclear | AndurilScript | Yes | Stop only | Yes |
| Biomolecular Hub | Native | Yes | Drag-and-drop | Yes | Stop only | Unclear |
| Chipster | Both | Yes | List | Yes | No | Yes |
| NG6 | Web-based | Yes | Drag-and-drop | Yes | No | No |
| VIBE | Client/server | Yes | Drag-and-drop | Yes | Stop only | No |
| Wildfire | Native | No | Drag-and-drop | Yes | No | Yes |
| Kepler | Native | No | Drag-and-drop | Yes | Yes | Yes |
| KNIME | Both | Yes | Drag-and-drop | Yes | Yes | Yes |
| Pegasus | Native | No | Dax files | Yes | Stop only | Yes |
| Pipeline Pilot | Native | Yes | Drag-and-drop | Yes | Yes | Yes |
| Platform Process Manager (PPM) | Native | Yes | Drag-and-drop | Yes | Yes | Yes |
| Taverna | Native | Yes | Drag-and-drop | Yes | Yes | Yes |
Workflow management systems: analysis-specific features and references. API—Application Programming Interface.
| Product | Generic | NGS Analysis components? | Integration of third-party analyses? | Languages for Integration of New Tools | Reference and/or Vendor |
|---|---|---|---|---|---|
| Anduril | Specific | Limited | Yes | XML | [ |
| Biomolecular Hub | Specific | No | Unclear | Java | IDBS, Guildford, UK |
| Chipster | Specific | Limited | Yes | R/configuration changes | [ |
| NG6 | Specific | Yes | Yes | Python (through XML)/Smarty template | [ |
| VIBE | Specific | No | Yes | XML/Java (through API) | INCOGEN Inc., Williamsburg, VA, USA |
| Wildfire | Specific | No | Yes | Extended ACD syntax (through API) | [ |
| Kepler | Generic | No | Yes | Java | [ |
| KNIME | Generic | No | Yes | Eclipse Plugins | KNIME.com AG, Zurich, Switzerland |
| Pegasus | Generic | No | Yes | DAX workflow definitions | [ |
| Pipeline Pilot | Generic | Yes | Yes | Java/Perl/VBScript/SOAP | Accelrys, Inc., San Diego, CA, USA |
| Platform Process Manager (PPM) | Generic | No | Yes | Custom scripts | IBM Corp., Armonk, NY, USA |
| Taverna | Generic | Yes | Yes | SOAP/template library | [ |