| Literature DB >> 28348871 |
Jason C Kwong1,2,3, Anders Gonçalves da Silva2,3, Kristin Dyet4, Deborah A Williamson3, Timothy P Stinear2,1, Benjamin P Howden2,1,3, Torsten Seemann2,5.
Abstract
Whole-genome sequencing (WGS) provides the highest resolution analysis for comparison of bacterial isolates in public health microbiology. However, although increasingly being used routinely for some pathogens such as Listeria monocytogenes and Salmonella enterica, the use of WGS is still limited for other organisms, such as Neisseria gonorrhoeae. Multi-antigen sequence typing (NG-MAST) is the most widely performed typing method for epidemiological surveillance of gonorrhoea. Here, we present NGMASTER, a command-line software tool for performing in silico NG-MAST on assembled genome data. NGMASTER rapidly and accurately determined the NG-MAST of 630 assembled genomes, facilitating comparisons between WGS and previously published gonorrhoea epidemiological studies. The source code and user documentation are available at https://github.com/MDU-PHL/ngmaster.Entities:
Keywords: In silico typing; Multi-antigen sequence typing; NG-MAST; Neisseria gonorrhoeae; Whole-genome sequencing
Mesh:
Year: 2016 PMID: 28348871 PMCID: PMC5320595 DOI: 10.1099/mgen.0.000076
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Concordance between NGMASTER results from draft genome assemblies using MEGAHIT and SPAdes, and previously published NG-MAST results
| Two-stage¶ | Total | |||
|---|---|---|---|---|
| PRJEB2999* | 176 (95 %) | 184 (99 %) | 184 (99 %) | 186 |
| PRJNA29335† | – | – | – | 1 |
| PRJNA266539‡ | 162 (91 %) | 169 (94 %) | 178 (99 %) | 179 |
| PRJNA298332§ | 199 (93 %) | 207 (97 %) | 208 (97 %) | 214 |
| PRJEB14168|| | 50 (100 %) | 50 (100 %) | 50 (100 %) | 50 |
| Total | 587 (93 %) | 610 (97 %) | 620 (98 %) | 630 |
*Grad )
†Demczuk )
‡Demczuk )
§Closed reference genome NCCP11945 (Genbank accession CP001050.1) – in silico NG-MAST results reported by Demczuk.
||Local isolates with NG-MAST performed by PCR/Sanger sequencing.
¶Two-stage assembly: 1. NGMASTER run using rapid assembly with MEGAHIT; 2. NGMASTER also run using SPAdes if there was no result or a mixed result using MEGAHIT assembly.
Reasons for discordant results between NGMASTER and published data using SPAdes assemblies
| Reason for discordant result | ||
|---|---|---|
| Major errors (incorrect result) | ||
| Assembly error | 0 | 10 |
| Minor errors (incomplete/missing result) | ||
| Alternate conserved key motif | 1 | 1 |
| Multiple alleles detected | 6 | 2 |
| Allele not detected | 29 | 0 |
| Errors in published data | ||
| Possible sequence mix-up in published data | 4 | 4 |
| Probable transcription error in published data | 1 | 1 |
| Error in published data | 1 | 1 |
Fig. 1.Number and frequency of alternate starting key motifs within ‘conserved’ gene regions for trimming allele sequences.