| Literature DB >> 35745468 |
Hanka Brangsch1, Muhammad Saqib2, Awais Ur Rehman Sial3, Falk Melzer1, Jörg Linde1, Mandy Carolina Elschner1.
Abstract
Burkholderia (B.) mallei is a host-adapted equine pathogen that causes glanders, a re-emerging zoonotic disease, which is endemic in Pakistan and other developing countries and seriously impacts the global equine movement. Due to globalization, the geographical restriction of diseases vanishes and the lack of awareness of and experience with eradicated diseases in industrialized countries also promotes the re-introduction of infections in these regions. Owing to the high equine population, the Pakistani province Punjab is a potential hotspot where several glanders outbreaks have been seen over last two decades. For determining the genomic diversity of B. mallei in this and other equine-populated prefectures, the genomes of 19 B. mallei strains isolated between 1999 and 2020 in different locations were sequenced and their genotypes were determined. Particularly, for genetically highly homogenous pathogens like B. mallei genotyping techniques require a high discriminatory power for enabling differentiation on the strain level. Thus, core-genome single nucleotide polymorphism (cgSNP) analysis was applied for distinguishing the highly similar strains. Furthermore, a whole-genome sequence-based core genome multi locus sequence typing (cgMLST) scheme, specific to B. mallei, was developed and additionally applied to the data. It was found that B. mallei genotypes in Pakistan persisted over time and space and genotype clusters preferred connection with a time point rather than the place of isolation, probably due to frequent equine movement, which promotes the spread of glanders. The cgMLST approach proved to work in accord with SNP typing and may help to investigate future glanders outbreaks.Entities:
Keywords: Burkholderia mallei; Pakistan; SNP typing; WGS; cgMLST scheme; genotyping; glanders
Year: 2022 PMID: 35745468 PMCID: PMC9227068 DOI: 10.3390/pathogens11060614
Source DB: PubMed Journal: Pathogens ISSN: 2076-0817
Metadata of B. mallei strains isolated in Pakistan from 1999 until 2020.
| Strain | Year | Source | Host | Population | Region | Purpose |
|---|---|---|---|---|---|---|
| Pak2018H3 | 2018 | Blood | Horse | Private farm | Islamabad | Polo |
| Pak2018M4 | 2018 | Pus | Mule | Sample received for confirmation | Azad Jammu and Kashmir | Draught |
| Pak2019H6 | 2019 | Pus | Horse | Private owner having total 28 imported polo ponies | Islamabad | Polo |
| Pak2017H7 | 2017 | Blood | Horse | Private | Islamabad | Polo |
| Pak2020M8 | 2020 | Blood | Mule | For hauling | Faisalabad | Draught |
| Pak2019H9 | 2019 | Blood | Horse | Owner has 40 polo ponies | Lahore | Polo |
| Pak2018H10 | 2018 | Blood | Horse | Cart horse | Faisalabad | Draught |
| Pak2020M11 | 2020 | Blood | Mule | For hauling | Faisalabad | Draught |
| PRL1 | 2002 | Pus | Donkey | For hauling | Faisalabad | Draught |
| PRL2 | 1999 | Nasal swab | Horse | Police service | Faisalabad | Mounted |
| PRL3 | 2005 | Pus | Horse | Private | Sargodha | Farm |
| PRL4 | 2005 | Pus | Horse | Private | Sargodha | Farm |
| PRL7 | 2000 | Pus | Horse | For hauling | Faisalabad | Draught |
| PRL11 | 1999 | Pus | Horse | Police service | Faisalabad | Mounted |
| PRL34 | 2007 | Nasal swab | Donkey | Work in brick factory | Faisalabad | Draught |
| PRL41 | 2006 | Pus | Mule | For hauling | Faisalabad | Draught |
| PRL42 | 2007 | Pus | Mule | For hauling | Faisalabad | Draught |
| PRL43 | NA | NA | NA | NA | NA | NA |
| PRL44 | 2007 | Nasal swab | Mule | Private | Sargodha | Farm |
Assembly quality data of the investigated Pakistani strains, which were sequenced using the Illumina short-read technique.
| Strain | Coverage | Bases | Contigs | GC (%) | L50 | N50 | GF * (%) | CDS |
|---|---|---|---|---|---|---|---|---|
| Pak2018H3 | 100 | 5,526,644 | 295 | 68.68 | 44 | 43,202 | 92.86 | 4614 |
| Pak2018M4 | 118 | 5,526,233 | 269 | 68.68 | 41 | 46,366 | 92.95 | 4615 |
| Pak2019H6 | 79 | 5,526,261 | 262 | 68.69 | 40 | 46,511 | 92.98 | 4631 |
| Pak2017H7 | 86 | 5,528,440 | 272 | 68.69 | 42 | 46,377 | 92.97 | 4623 |
| Pak2020M8 | 98 | 5,593,509 | 284 | 68.22 | 41 | 46,937 | 92.64 | 4639 |
| Pak2019H9 | 68 | 5,305,987 | 266 | 68.60 | 40 | 44,187 | 89.23 | 4442 |
| Pak2018H10 | 75 | 5,536,192 | 307 | 68.65 | 45 | 43,210 | 92.91 | 4630 |
| Pak2020M11 | 127 | 5,530,694 | 379 | 68.59 | 61 | 30,209 | 92.59 | 4667 |
| PRL1 | 121 | 5,523,415 | 294 | 68.66 | 43 | 43,660 | 92.95 | 4610 |
| PRL2 | 112 | 5,512,370 | 302 | 68.67 | 43 | 42,887 | 92.54 | 4595 |
| PRL3 | 116 | 5,599,466 | 279 | 68.69 | 40 | 46,978 | 92.50 | 4680 |
| PRL4 | 120 | 5,517,077 | 287 | 68.68 | 41 | 43,747 | 92.54 | 4605 |
| PRL7 | 114 | 5,282,618 | 281 | 68.58 | 41 | 43,782 | 88.62 | 4432 |
| PRL11 | 124 | 5,509,016 | 290 | 68.68 | 41 | 45,145 | 92.56 | 4600 |
| PRL34 | 90 | 5,559,549 | 287 | 68.73 | 42 | 45,246 | 92.51 | 4623 |
| PRL41 | 82 | 5,589,007 | 272 | 68.71 | 39 | 46,976 | 92.87 | 4666 |
| PRL42 | 78 | 5,575,591 | 270 | 68.70 | 38 | 48,213 | 93.81 | 4652 |
| PRL43 | 82 | 5,579,744 | 271 | 68.69 | 41 | 46,808 | 93.82 | 4658 |
| PRL44 | 171 | 5,527,185 | 294 | 68.69 | 42 | 43,055 | 92.95 | 4613 |
* Genome fraction covering reference genome ATCC 23344.
Assembly quality data of hybrid assemblies using Illumina short-read in conjunction with ONT long-read techniques.
| Strain | Bases | Contigs | L50 | N50 | GF * (%) | CDS |
|---|---|---|---|---|---|---|
| 34 | 5,647,473 | 1 | 1 | 5,647,473 | 94.62 | 4812 |
| Mukteswar | 5,760,320 | 11 | 1 | 3,539,038 | 96.27 | 4909 |
| BfR 242 | 5,375,480 | 18 | 1 | 3,503,053 | 90.00 | 4632 |
| NCTC 120 | 5,401,604 | 19 | 1 | 4,027,971 | 89.47 | 4668 |
* Genome fraction covering reference genome ATCC 23344.
Figure 1Maximum-likelihood tree, generated based on cgSNPs called by Snippy. Strains from Pakistan are printed in bold with the district of isolation given after the name (LHR—Lahore; FSD—Faisalabad; AZK—Azad Jammu and Kashmir; ISB—Islamabad; SGI—Sargodha) and the year of isolation indicated by color. For non-Pakistani strains, the country of isolation is given. The bar indicates base substitutions per site. Clusters formed by Pakistani strains are denoted by Roman numerals. Numbers in red represent bootstrap support values.
Figure 2Comparison between trees generated by different approaches: (A) Neighbor-Joining tree based on cgMLST allelic profiles using 2838 target genes; (B) Maximum-likelihood tree based on cgSNP alignment generated by Snippy using Illumina read data; (C) Approximately Maximum-likelihood tree based on cgSNP alignment generated with Parsnp using genome assemblies. For convenience, clusters or singletons of strains that showed in the trees were colored identically. Bars indicate allelic changes (A) or base substitutions per site (B,C). Bootstrap values for (B,C) can be found in Figure S3.
Figure 3Minimum-spanning tree based on cgMLST allelic profile differences. The Pakistani strains are coloured according to the place of isolation with the year of isolation given. Circles with dotted lines represent strains isolated in India. Numbers on the branches indicate the number of allelic differences. The empty, solid line circle represents strain PRL43, for which no metadata was available.