| Literature DB >> 32118530 |
Kelly L Wyres1, Sarah M Cahill2, Kathryn E Holt3,1, Ruth M Hall4, Johanna J Kenyon2.
Abstract
Multiply antibiotic-resistant Acinetobacter baumannii infections are a global public health concern and accurate tracking of the spread of specific lineages is needed. Variation in the composition and structure of capsular polysaccharide (CPS), a critical determinant of virulence and phage susceptibility, makes it an attractive epidemiological marker. The outer core (OC) of lipooligosaccharide also exhibits variation. To take better advantage of the untapped information available in whole genome sequences, we have created a curated reference database of 92 publicly available gene clusters at the locus encoding proteins responsible for biosynthesis and export of CPS (K locus), and a second database for 12 gene clusters at the locus for outer core biosynthesis (OC locus). Each entry has been assigned a unique KL or OCL number, and is fully annotated using a simple, transparent and standardized nomenclature. These databases are compatible with Kaptive, a tool for in silico typing of bacterial surface polysaccharide loci, and their utility was validated using (a) >630 assembled A. baumannii draft genomes for which the KL and OCL regions had been previously typed manually, and (b) 3386 A. baumannii genome assemblies downloaded from NCBI. Among the previously typed genomes, Kaptive was able to confidently assign KL and OCL types with 100 % accuracy. Among the genomes retrieved from NCBI, Kaptive detected known KL and OCL in 87 and 90 % of genomes, respectively, indicating that the majority of common KL and OCL types are captured within the databases; 13 of the 92 KL in the database were not detected in any publicly available whole genome assembly. The failure to assign a KL or OCL type may indicate incomplete or poor-quality genomes. However, further novel variants may remain to be documented. Combining outputs with multilocus sequence typing (Institut Pasteur scheme) revealed multiple KL and OCL types in collections of a single sequence type (ST) representing each of the two predominant globally distributed clones, ST1 of GC1 and ST2 of GC2, and in collections of other clones comprising >20 isolates each (ST10, ST25, and ST140), indicating extensive within-clone replacement of these loci. The databases are available at https://github.com/katholt/Kaptive and will be updated as further locus types become available.Entities:
Keywords: Acinetobacter baumannii; K locus; Kaptive; OC locus; capsular polysaccharide; outer-core oligosaccharide
Mesh:
Substances:
Year: 2020 PMID: 32118530 PMCID: PMC7200062 DOI: 10.1099/mgen.0.000339
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.General arrangement of the surface polysaccharide synthesis loci in . KL and OCL boundaries are shown and flanking locus genes are coloured grey. Variable sequence portions are indicated by white boxes, and conserved genes at each locus are represented by coloured arrows. (a) Organization of the KL with marked regions defining the roles of common modules. CPS export genes are in orange, and genes in dark blue are involved in the synthesis of common sugar substrates. gne1 is not always present but is often critical to the synthesis of many CPS structures. (b) Organization of the two groups (A and B) of the OC locus with marked regions defining conserved or variable portions. Genes in green encode conserved glycosyltransferases and genes in light blue are those involved in complex sugar synthesis.
Fig. 2.Closely related capsule biosynthesis gene clusters demonstrating cases of small genetic replacements. Genes are represented by arrows orientated in the direction of transcription that are coloured according to the scheme shown below. Shading between gene clusters indicates regions of >95 % nucleotide sequence similarity (dark grey) or 90–95 % nucleotide sequence similarity (light grey). Figure drawn to scale using the GenBank accession numbers listed in Table S1. (a) KL63 and KL108 gene clusters differing in wzy sequence. (b) KL1 and KL107 are examples of gne1 presence vs. absence. (c) KL13, KL73, KL12 and KL110 are examples of several closely related gene clusters with small sequence replacements altering the synthesis pathway of a complex sugar substrate, or topology of the CPS structure.
Gene nomenclature key for K and OC loci
|
Gene name |
Predicted reaction product |
Predicted protein |
|---|---|---|
|
| ||
|
|
CMP- |
Multiple |
|
|
– |
|
|
|
– |
|
|
|
UDP-2,3- |
Multiple |
|
|
UDP-2,3-diacetamido-2,3-dideoxy- |
2-epimerase |
|
|
CMP-8- |
Multiple |
|
|
dTDP- |
Multiple |
|
|
dTDP- |
Multiple |
|
|
UDP- |
UDP-6-deoxy-4-keto- |
|
|
UDP- |
UTP-glucose-1-phosphate uridylyltransferase |
|
|
UDP-4-keto-6-deoxy- |
UDP- |
|
|
UDP- |
UDP- |
|
|
UDP- |
UDP- |
|
|
UDP- |
UDP- |
|
|
|
|
|
|
– |
|
|
|
– |
|
|
|
CMP- |
Multiple |
|
|
GDP- |
Multiple |
|
|
UDP- |
Multiple |
|
|
CMP- |
Multiple |
|
|
– |
|
|
|
|
Phosphoglucomutase |
|
|
– |
|
|
|
CMP- |
Multiple |
|
|
– |
|
|
|
dTDP- |
Multiple |
|
|
UDP- |
Multiple |
|
|
UDP- |
UDP-6-deoxy-4-keto- |
|
|
dTDP- |
Multiple |
|
|
dTDP-6-deoxy- |
dTDP- |
|
|
UDP- |
|
|
|
dTDP-4-acetamido-4,6-dideoxy- |
Multiple |
|
|
– |
Outer membrane protein |
|
|
– |
Protein tyrosine phosphatase |
|
|
– |
Protein tyrosine kinase |
|
|
– |
Repeat unit translocase |
|
|
– |
Repeat unit polymerase |
|
| ||
|
|
– |
Predicted |
|
|
– |
|
|
|
UDP- |
|
|
|
– |
Pyruvyl transferase (outer core) |
|
|
UDP- |
UDP- |
Fig. 3.Breakdown of confidence levels for Kaptive locus calls using the KL and OCL databases. (a) Results following database quality checking using a private collection of 680 genome assemblies (Dataset 2). Colour key is shown in the top right corner. (b) Results of applying the databases to 3412 genome assemblies available in NCBI databases (Dataset 3). Colour key is shown in the top right corner.
Fig. 4.Distribution of K and OC loci by sequence type. Heat maps show the distribution of distinct K (a) and OC (b) loci among genomes assigned to nine common multilocus sequence types (STs). Coloured shading indicates the percentage of isolates belonging to a given ST that were assigned a given KL or OCL type, as indicated by the colour legend. genome assemblies were retrieved from the NCBI database; only confirmed for which both K and OC loci were assigned by Kaptive with confidence level ‘good’ or better are shown (n=2002; 125 ST1, 1669 ST2, 46 ST10, 20 ST16, 43 ST25, 28 ST78, 22 ST406, 29 ST499, 20 ST636).