| Literature DB >> 35035886 |
Granger Sutton1, Gary B Fogel2, Bradley Abramson3, Lauren Brinkac4, Todd Michael3, Enoch S Liu2, Sterling Thomas4.
Abstract
Background: Wall teichoic acid (WTA) genes are essential for production of cell walls in gram-positive bacteria and necessary for survival and variability in the cassette has led to recent antibiotic resistance acquisition in pathogenic bacteria.Entities:
Keywords: Bacillus subtilis; core genes; pan-genome; pan-genome graph; wall teichoic acids
Mesh:
Substances:
Year: 2021 PMID: 35035886 PMCID: PMC8753576 DOI: 10.12688/f1000research.51874.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
The protein level orthologous OGCs within the WTA cassettes.
Column 1 is the gene name/symbol. Column 2 is the set of OGCs determined to be orthologs at the protein level. Column 3 is the number of the 108 strains in the PGG which contain one of the protein level orthologs. Column 4 is OGC medoid sequence RefSeq annotation for one of the protein level orthologs.
| Gene | OGCs | Summed OGC Size | Annotation |
|---|---|---|---|
|
| 3713, 4723 | 108 | medoid_4723 Q433_RS17940 polyisoprenyl-teichoic acid-peptidoglycan teichoic acid transferase TagV |
|
| 3723, 4724 | 108 | medoid_4724 OB04_RS18145 glycosyltransferase family 2 protein |
|
| 3724, 4725 | 108 | medoid_4725 C7M23_RS06445 Teichuronic acid biosynthesis protein tuaF |
|
| 3725, 4726 | 108 | medoid_4726 Bateq7PJ16_RS19495 teichuronic acid biosynthesis protein TuaE |
|
| 3726, 4727 | 108 | medoid_4727 BKN48_RS07140 UDP-glucose 6-dehydrogenase TuaD |
|
| 3727, 4728 | 108 | medoid_4728 C7M27_RS00270 glycosyltransferase family 4 protein |
|
| 3728, 4729 | 108 | medoid_4729 BEST7003_RS17430 MOP flippase family protein |
|
| 3729, 4730 | 108 | medoid_4730 C7M26_RS17205 sugar transferase |
|
| 3731, 4731, 5419 | 101 | medoid_4731 BEST7003_RS17440 N-acetylmuramoyl-L-alanine amidase LytC |
|
| 3732, 4732, 5420 | 98 | medoid_4732 BEST7003_RS17445 SpoIID/LytB domain-containing protein |
|
| 3733, 4733, 5421 | 101 | medoid_4733 BEST7003_RS17450 membrane-bound protein LytA |
|
| 4734, 8738 | 85 | medoid_4734 EQZ01_RS18735 transcription antiterminator LytR |
|
| 3737, 4735, 8737 | 108 | medoid_4735 BSK2_RS18135 UDP-N-acetylglucosamine 2-epimerase (non-hydrolyzing) |
|
| 3738, 4736 | 108 | medoid_4736 CD007_RS18080 UTP--glucose-1-phosphate uridylyltransferase GalU |
|
| 4738, 5996, 7395, 8734 | 69 | medoid_4738 BEST7003_RS17470 poly (glucosyl N-acetylgalactosamine 1-phosphate) glucosyltransferase |
|
| 4739, 5429, 5997, 7091, 7396, 8733 | 76 | medoid_4739 BEST7003_RS17475 glycosyltransferase family 2 protein |
|
| 3740, 4744, 8735 | 108 | medoid_4744 C7M17_RS18530 teichoic acids export ABC transporter ATP-binding subunit TagH |
|
| 3741, 4745, 5425, 7157 | 108 | medoid_4745 BEST7003_RS17490 teichoic acids export ABC transporter permease subunit TagG |
|
| 3745, 4746, 5430, 5998, 7397, 8732 | 87 | medoid_4746 BEST7003_RS17495 teichoic acid poly (glycerol phosphate) polymerase |
|
| 3746, 4748, 5431, 6915, 7624, 8731 | 107 | medoid_4748 BEST7003_RS17505 glycerol-3-phosphate cytidylyltransferase |
|
| 3747, 4749, 5432, 6918, 8730 | 107 | medoid_4749 BEST7003_RS17510 N-acetylglucosaminyldiphosphoundecaprenol N-acetyl-beta-D- mannosaminyltransferase |
|
| 3748, 4750, 5433, 6919, 8729 | 107 | medoid_4750 BEST7003_RS17515 teichoic acid glycerol-phosphate primase |
|
| 3751, 4752, 5718 | 108 | medoid_4752 CAH07_RS02865 spore germination protein |
|
| 3752, 4753 | 108 | medoid_4753 BEST7003_RS17540 spore germination protein GerBA |
|
| 3755, 4754, 5438 | 108 | medoid_4754 BEST7003_RS17555 polyisoprenyl-teichoic acid--peptidoglycan teichoic acid transferase TagT |
|
| 5418, 6914 | 45 | medoid_5418 ETL58_RS18550 (poly)ribitol-phosphate teichoic acid beta-D-glucosyltransferase |
| 5423, 7623 | 51 | medoid_5423 CAH07_RS02960 glycosyltransferase |
Figure 1. WTA 144 OGC gene content tree based on complete linkage hierarchical clustering of pairwise Jaccard distance of the 144 OGCs in the WTA cassettes.
We distinguished seven clades. Clade I (blue) is the ribitol WTA consistent with that found in strain W23. Clade II (purple) and clade III (orange) appear to also be ribitol WTA based on the presence of tarI (OGC 5434), tarJ (OGC 5435), tarK (OGC 5436) and tarL (OGC 5437). Clade IV (yellow) may be a ribitol WTA based on a shorter tagF gene. Clade V (red) may be a glycerol WTA based on a longer tagF gene. Clade VI is the type strain glycerol WTA. Clade VII (teal) is missing many key WTA genes ( tag/tarBADFG) so it is unclear how the one strain in that clade is constructing WTAs. The 10 medoid strains used in Figure 3 are in bold.
Figure 2. The ANI tree for the 108 strains of B. subtilis ssp.
in the pan-genome based on complete linkage hierarchical clustering of pairwise genome ANI distances (100 – ANI). The colors indicate which of the seven WTA gene content clades a strain is in from Figure 1. The 10 medoid strains used in Figure 3 are in bold.
Figure 3. Linear comparison of the WTA cassette of 10 medoid strains representing each of the seven WTA clades (I-VII).
Arrows indicate individual WTA genes drawn to scale with order and orientation maintained. The coordinates for the WTA cassettes in SAMN08707592 and SAMN08707595 which are located on the opposite strand, were reversed for rendering. Genes between strains belonging to the same OGC are joined vertically by correponding colored lines.
OGC subpatterns for the WTA cassettes across clades I-VII.
The OGC subpatterns show some limited recombination within the WTA cassettes but most recombination seems limited to the entire cassette. Column 1 is the region between core OGCs within the WTA cassette. Column 2 is an OGC subpattern. Columns 3-9 indicate the number of strains within a clade that has the given OGC subpattern for that row. The rows are ordered relative to their order in the WTA cassette from core OGC 3712 to core OGC 3756.
| Region | OGC SubPattern | Clade I (n = 43) | Clade II (n = 2) | Clade III (n = 3) | Clade IV (n = 1) | Clade V (n = 23) | Clade VI (n = 35) | Clade VII (n = 1) |
|---|---|---|---|---|---|---|---|---|
| 3712-3721 | (3713-3718,9158,3719-3720) | 22 | ||||||
| 3712-3721 | 4723 | 43 | 2 | 3 | 1 | 1 | 35 | 1 |
| 3722-3749 | (3723-3725,9159,3726-3729) | 23 | ||||||
| 3722-3749 | (3730-3735,9341,3736) | 5 | ||||||
| 3722-3749 | (3730-3735) | 1 | ||||||
| 3722-3749 | (3731-3734) | 10 | ||||||
| 3722-3749 | (3731-3734,9341) | 1 | ||||||
| 3722-3749 | (3736,9341) | 5 | ||||||
| 3722-3749 | (3730-3733,9495) | 1 | ||||||
| 3722-3749 | (3737-3748) | 23 | ||||||
| 3722-3749 | (7961) | 1 | ||||||
| 3722-3749 | (4724-4730) | 43 | 2 | 3 | 1 | 35 | 1 | |
| 3722-3749 | (4731-4736) | 35 | ||||||
| 3722-3749 | (10139,4737-4745) | 24 | ||||||
| 3722-3749 | (4744-4745) | 3 | ||||||
| 3722-3749 | (10139,4737,5994-5996,7091-7093,4743-4745) | 1 | ||||||
| 3722-3749 | (10139,5995-5996,7091-7093,4743-4745) | 7 | ||||||
| 3722-3749 | (5418) | 43 | ||||||
| 3722-3749 | (6914) | 2 | ||||||
| 3722-3749 | (5419-5421,4734-4735,5422-5424,4736,4744,5425) | 43 | 2 | |||||
| 3722-3749 | (4731,7622,4733-4735,5423,7623,4736,4744,5425) | 3 | ||||||
| 3722-3749 | (8738-8737,4736,8736,7961,8735,5425) | 1 | ||||||
| 3722-3749 | (4734-4736,7156,4744,7157) | 1 | ||||||
| 3722-3749 | (4738-4741) | 15 | ||||||
| 3722-3749 | (5426-5430) | 7 | ||||||
| 3722-3749 | (5994-5998) | 20 | ||||||
| 3722-3749 | (7395-7397) | 1 | ||||||
| 3722-3749 | (5431) | 43 | ||||||
| 3722-3749 | (7624) | 3 | ||||||
| 3722-3749 | (5432-5433) | 43 | 2 | |||||
| 3722-3749 | (6915-6919) | 2 | ||||||
| 3722-3749 | (5434-5437) | 43 | 2 | 3 | ||||
| 3722-3749 | (8734-8729) | 1 | ||||||
| 3722-3749 | (7625-7631) | 3 | 1 | |||||
| 3722-3749 | (4746-4751) | 35 | ||||||
| 3750-3753 | (3751-3752) | 6 | 2 | 3 | 1 | 13 | 2 | |
| 3750-3753 | (4752-4753) | 37 | 10 | 32 | 1 | |||
| 3750-3753 | (5718,3752) | 1 | ||||||
| 3754-3756 | (3755) | 1 | 1 | 23 | ||||
| 3754-3756 | (4754) | 37 | 1 | 2 | 35 | 1 | ||
| 3754-3756 | (5438) | 6 | 1 |
Figure 4. RAxML maximum likelihood tree for trimmed alignment of all 108 WTA cassettes.
Midpoint rooted and branchlengths ignored. Strains are color coded by WTA clade membership (I-VII). Numbers at nodes represent bootstrap support. The 10 medoid strains used in Figure 3 are in bold.