| Literature DB >> 20204180 |
William A Bryant1, Preben Krabben, Frank Baganz, Yuhong Zhou, John M Ward.
Abstract
A survey of a complete gene synteny comparison has been carried out between twenty fully sequenced strains from the genus Escherichia with the aim of finding yet uncharacterised genes implicated in the metabolism of uropathogenic strains of E. coli (UPEC). Several sets of adjacent colinear genes have been identified which are present in all four UPEC included in this study (CFT073, F11, UTI89, and 536), annotated with putative metabolic functions, but are not found in any other strains considered. An operon closely homologous to that encoding the L-sorbose degradation pathway in Klebsiella pneumoniae has been identified in E. coli CFT073; this operon is present in all of the UPEC considered, but only in 7 of the other 16 strains. The operon's function has been confirmed by cloning the genes into E. coli DH5alpha and testing for growth on L-sorbose. The functional genomic approach combining in silico and in vitro work presented here can be used as a basis for the discovery of other uncharacterised genes contributing to bacterial survival in specific environments.Entities:
Year: 2010 PMID: 20204180 PMCID: PMC2831201 DOI: 10.1155/2009/782924
Source DB: PubMed Journal: Comp Funct Genomics ISSN: 1531-6912
Strains of genus Escherichia used in this study, all with completely sequenced genomes or whole genome shotgun sequences freely available from GenBank. Unless otherwise indicated, they are Escherichia coli.
| Strain | Type | Sorbose operon | Source/Accession number |
|---|---|---|---|
| CFT073 | UPEC (uropathogenic) | + | AE014075 |
| F11 | UPEC | + | AAJU00000000 |
| 536 | UPEC | + | CP000247 |
| UTI89 | UPEC | + | CP000243 |
|
| |||
| 042 | EAEC (enteroaggregative) | − | Sanger Center |
| B7A | ETEC (enterotoxigenic) | − | AAJT00000000 |
| E24377A | ETEC | − | CP000800 |
|
| |||
| B171 | EPEC (enteropathogenic) | − | AAJX00000000 |
| E22 | EPEC | − | AAJV00000000 |
| E2348 | EPEC | − | Sanger Center |
| E110019 | EPEC | − | AAJW00000000 |
|
| |||
| 53638 | EIEC (enteroinvasive) | − | AAKB00000000 |
| MG1655 | Commensal (Gastrointestinal tract) | − | U00096 |
| HS | Commensal (Gastrointestinal tract) | − | CP000802 |
|
| |||
| SMS-3-5 | Environmental | + | CP000970 |
| O157:H7 str. Sakai | EHEC (enterohaemorrhagic) | + | BA000007 |
| O157:H7 EDL933 | EHEC | + | AE005174 |
|
| |||
|
| Bacillary Dysentery | + | Sanger Center |
|
| Bacillary Dysentery | + | AE005674 |
|
| Bacillary Dysentery | + | CP000034 |
Figure 1Structure of the pSC-B plasmid with the putative L-sorbose operon insert from CFT073.
Synteny conservation of sets of adjacent colinear genes in 18 sequenced strains of genus Escherichia; all these sets of genes are present in E. coli CFT073. Those genes which are part of genomic islands, as identified in [18], are marked with an asterisk.
| SAC no. | Gene c numbers | No. of genes | Putative function |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | c0317 c0323 | 7 | Polysaccharide biosynthesis* | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 2 | c0330 c0333 | 4 | Fucose metabolism | + | + | − | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 3 | c0409 c0415 | 7 | 2,5-diketo-D-gluconic acid metabolism | + | + | + | + | + | − | − | − | − | − | − | − | − | − | − | − | + | − | − |
| 4 | c0757 c0765 | 9 | Chorismate biosynthesis | + | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 5 | c1955 c1960 | 6 | PTS system, cellobiose specific | − | − | + | − | − | + | + | + | + | + | + | − | + | − | − | − | + | − | − |
| 6 | c3405 c3410 | 6 | PTS system, maltose/glucose specific* | + | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | + | − | − |
| 7 | c3750 c3756 | 7 | 5- or 6-carbon sugar metabolism | + | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | + | − | − |
| 8 | c4013 c4018 | 6 | Carbohydrate metabolism | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 9 | c4276 c4280 | 5 | PTS system, galactitol specific | − | − | + | − | + | + | − | + | − | − | + | − | − | − | + | + | + | − | − |
| 10 | c4481 c4488 | 8 | PTS system, fructose specific | + | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 11 | c4756 c4759 | 4 | PTS system, glucose specific | + | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 12 | c4760 c4780 | 21 | Entner-Doudoroff pathway | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 13 | c4828 c4830 | 3 | Shikimate metabolism | + | + | + | + | − | − | − | − | − | + | − | + | + | − | − | − | − | − | − |
| 14 | c4924 c4926 | 3 | Citrate metabolism | + | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 15 | c4981 c4987 | 7 | L-sorbose metabolism | + | + | + | + | + | − | − | − | − | − | − | + | − | − | + | + | + | + | − |
| 16 | c5020 c5025 | 6 | 3-ketoacid metabolism | − | − | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 17 | c5030 c5041 | 12 | 2-oxoglutarate metabolism | + | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
| 18 | c5298 c5303 | 6 | 3-ketoacid metabolism | + | + | + | + | − | + | + | + | + | + | + | − | + | − | − | − | + | − | − |
| 19 | c5346 c5351 | 5 | Arginine metabolism | + | + | + | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − | − |
Figure 2Positions of the SACs identified in the genome sequence of E. coli CFT073, as labelled in Table 2.
Comparison of SAC No. 15 in CFT073 to the Klebsiella pneumoniae L-sorbose degradation operon.
| CFT073 | CFT073 name |
|
| Protein |
|---|---|---|---|---|
| gene ID | Locus | Similarity (%) | ||
| c4981 | Putative oxidoreductase | L-sorbose 1-phosphate reductase |
| 91 |
| c4982 | PTS system, mannose-specific | second subunit of EII-Sor |
| 97 |
| IID component | ||||
| c4983 | PTS system, mannose-specific | first subunit of EII-Sor |
| 95 |
| IIC component | ||||
|
| ||||
| c4984 | Putative sorbose PTS component | EIII-B Sor PTS |
| 92 |
| c4985 | Putative sorbose PTS component | EIII-F Sor PTS |
| 82 |
| c4986 | sorbitol-6-phosphate 2-dehydrogenase | D-glucitol-6-P-Dehydrogenase |
| 92 |
|
| ||||
| c4987 | Putative transcriptional regulator | Sor regulator |
| 92 |
| of sorbose uptake and utilization genes | ||||
Figure 3Phylogenetic tree of L-sorbose operons (both confirmed and putative) in the genus Escherichia and in two Klebsiella strains. The scale is in units of substitutions per site. Unless otherwise stated, the strain is Escherichia coli.
Figure 4Growth curves for DH5α containing plasmid pQR793, compared to CFT073 and DH5α with an empty pUC-19 plasmid. DH5α with pQR793 is represented by ∘ and •, DH5α with pUC19 by Δ and ▴ and CFT073 by □ and ▪ where empty symbols represent growth on glucose and filled symbols represent growth on L-sorbose. Where duplicate samples were taken, readings varied by less than 0.01 OD600 units using a CO8000 Cell Density Meter (WPA).