| Literature DB >> 17054788 |
Chunguang Liang1, Thomas Dandekar.
Abstract
BACKGROUND: Systematic genome comparisons are an important tool to reveal gene functions, pathogenic features, metabolic pathways and genome evolution in the era of post-genomics. Furthermore, such comparisons provide important clues for vaccines and drug development. Existing genome comparison software often lacks accurate information on orthologs, the function of similar genes identified and genome-wide reports and lists on specific functions. All these features and further analyses are provided here in the context of a modular software tool "inGeno" written in Java with Biojava subroutines.Entities:
Mesh:
Year: 2006 PMID: 17054788 PMCID: PMC1635569 DOI: 10.1186/1471-2105-7-461
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. Genomes of Listeria monocytogenes (lower genome in the figure) and Listeria innocua (upper genome) are compared using inGeno. Orthologous genes in both genomes are colored with the same color. Linkage lines connect locus collinear blocks and indicate the degree of rearrangement between the genomes. The threshold can be adjusted by a slider in the lower-right corner of the control panel. Red blocks in each genome distinguish genes which are potential strain-specific and determined by a user-given threshold. In this comparison several strain-specific genome islands are detected, e.g., in the figure a red island beginning with lmo0200 is being investigated. It is part of the Lipi1 pathogenicity island. Clusters of green lines indicate genome rearrangement events, these can be caused e.g. by transposons. A large number of transposase genes are found and visualized in the L. monocytogenes genome.
Figure 2. The comparison between two closely related E. coli K-12 strains (W3110 and MG1655) indicates these are only slightly different, except for a highlighted large inversion. The upper genome is W3110 [24], whereas the lower genome is MG1655 [23-25]. Using inGeno, a couple of strain-specific genes are readily seen, such as TnaB (annotated as low affinity tryptophan permease; b3709 in figure) and other genes (DcuC: b0621, GatA: b2094, RcsC: b2218), which lead to different metabolic capabilities, e.g., the utilization of tryptophan as carbon source may be impaired in W3110.
Figure 3. The top genome is E. coli O157 [25], the lower is E. coli K-12 strain MG1655 [23-25]. The selected region (ECs1272-1296 and ECs1299-1409) is one of the strain-specific islands that are potentially related to bacterial pathogenicity. ECs1282 and ECs1283 are identified by inGeno as hemagglutinin/hemolysin-like protein and hemolysin activator-related protein, respectively. An operon-like structure follows these two genes. InGeno reports these encode for a holo acyl-carrier protein, an oxoacyl-(acyl-carrier protein) reductase, a hydroxydecanoyl-(acyl-carrier protein) dehydratase, an acyl-carrier protein, an aminomethyl transferase and an oxoacyl-(acyl-carrier protein)-synthase. These enzymes and proteins add to the fatty acid metabolism, additional lipids or lipoproteins may be produced by O157 in contrast to MG1655. Moreover, a series of continuous genes encoding urease components are shown for 0157 (ECs1321-1327: UreA-G). The detailed information on these proteins is summarized in Table 1.
Genes involved in the strain-specific island of E. coli O157:H7
| ECs1272 | Rtn-like protein |
| ECs1273 | FidL-like protein |
| ECs1274 | putative transcriptional regulator |
| ECs1275 | putative oxidoreductase |
| ECs1276 | putative chaperone protein |
| ECs1277 | putative outer membrane protein |
| ECs1278 | putative outer membrane usher protein |
| ECs1279 | putative chaperone protein |
| ECs1280 | putative major pilin protein |
| ECs1282 | hemagglutinin/hemolysin-related prote |
| ECs1283 | hemolysin activator-related protein |
| ECs1284 | putative holo- [acyl-carrier protein] synthase |
| ECs1285 | putative 3-oxoacyl-(acyl carrier protein) reductase |
| ECs1286 | putative (3R)-hydroxymyristol-(acyl carrier prot.) dehydratase |
| ECs1287 | putative acyl-carrier-protein |
| ECs1288 | putative aminomethyltransferase |
| ECs1289 | putative 3-oxoacyl- [acyl-carrier-protein] synthase synthase |
| ECs1321 | urease-associated protein Ure |
| ECs1322 | urease gamma subunit |
| ECs1323 | urease beta subunit |
| ECs1324 | urease alpha subunit |
| ECs1325 | urease accessory protein UreE |
| ECs1326 | urease accessory protein UreF |
| ECs1327 | urease accessory protein UreG |
| ECs1351 | putative tellurium resistance protein TerZ |
| ECs1352 | putative tellurium resistance protein TerA |
| ECs1353 | putative tellurium resistance protein TerB |
| ECs1354 | putative tellurium resistance protein TerC |
| ECs1355 | putative tellurium resistance protein TerD |
| ECs1356 | putative tellurium resistance TerE |
| ECs1358 | putative tellurium resistance protein TerF |
Genes involved in the strain-specific island (selected region in Figure 3). This table is an excerpt from an inGeno report. All genes belong to a selected strain-specific island of the E. coli O157:H7 genome (location: 1337361–1456555). The annotations marked "unknown", "hypothetic protein" are also included in the inGeno report but not shown for this Table to save space. Similarly, there are a number of "transposases" detected by inGeno, they are excluded from the Table for the same reason.