| Literature DB >> 16299586 |
William W L Hsiao1, Korine Ung, Dana Aeschliman, Jenny Bryan, B Brett Finlay, Fiona S L Brinkman.
Abstract
Microbial genes that are "novel" (no detectable homologs in other species) have become of increasing interest as environmental sampling suggests that there are many more such novel genes in yet-to-be-cultured microorganisms. By analyzing known microbial genomic islands and prophages, we developed criteria for systematic identification of putative genomic islands (clusters of genes of probable horizontal origin in a prokaryotic genome) in 63 prokaryotic genomes, and then characterized the distribution of novel genes and other features. All but a few of the genomes examined contained significantly higher proportions of novel genes in their predicted genomic islands compared with the rest of their genome (Paired t test = 4.43E-14 to 1.27E-18, depending on method). Moreover, the reverse observation (i.e., higher proportions of novel genes outside of islands) never reached statistical significance in any organism examined. We show that this higher proportion of novel genes in predicted genomic islands is not due to less accurate gene prediction in genomic island regions, but likely reflects a genuine increase in novel genes in these regions for both bacteria and archaea. This represents the first comprehensive analysis of novel genes in prokaryotic genomic islands and provides clues regarding the origin of novel genes. Our collective results imply that there are different gene pools associated with recently horizontally transmitted genomic regions versus regions that are primarily vertically inherited. Moreover, there are more novel genes within the gene pool associated with genomic islands. Since genomic islands are frequently associated with a particular microbial adaptation, such as antibiotic resistance, pathogen virulence, or metal resistance, this suggests that microbes may have access to a larger "arsenal" of novel genes for adaptation than previously thought.Entities:
Mesh:
Year: 2005 PMID: 16299586 PMCID: PMC1285063 DOI: 10.1371/journal.pgen.0010062
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
List of GI-Associated Features and the Number and Percentage of Islands Meeting Each Criterion
Summary of Organisms without Genomic Islands Based on the DIMOB Criterion
Figure 1Proportion of Novel Genes in Genomic Islands (Red Bars) versus the Rest of the Genome (Blue Bars) according to a COG-Based Analysis
Proportions of novel genes are calculated as a percentage of all genes within islands or outside of islands, respectively, for each genome (listed on the x axis). A paired t test indicates that significantly more genes in islands versus non-islands do not have a COG classification (p = 1.20E-18). This phenomenon is uniform across prokaryotic lineages and domains. Similar results are also observed if different datasets are analyzed, or different methods for identifying novel genes are used (Table 3).
Summary of p-Values Using Different Datasets and Methods
Figure 2Proportion of Novel Genes in Genomic Islands (Red Bars) versus the Rest of the Genome (Blue Bars) according to a SUPERFAMILY-Based Analysis
Proportions of novel genes are calculated as a percentage of all genes within islands or outside of islands, respectively, for each genome (listed on the x axis). A paired t test indicates that significantly higher proportions of genes in islands (red bars) versus outside islands (non-islands; purple bars) do not have a SUPERFAMILY prediction (potential novel genes; p = 4.43E-14).
Proportions of Novel Genes with BLAST Hits in the Phage and Plasmid Database at Expect Value Cutoff of 1E-5
Number of Organisms Distributed by Proportion of Novel Genes and Statistical Test Significance
Distribution of Predicted Subcellular Localization of Proteins in Islands Compared to Outside of Islands