| Literature DB >> 18404206 |
Mark W J van Passel1, Pradeep Reddy Marri, Howard Ochman.
Abstract
Bacterial species, and even strains within species, can vary greatly in their gene contents and metabolic capabilities. We examine the evolution of this diversity by assessing the distribution and ancestry of each gene in 13 sequenced isolates of Escherichia coli and Shigella. We focus on the emergence and demise of two specific classes of genes, ORFans (genes with no homologs in present databases) and HOPs (genes with distant homologs), since these genes, in contrast to most conserved ancestral sequences, are known to be a major source of the novel features in each strain. We find that the rates of gain and loss of these genes vary greatly among strains as well as through time, and that ORFans and HOPs show very different behavior with respect to their emergence and demise. Although HOPs, which mostly represent gene acquisitions from other bacteria, originate more frequently, ORFans are much more likely to persist. This difference suggests that many adaptive traits are conferred by completely novel genes that do not originate in other bacterial genomes. With respect to the demise of these acquired genes, we find that strains of Shigella lose genes, both by disruption events and by complete removal, at accelerated rates.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18404206 PMCID: PMC2275313 DOI: 10.1371/journal.pcbi.1000059
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Phylogeny of E. coli and Shigella strains showing the numbers of ORFan genes (red) and HOP genes (blue) gained on each branch.
Yellow-shaded ovals contain the letter abbreviations used to refer to ancestral lineages and clades. The clades S4 and U1 have bootstrap values of 95%, while the rest have a bootstrap value of 100%.
Amounts of DNA gained and lost by strains of E. coli and Shigella since their last common ancestor.
| DNA Gained (kb) | DNA Lost (kb) | ||||
| ORFans | HOPs | ORFans | HOPs | Net Gain | |
|
| 99 | 232 | 44 | 126 | 161 |
|
| 106 | 237 | 52 | 135 | 156 |
|
| 111 | 218 | 59 | 116 | 154 |
|
| 105 | 211 | 57 | 108 | 152 |
|
| 90 | 201 | 47 | 95 | 149 |
|
| 129 | 205 | 68 | 112 | 154 |
|
| 72 | 152 | 27 | 52 | 146 |
|
| 72 | 152 | 27 | 52 | 146 |
|
| 129 | 205 | 15 | 37 | 282 |
|
| 130 | 225 | 13 | 45 | 297 |
|
| 82 | 141 | 16 | 16 | 191 |
|
| 106 | 135 | 25 | 18 | 199 |
|
| 81 | 103 | 22 | 23 | 139 |
Fate of acquired genes in strains of E. coli and Shigella spp.
| ORFans | HOPs | |||||||||||
| Strain | Intact (n) | ψ Genes (n) | Lost (n) | Intact (%) | ψ Genes (%) | Lost (%) | Intact (n) | ψgenes (n) | Lost (n) | Intact (%) | ψ Genes (%) | Lost (%) |
|
| 447 | 60 | 74 | 77 | 10 | 13 | 104 | 26 | 91 | 47 | 12 | 41 |
|
| 440 | 70 | 71 | 76 | 12 | 12 | 101 | 32 | 95 | 44 | 14 | 42 |
|
| 388 | 91 | 93 | 68 | 16 | 16 | 98 | 22 | 103 | 44 | 10 | 46 |
|
| 390 | 55 | 108 | 71 | 10 | 20 | 97 | 17 | 100 | 45 | 8 | 47 |
|
| 407 | 47 | 96 | 74 | 9 | 17 | 121 | 28 | 72 | 55 | 13 | 33 |
|
| 402 | 62 | 121 | 69 | 11 | 21 | 99 | 16 | 97 | 47 | 8 | 46 |
|
| 468 | 39 | 46 | 85 | 7 | 8 | 161 | 9 | 50 | 73 | 4 | 23 |
|
| 469 | 37 | 47 | 85 | 7 | 9 | 161 | 9 | 50 | 73 | 4 | 23 |
|
| 717 | 33 | 16 | 94 | 4 | 2 | 255 | 1 | 45 | 85 | 0 | 15 |
|
| 735 | 27 | 17 | 94 | 3 | 2 | 268 | 3 | 46 | 85 | 1 | 15 |
|
| 628 | 28 | 26 | 92 | 4 | 4 | 254 | 2 | 29 | 89 | 1 | 10 |
|
| 658 | 37 | 40 | 90 | 5 | 5 | 230 | 10 | 22 | 88 | 4 | 8 |
|
| 585 | 50 | 28 | 88 | 8 | 4 | 178 | 8 | 21 | 86 | 4 | 10 |
*: (n) denotes the total number of genes in a particular class, and (%) is the percentage of the total number of ORFans or HOPs in a genome.