| Literature DB >> 12914657 |
Abstract
BACKGROUND: Recent developments in the understanding of paralogous evolution have prompted a focus not only on obviously advantageous genes, but also on genes that can be considered to have a weak or sporadic impact on the survival of the organism. Here we examine the duplicative behavior of a category of genes that can be considered to be mostly transient in the genome, namely laterally transferred genes. Using both a compositional method and a gene-tree approach, we identify a number of proposed laterally transferred genes and study their nucleotide composition and frequency of duplication.Entities:
Mesh:
Year: 2003 PMID: 12914657 PMCID: PMC193641 DOI: 10.1186/gb-2003-4-8-r48
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Overview of gene categories
| Category | G | Cr | O1 | O2 | Suggested content of imports |
| A | + | - | - | - | Recent imports or loss of imports from B |
| B | + | + | - | - | Less recent imports |
| C | + | - | + | + | Loss of genes in Cr, or pervasive imports |
| D | + | + | + | + | Only highly pervasive imports expected |
G, genome under consideration; Cr, paired close relative; O1, first outgroup organism; O2, second outgroup organism. +, Present; -, absent.
Category A
| Pair | Outgroups | ||||
| ecol, edl | stym, klebs | 95 | 17 | 0.179 | 7 |
| ecol, stym | kpne, paer | 248 | 43 | 0.173 | 25 |
| edl, ecol | stym, klebs | 180 | 43 | 0.239 | 51 |
| sfle, ecol | stym, klebs | 139 | 30 | 0.216 | 37 |
| stym, ecol | kpne, paer | 270 | 47 | 0.174 | 8 |
| stym, styp | ecol, klebs | 89 | 20 | 0.225 | 5 |
| styp, stym | ecol, klebs | 138 | 31 | 0.225 | 5 |
| bsub, bhal | cace, cper | 575 | 51 | 0.089 | 16 |
| bhal, bsub | cace, cper | 603 | 40 | 0.066 | 16 |
| cace, cper | bhal, bsub | 779 | 49 | 0.063 | 20 |
| samu, samw | bsub, cace | 70 | 9 | 0.129 | 6 |
N, total number of genes; Ndev, number of genes with cT2 scores > 38; Pdev, proportion of deviant genes; Ndup, number of duplicated genes. See Materials and methods for species abbreviations. All Pdev values are significant at 99% when compared to group D (Table 4).
Category B
| Pair | Outgroups | ||||
| ecol, edl | stym, klebs | 190 | 27 | 0.142 | 18 |
| ecol, stym | kpne, paer | 163 | 22 | 0.135 | 11 |
| edl, ecol | stym, klebs | 189 | 15 | 0.079 | 32 |
| sfle, ecol | stym, klebs | 169 | 27 | 0.160 | 34 |
| stym, ecol | kpne, paer | 171 | 18 | 0.105 | 3 |
| stym, styp | ecol, klebs | 249 | 34 | 0.137 | 10 |
| styp, stym | ecol, klebs | 233 | 31 | 0.133 | 5 |
| bsub, bhal | cace, cper | 582 | 26 | 0.085 | 18 |
| bhal, bsub | cace, cper | 602 | 28 | 0.047* | 6 |
| cace, cper | bhal, bsub | 293 | 19 | 0.065 | 8 |
| samu, samw | bsub, cace | 489 | 31 | 0.063 | 12 |
*All Pdev values are significant at 99% when compared to group D (Table 3), except for bhal which is not significant. See Materials and methods for species abbreviations.
Category D
| Pair | Outgroups | |||||
| ecol, edl | stym, klebs | 2,823 | 91 | 0.032 | 46 | 3 |
| ecol, stym | kpne, paer | 2,163 | 49 | 0.023 | 34 | 2 |
| edl, ecol | stym, klebs | 2,865 | 87 | 0.030 | 71 | 8 |
| sfle, ecol | stym, klebs | 2,684 | 113 | 0.042 | 214 | 20 |
| stym, ecol | kpne, paer | 2,129 | 74 | 0.035 | 35 | 14 |
| stym, styp | ecol, klebs | 2,752 | 95 | 0.035 | 35 | 13 |
| styp, stym | ecol, klebs | 2,662 | 96 | 0.036 | 29 | 15 |
| bsub, bhal | cace, cper | 1,329 | 32 | 0.024 | 10 | 0 |
| bhal, bsub | cace, cper | 1,354 | 51 | 0.038 | 15 | 2 |
| cace, cper | bhal, bsub | 1,290 | 47 | 0.036 | 11 | 0 |
| samu, samw | bsub, cace | 1,055 | 42 | 0.040 | 9 | 0 |
* The number of duplicated genes with cT2 > 38. See Materials and methods for species abbreviations.
Figure 1Distribution of GC3 content of genes in categories A (dashed line) and D (solid line). (a) E. coli K12; (b) E. coli O157:H7; (c) S. flexneri; (d) S. typhimurium; (e) S. typhi; (f) B. subtilis; (g) B. halodurans; (h) C. acetobutylicum; (i) S. aureus.
GC3 % distribution data
| Pair | Outgroups | m(A) | SD(A) | m(D) | SD(D) |
| ecol, edl | stym, klebs | 0.451 | 0.119 | 0.523 | 0.035 |
| edl, ecol | stym, klebs | 0.425 | 0.130 | 0.544 | 0.068 |
| sfle, ecol | stym, klebs | 0.455 | 0.108 | 0.546 | 0.062 |
| stym, styp, | ecol, klebs | 0.441 | 0.138 | 0.589 | 0.069 |
| styp, stym | ecol, klebs | 0.441 | 0.123 | 0.589 | 0.069 |
| bsub, bhal | cace, cper | 0.387 | 0.091 | 0.437 | 0.061 |
| bhal, bsub | cace, cper | 0.383 | 0.061 | 0.408 | 0.049 |
| cace, cper | bhal, bsub | 0.186 | 0.037 | 0.182 | 0.032 |
| samu, samw | bsub, cace | 0.231 | 0.049 | 0.195 | 0.035 |
m, Mean; SD, standard deviation. A, category A; D, category D. See Materials and methods for species abbreviations.
Figure 2Distribution of GC3 content of genes of high (dashed line) and low (solid line) cT2 in category A. (a) B. subtilis; (b) B. halodurans; (c) C. acetobutylicum.
Frequency of duplications by category
| Pair | Outgroups | ||
| ecol, edl | stym, klebs | 0.074** | 0.016 |
| ecol, stym | kpne, paer | 0.101** | 0.018 |
| edl, ecol | stym, klebs | 0.283** | 0.025 |
| sfle, ecol | stym, klebs | 0.266** | 0.080 |
| stym, ecol | kpne, paer | 0.030* | 0.015 |
| stym, styp | ecol, klebs | 0.056** | 0.014 |
| styp, stym | ecol, klebs | 0.036* | 0.011 |
| bsub, bhal | cace, cper | 0.029** | 0.008 |
| bhal, bsub | cace, cper | 0.027** | 0.011 |
| cace, cper | bhal, bsub | 0.026** | 0.009 |
| samu, samw | bsub, cace | 0.086** | 0.009 |
*Significant at 95%; **significant at 99%.