| Literature DB >> 21303508 |
Kalliopi Georgiades1, Vicky Merhej, Khalid El Karkouri, Didier Raoult, Pierre Pontarotti.
Abstract
BACKGROUND: Genome degradation is an ongoing process in all members of the Rickettsiales order, which makes these bacterial species an excellent model for studying reductive evolution through interspecies variation in genome size and gene content. In this study, we evaluated the degree to which gene loss shaped the content of some Rickettsiales genomes. We shed light on the role played by horizontal gene transfers in the genome evolution of Rickettsiales.Entities:
Mesh:
Year: 2011 PMID: 21303508 PMCID: PMC3055210 DOI: 10.1186/1745-6150-6-6
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Figure 1Phylogenomic analysis of . A. Gene content phylogeny constructed from the matrix of discrete characters using the neighbor-joining method. B. Whole core gene concatenated phylogenetic tree.
Genes gained by Rickettsiales.
| riCOGs | donor species | annotation | biological process | gained by | lost by |
|---|---|---|---|---|---|
| riCOG01601 | unknown | unknown | |||
| riCOG01580 | unknown | unknown | |||
| riCOG00373 | unknown | unknown | |||
| riCOG00835 | toxin of toxin antitoxin | toxin of toxin antitoxin | SFG | ||
| riCOG00983 | S-adenosylmethionine synthesis | ||||
| riCOG01685 | Eukaryotes | leucine-rich repeats | unknown | ||
| rioriCOG00831 | Canarypox virus | ankyrin-repeat containing protein | protein-protein interaction motif | ||
| rioriCOG00862 | ankyrin-repeat containing protein | protein-protein interaction motif | |||
| rioriCOG00673 | transposase & inactivated derivative | transposase activity | |||
Six riCOGs probably transferred by different bacteria, animals and plants to rickettsial species and 3 rioriCOGs probably transferred by gamma-proteobacteria species, viruses and protists to the Orientia genus. Transferred genes have various functions including protein-protein interaction and metabolic activity. HGT took place at different levels of evolution and at some cases it was followed by gene loss in some species.
Rickettsiales as gene donors.
| target species | function | gained from | ||
|---|---|---|---|---|
| riCOG00139 | nucleotidyltransferase substrate binding | transferase activity | SFG | |
| riCOG00530 | putative permease | transmembrane transport | proto- | |
| rioriCOG00847 | Na+/proline symporter histidine kinase | transposase activity | ||
Two riCOGs and one rioriCOG probably transferred from Rickettsia and Orientia to Firmicutes, gamma-proteobacteria and Bacteroidetes. These genes code for transferase and transposases activity.
Figure 2. Probable gene gain events shaping the evolution of Rickettsia.
De novo genes
| Gene (sequence length *) | Found in | Size (nt) | Ka/Ks | P-value | Coding region | AT % | |
|---|---|---|---|---|---|---|---|
| 1 | raf_ORF0703 (335 nt) | 336 | 0.257 | 0.0357 | 57.6% | ||
| 477 | 1.628 | 0.046 | Yes | ||||
| 2 | raf_ORF0265 (389 nt) | 390 | 0.84 | 0.0233 | Yes | 49.6% | |
| 3 | raf_ORF0391 | 351 | 1.5352 | 0.3244 | No | 51.1% | |
| 4 | raf_ORF0649 (449 nt) | 309 | 1.355 | 0.1443 | 61.4% | ||
| 363 | 1.794 | 0.7871 | |||||
| 432 | 1.568 | 0.4576 | No | ||||
| 5 | raf_ORF1011 (551 nt) | 555 | 1.239 | 0.9519 | No | 62.7% | |
| 6 | raf_ORF1025 (311 nt) | 309 | 0.879 | 0.7758 | No | 63% | |
| 7 | raf_ORF0586 (347 nt) | 348 | 1.257 | 0.0719 | 58.2% | ||
| 147 | 2.171 | 0.6816 | No | ||||
| 8 | raf_ORF0724 (569 nt) | 354 | 0.154 | 0.1146 | 58.8% | ||
| 585 | 1.45 | 0.6896 | No | ||||
| 9 | raf_ORF0390 (1001 nt) | 432 | 1.479 | 0.0888 | 55.3% | ||
| 579 | 0.639 | 0.1011 | |||||
| 744 | 0.649 | 0.7778 | |||||
| 834 | 0.555 | 0.152 | No | ||||
| 10 | raf_ORF0053 (488 nt) | 474 | 0.615 | 0.1924 | 59% | ||
| 489 | 0.579 | 0.2145 | |||||
| 489 | 0.561 | 0.2427 | |||||
| 489 | 0.674 | 0.2498 | |||||
| 489 | 0.713 | 0.4433 | No | ||||
| 11 | raf_ORF0348 (593 nt) | 594 | 1.427 | 0.0494 | Yes | 58% | |
| 378 | 1.123 | 0.3122 | |||||
| 405 | 0.492 | 0.0917 | |||||
| 603 | 1.517 | 0.9648 | |||||
| 576 | 1.076 | 0.3832 | No | ||||
| 12 | raf_ORF0876 (378 nt) | 486 | 0.921 | 0.2213 | 51% | ||
| 486 | 1.354 | 0.636 | No | ||||
| 13 | raf_ORF0172 (1169 nt) | 1158 | 0.773 | 0.0564 | 52.8% | ||
| 1170 | 0.764 | 0.0989 | |||||
| 1185 | 0.069 | 0.2337 | No | ||||
| 14 | raf_ORF0993 (320 nt) | 321 | 0.942 | 0.059 | 57% | ||
| 435 | 0.1905 | 0.544 | No | ||||
| 15 | raf_ORF0046 (520 nt) | 519 | 0.406 | 0.0174 | Yes | 58% | |
| 408 | 1.53 | 0.0684 | No | ||||
| 483 | 0.977 | 0.3574 | No | ||||
| 16 | raf_ORF0275 (1394 nt) | 1395 | 1.83 | 0.0262 | Yes | 55.6% | |
| 1404 | 1.084 | 0.0614 | No | ||||
| 1395 | 0.767 | 0.0348 | Yes | ||||
| 1395 | 0.9 | 0.1505 | |||||
| 1422 | 0.873 | 0.2144 | |||||
| 1389 | 1.133 | 0.344 | No | ||||
| 17 | raf_ORF0921 (647 nt) | 576 | 0.62 | 0.0365 | Yes | 57.4% | |
| 645 | 0.642 | 0.2394 | No | ||||
When the ω ratio was significantly inferior to 1 (p-value < 0.05), we considered that we have a coding region. According to the ω ratio test, raf_ORF0703 and raf_ORF0265 are functional.
Their function in not known. On the contrary, the twelve following sequences seem to correspond to no coding regions, while raf_ORF0046, raf_ORF0275 and raf_ORF0921 seem to be non-functional. Only a transcriptomic analysis could show for sure if these regions are really coding or not. By chance, all of these genes are found in R. africae and to calculate the ω ratio we took as reference sequence the R. africae sequence.
* The length given in nucleotides is the sequence length in Rickettsia africae and the AT % content is the one corresponding to Rickettsia africae.
Figure 3Gene loss in . The " proto-Rickettsiae" genome probably contained 1,027 genes. R. bellii showed no gene loss. The TG lost the largest number of genes, followed by R. canadensis and R. akari. The gene loss that occurred during the separation of the SFG from the TG mainly involves genes coding for metabolism and information storage and processing. Losses at species level equally affect genes of all functions. The figure represents the species tree as deduced from the concatenation of whole core gene of 11 Rickettsia spp. [Merhej et al., 2009 [24]]. Numbers at arrows show the number of lost genes. Genes were classified into four functional categories: information storage and processing in pink, cellular processes and signaling in green, metabolism in blue and poorly characterized functions in grey. Numbers in the colored squares indicate the percentage of lost genes in the corresponding category. Gain penalty was set at five using PARS algorithm.
Gene content in the rickettsial genomes
| 1002 | 1025 | 850 | 911 | 785 | 826 | 758 | 760 | 752 | 805 | ||
| 0.82 | 1013 | 845 | 986 | 786 | 825 | 748 | 751 | 747 | 798 | ||
| 0.83 | 0.82 | 856 | 906 | 777 | 828 | 754 | 760 | 746 | 800 | ||
| 0.69 | 0.64 | 0.69 | 832 | 772 | 825 | 748 | 751 | 747 | 814 | ||
| 0.74 | 0.75 | 0.73 | 0.63 | 782 | 811 | 740 | 743 | 742 | 783 | ||
| 0.64 | 0.65 | 0.64 | 0.63 | 0.64 | 792 | 743 | 747 | 740 | 770 | ||
| 0.67 | 0.57 | 0.67 | 0.62 | 0.62 | 0.65 | 774 | 775 | 758 | 887 | ||
| 0.87 | 0.86 | 0.87 | 0.86 | 0.85 | 0.86 | 0.89 | 784 | 724 | 750 | ||
| 0.92 | 0.91 | 0.92 | 0.91 | 0.90 | 0.90 | 0.94 | 0.95 | 727 | 752 | ||
| 0.78 | 0.77 | 0.77 | 0.77 | 0.77 | 0.76 | 0.78 | 0.84 | 0.88 | 732 | ||
| 0.66 | 0.57 | 0.64 | 0.61 | 0.60 | 0.63 | 0.64 | 0.87 | 0.91 | 0.76 | ||
The numbers of genes shared between genomes (upper right triangle), the percentage of genes shared between genomes (the total number divided by the number of genes in the smallest genome; lower left triangle) and the numbers of genes per genome (bold).