| Literature DB >> 21350630 |
Kenro Oshima1, Kenji Ueda, Teruhiko Beppu, Hiromi Nishida.
Abstract
Comparisons of gene content and orthologous protein sequence constitute a major strategy in whole-genome comparison studies. It is expected that horizontal gene transfer between phylogenetically distant organisms and lineage-specific gene loss have greater influence on gene content-based phylogenetic analysis than orthologous protein sequence-based phylogenetic analysis. To determine the evolution of the syntrophic bacterium Symbiobacterium thermophilum, we analyzed phylogenetic relationships among Clostridia on the basis of gene content and orthologous protein sequence comparisons. These comparisons revealed that these 2 phylogenetic relationships are topologically different. Our results suggest that each Clostridia has a species-specific gene content because frequent genetic exchanges or gene losses have occurred during evolution. Specifically, the phylogenetic positions of syntrophic Clostridia were different between these 2 phylogenetic analyses, suggesting that large diversity in the living environments may cause the observed species-specific gene content. S. thermophilum occupied the most distant position from the other syntrophic Clostridia in the gene content-based phylogenetic tree. We identified 32 genes (14 under relaxed selection and 18 under functional constraint) evolving under Symbiobacterium-specific selection on the basis of synonymous-to-nonsynonymous substitution ratios. Five of the 14 genes under relaxed selection are related to transcription. In contrast, none of the 18 genes under functional constraint is related to transcription.Entities:
Year: 2010 PMID: 21350630 PMCID: PMC3039458 DOI: 10.4061/2011/376831
Source DB: PubMed Journal: Int J Evol Biol ISSN: 2090-052X
Figure 1Phylogenetic relationships on the basis of gene content comparisons among 50 Clostridia and Bacillus subtilis. The ortholog cluster analysis (minimum cluster size, 2) among the 51 bacteria was performed using the MBGD [13]. This analysis produced the gene presence/absence data matrix (10,636 genes × 51 organisms), which was used to generate the distance matrix between all pairs of the 51 bacteria. On the basis of the distance matrix, a neighbor-joining tree was reconstructed using MEGA software version 4 [14]. The bootstrap was performed with 1000 replicates. The bar indicates a 200-gene difference.
Figure 2Phylogenetic relationships on the basis of 112 orthologous protein sequence comparisons among 54 Clostridia and B. subtilis. The 112 proteins were extracted as orthologous proteins from the 55 bacteria by a previously described method [15]. We constructed the 112 multiple alignments by using Clustal W [16]. Then, a concatenated multiple alignment of the 112 multiple alignments was generated. The complete multiple alignment had 52,204 amino acid sites, including 19,818 gap/insertion sites. Hence, phylogenetic analyses were performed on the basis of 32,386 amino acid sites without the gap/insertion sites. The neighbor-joining tree was reconstructed using MEGA software version 4 [14]. The bootstrap was performed with 1000 replicates. The rate variation among sites was assumed to have a gamma distributed rate (α = 1). No other default parameters were changed. The bar indicates a 10% difference.
Genes evolving under Symbiobacterium-specific selection.
| Gene | 2Δln | ||
|---|---|---|---|
| 0.5347 | 24.3046 | 5.7413 | |
| 0.3967 | 5.9744 | 8.7835 | |
| 0.3774 | 75.4800 | 6.4323 | |
| 0.2240 | 3.4675 | 10.7272 | |
| 0.2183 | 8.3640 | 13.4750 | |
| 0.2084 | 2.0292 | 4.1224 | |
| 0.1869 | 9.3450 | 4.2192 | |
| 0.1526 | 24.2222 | 15.4681 | |
| 0.1497 | 26.2632 | 4.4715 | |
| 0.1315 | 3.7679 | 17.9996 | |
| 0.0838 | 2.1487 | 7.2876 | |
| 0.0156 | 2.7857 | 13.0700 | |
| 0.0125 | 2.8409 | 4.4748 | |
| 0.0044 | 2.9333 | 6.7786 | |
| 0.0123 | 0.0715 | 4.4609 | |
| 0.0073 | 0.0885 | 4.0842 | |
| 0.0045 | 0.0441 | 12.0081 | |
| 0.0040 | 0.0655 | 11.9908 | |
| 0.0039 | 0.0591 | 6.5852 | |
| 0.0039 | 0.1429 | 3.9835 | |
| 0.0038 | 0.0574 | 5.7865 | |
| 0.0038 | 0.0721 | 4.1368 | |
| 0.0037 | 0.1504 | 4.1064 | |
| 0.0035 | 0.0750 | 6.3618 | |
| 0.0034 | 0.0654 | 4.5717 | |
| 0.0034 | 0.0397 | 10.1363 | |
| 0.0027 | 0.0407 | 5.6285 | |
| 0.0027 | 0.0470 | 4.1885 | |
| 0.0023 | 0.0920 | 4.3304 | |
| 0.0020 | 0.0617 | 3.9836 | |
| 0.0018 | 0.1047 | 4.5829 | |
| 0.0004 | 0.0234 | 4.7842 | |
Figure 3Alignment of amino acid sequences of Hfq. Sth, Symbiobacterium thermophilum; Bsu, Bacillus subtilis; Chy, Carboxydothermus hydrogenoformans; Dre, Desulfotomaculum reducens; Dha, Desulfitobacterium hafniense; Mth, Moorella thermoacetica; Pth, Pelotomaculum thermopropionicum; Swo, Syntrophomonas wolfei. Red and blue sites indicate Symbiobacterium- and Bacillus-specific sites, respectively. The dots represent identical residues of S. thermophilum amino acid.