| Literature DB >> 22919687 |
Vaibhav Bhandari1, Hafiz S Naushad, Radhey S Gupta.
Abstract
The analyses of genome sequences have led to the proposal that lateral gene transfers (LGTs) among prokaryotes are so widespread that they disguise the interrelationships among these organisms. This has led to questioning of whether the Darwinian model of evolution is applicable to prokaryotic organisms. In this review, we discuss the usefulness of taxon-specific molecular markers such as conserved signature indels (CSIs) and conserved signature proteins (CSPs) for understanding the evolutionary relationships among prokaryotes and to assess the influence of LGTs on prokaryotic evolution. The analyses of genomic sequences have identified large numbers of CSIs and CSPs that are unique properties of different groups of prokaryotes ranging from phylum to genus levels. The species distribution patterns of these molecular signatures strongly support a tree-like vertical inheritance of the genes containing these molecular signatures that is consistent with phylogenetic trees. Recent detailed studies in this regard on the Thermotogae and Archaea, which are reviewed here, have identified large numbers of CSIs and CSPs that are specific for the species from these two taxa and a number of their major clades. The genetic changes responsible for these CSIs (and CSPs) initially likely occurred in the common ancestors of these taxa and then vertically transferred to various descendants. Although some CSIs and CSPs in unrelated groups of prokaryotes were identified, their small numbers and random occurrence has no apparent influence on the consistent tree-like branching pattern emerging from other markers. These results provide evidence that although LGT is an important evolutionary force, it does not mask the tree-like branching pattern of prokaryotes or understanding of their evolutionary relationships. The identified CSIs and CSPs also provide novel and highly specific means for identification of different groups of microbes and for taxonomical and biochemical studies.Entities:
Keywords: Archaea; Crenarchaeota; RpoB signatures; Thermotogae; conserved indels; lateral gene transfers; phylogenetic trees; signature proteins
Mesh:
Substances:
Year: 2012 PMID: 22919687 PMCID: PMC3417386 DOI: 10.3389/fcimb.2012.00098
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
Overview of the CSIs and CSPs that have been identified for some major prokaryotic taxa.
| Archaea | Gao and Gupta, | |
| Crenarchaeota | Gupta and Shami, | |
| Thaumarchaeota | >200 CSPs | Gupta and Shami, |
| Thermotogae | Gupta and Bhandari, | |
| Cyanobacteria | Gupta, | |
| Chlamydiae | Gupta and Griffiths, | |
| Bacteroidetes, chlorobi and fibrobacteres | Gupta, | |
| Actinobacteria | Gao and Gupta, | |
| Deinococcus-thermus | Griffiths and Gupta, | |
| Aquificae | Griffiths and Gupta, | |
| α-proteobacteria | Gupta and Mok, | |
| γ-proteobacteria | Gao et al., | |
| ε-proteobacteria | Gupta, | |
| Pasteurellales | Naushad and Gupta, | |
| Clostridia sensu stricto | Gupta and Gao, |
The table provides general information regarding the number of CSIs and CSPs identified for many taxonomic groups on which genomic studies have been conducted. Further details can be obtained from the corresponding studies.
Figure 1Evolutionary relationships among Thermotogae species based upon CSIs and a Phylogenetic Tree. (A) Partial sequence alignment for the ribosomal protein L7 showing a 3 aa CSI (boxed) that is specific for all detected species of the Thermotogae phylum. The dashes in the alignment (−) indicate amino acid identity with the corresponding residue in the top line; (B) A maximum likelihood tree for the 12 sequenced Thermotogae species based upon concatenated sequences for 12 conserved proteins. (C) A summary diagram showing the species specificities of different CSIs identified for the Thermotogae group of species. The left panel highlights the CSIs that are specific for the entire Thermotogae phylum or its sub-groups, whereas the right panel indicates the CSIs that were also present in some non-Thermotogae organisms. Figures 1A,B modified from Gupta and Bhandari (2011).
Figure 2A summary diagram showing the various molecular markers that have been identified for the Archaeal kingdom and its subgroups. The arrows indicate the suggested evolutionary stages where the proteins unique for a particular taxa are proposed to have been introduced. The numbers beside the arrows indicate the number of CSIs and CSPs specific for the various taxa (these numbers indicate CSPs unless otherwise noted). The branching pattern shown is based solely upon the distribution patterns of CSPs and CSIs. Modified from Gao and Gupta (2007) and Gupta and Shami (2011).
A series of proteins specific for the Crenarchaeota and its sub-groups.
| 0.0 | 9e–98 | 5e–64 | 7e–161 | 7e–22 | – | – | – | – | ||
| 3e–46 | 9e–43 | 1e–20 | 1e–23 | 3e–25 | – | – | – | – | ||
| 3e–41 | – | 5e–27 | 4e–19 | 3e–25 | – | – | – | – | ||
| 7e–46 | 1e–21 | 2e–20 | 5e–17 | 7e–32 | – | – | – | – | ||
| 4e–56 | 1e–25 | 3e–21 | 3e–21 | 2e–85 | – | – | – | – | ||
| 9e–56 | 4e–36 | 4e–21 | 1e–46 | 1e–19 | – | – | – | – | ||
| 4e–40 | 2e–29 | 3e–20 | 7e–26 | – | 1e–77 | 1e–80 | – | – | ||
| 4e–42 | 6e–30 | 1e–25 | 1e–15 | – | 7e–50 | 8e–65 | – | – | ||
| 7e–34 | 3e–23 | 4e–22 | 4e–24 | – | 2e–162 | 0.0 | – | – | ||
| 1e–41 | 7e–30 | 5e–26 | 8e–15 | – | 5e–50 | 8e–64 | – | – | ||
| 3e–31 | 3e–33 | 3e–20 | 1e–22 | – | 4e–39 | 8e–60 | – | – | ||
| 9e–18 | 3e–11 | – | – | – | – | – | 0.0 | 2e–73 | ||
| 3e–18 | 3e–11 | – | – | – | – | – | 0.0 | 6e–54 | ||
| 1e–18 | 1e–10 | – | – | – | – | – | 0.0 | 2e–63 | ||
| 6e–22 | 7e–11 | – | – | – | – | – | 0.0 | 1e–60 | ||
| 1e–35 | 5e–30 | – | – | – | – | – | 1e–42 | 3e–10 | ||
| 1e–17 | 4e–8 | – | – | – | – | – | 1e–87 | 2e–22 | ||
| 2e–19 | 7e–11 | – | – | – | – | – | 0.0 | 5e–61 | ||
| 3e–15 | 6e–10 | – | – | – | – | – | 0.0 | 4e–46 | ||
Blastp searches were carried out on proteins specific for the Crenarchaeota or its sub-groups and the results for representative species from different sub-groups of the Crenarchaeota are shown with the observed E-values. E-values greater than 1e–3 are considered insignificant hits with lack of homology to the query protein sequence.
The dashes (–) indicate that the homolog for the protein query was not detected in the BlastP searches.
Top non-Crenarchaeota hits indicate detection of species outside the Crenarchaeota that were observed to have the lowest E-value scores.
Figure 3Evolutionary significance of various identified CSIs in the RNA polymerase β subunit. (A) A portion of the RpoB sequence alignment showing a large insert (boxed) that is distinctive characteristic of all Proteobacteria and some Gram-negative phyla (Chlamydiae-Verrucomicrobiae, Aquificales, Planctomycetes, and Bacteroidetes-Chlorobi), but not found in other phyla of bacteria. Due to the large size of the insert, its entire sequence is not shown. Dashes (–) indicate identity with the amino acid on the top line. On the right is a linear representation of prokaryotic relationships based on the presence and absence of this CSI. The numbers in the brackets indicate the species of each phylum, which have been identified to contain the CSI. (B) A schematic representation of the sequence for E. coli RNA polymerase β subunit (RpoB) showing some functionally important regions and the positions of different lineage-specific inserts that have been identified within this protein. The large insert depicted in (A) (≈ 100 aa in E. coli) is shown in solid black. The positions of CSIs for different groups are roughly indicated using arrows. The values in the brackets identify the number of organisms in each respective group and the number of these species to harbour the indicated CSI. In all cases no organism outside of the indicated group was identified to contain the indel. The indicated CSIs have been described in earlier work (Griffiths and Gupta, 2004b, 2007b; Gupta and Mok, 2007; Gao et al., 2009; Gupta and Bhandari, 2011; Naushad and Gupta, 2012).