| Literature DB >> 22479625 |
Ling-Ling Chen1, Wan-Chia Chung, Chan-Pin Lin, Chih-Horng Kuo.
Abstract
Phytoplasmas and mycoplasmas are two groups of important pathogens in the bacterial class Mollicutes. Because of their economical and clinical importance, these obligate pathogens have attracted much research attention. However, difficulties involved in the empirical study of these bacteria, particularly the fact that phytoplasmas have not yet been successfully cultivated outside of their hosts despite decades of attempts, have greatly hampered research progress. With the rapid advancements in genome sequencing, comparative genome analysis provides a new approach to facilitate our understanding of these bacteria. In this study, our main focus is to investigate the evolution of gene content in phytoplasmas, mycoplasmas, and their common ancestor. By using a phylogenetic framework for comparative analysis of 12 complete genome sequences, we characterized the putative gains and losses of genes in these obligate parasites. Our results demonstrated that the degradation of metabolic capacities in these bacteria has occurred predominantly in the common ancestor of Mollicutes, prior to the evolutionary split of phytoplasmas and mycoplasmas. Furthermore, we identified a list of genes that are acquired by the common ancestor of phytoplasmas and are conserved across all strains with complete genome sequences available. These genes include several putative effectors for the interactions with hosts and may be good candidates for future functional characterization.Entities:
Mesh:
Year: 2012 PMID: 22479625 PMCID: PMC3313985 DOI: 10.1371/journal.pone.0034407
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of the genome sequences included in this study.
| Genome | RefSeq | Size (Mb) | % GC | % coding | No. of CDS | % without homolog |
| ‘ | NC_007716 | 0.71 | 26 | 73 | 671 | 28 |
| ‘ | NC_005303 | 0.85 | 27 | 72 | 750 | 20 |
| ‘ | NC_010544 | 0.88 | 27 | 64 | 684 | 16 |
| ‘ | NC_011047 | 0.60 | 21 | 76 | 479 | 17 |
|
| NC_013948 | 1.01 | 29 | 87 | 813 | 26 |
|
| NC_006908 | 0.78 | 24 | 90 | 633 | 31 |
|
| NC_000908 | 0.58 | 31 | 90 | 475 | 33 |
|
| NC_005364 | 1.21 | 23 | 81 | 1,017 | 36 |
|
| NC_000964 | 4.22 | 43 | 87 | 4,176 | 46 |
|
| NC_004567 | 3.31 | 44 | 83 | 3,007 | 39 |
|
| NC_009706 | 3.96 | 32 | 84 | 3,919 | 42 |
|
| NC_009454 | 3.03 | 52 | 85 | 2,977 | 42 |
Number of protein coding sequences.
Figure 1Organismal phylogeny and distribution of lineage-specific gene clusters.
The organismal phylogeny is inferred from the concatenated protein alignment of 105 single-copy genes shared by all lineages (with 44,919 aligned amino acid sites), the three phylogenetic methods used (i.e., maximum likelihood, parsimony, and Bayesian) all produced the same tree topology with strong support (i.e., all internal nodes received 100% bootstrap support using the maximum likelihood method and >97% clade credibility using the Bayesian method). The branch lengthes shown in this figure is based on the maximum likelihood result. The numbers above a branch and proceeded by a ‘+’ sign indicate the number of homologous gene clusters that are uniquely present in all daughter lineages; the numbers below a branch and proceeded by a ‘−’ sign indicate the number of homologous gene clusters that are uniquely absent. For example, 52 gene clusters are shared by all four ‘Candidatus Phytoplasma’ genomes and do not contain homolog from any of the other eight genomes analyzed (i.e., represent possible gene gain events in the common ancestor of ‘Ca. Phytoplasma’ lineages); similarly, 27 gene clusters are missing from the four ‘Ca. Phytoplasma’ genomes but are present in all other eight genomes (i.e., represent possible gene loss events in the common ancestor of ‘Ca. Phytoplasma’ lineages).
Figure 2Distribution of COG functional category assignments.
The functional categorization of each homologous gene clusters was classified according to the COG assignments, genes that do have any inferred COG annotation were assigned to a custom category X. The numbers in the center of each pie chart indicate the number of homologous gene clusters in each set (e.g., the ‘All+’ set contains 125 homologous gene clusters that are shared by all 12 genomes examined and the ‘Mollicutes−’ set contains 252 homologous gene clusters that are inferred to have been lost in the common ancestor of phytoplasmas and mycoplasmas).