| Literature DB >> 30102350 |
Wen-Sui Lo1,2, Gail E Gasparich3, Chih-Horng Kuo1.
Abstract
Convergent evolution, a process by which organisms evolved independently to have similar traits, provides opportunities to understand adaptation. The bacterial genus Mycoplasma contains multiple species that evolved independently to become ruminant pathogens, which represents an interesting study system for investigating the process. In this work, we determined the genome sequences of 11 Entomoplasma/Mesoplasma species. This new data set, together with the other available Mollicutes genomes, provided comprehensive taxon sampling for inferring the gene content evolution that led to the emergence of Mycoplasma Mycoides cluster. Our results indicated that the most recent common ancestor (MRCA) of the Mycoides-Entomoplasmataceae clade lost ∼15% of the core genes when it diverged from the Spiroplasma Apis clade. After this initial wave of genome reduction, relatively few gene gains or losses were inferred until the emergence of the Mycoides cluster. Compared with those Entomoplasmataceae lineages that maintained the association with insects, the MRCA of the Mycoides cluster experienced a second wave of gene losses, as well as acquiring >100 novel genes through horizontal gene transfer. These gene acquisitions involved many with the Mycoplasma Hominis/Pneumoniae lineages as the putative donors, suggesting that gene exchanges among these vertebrate symbionts with distinct phylogenetic affiliations may be important in the emergence of the Mycoides cluster. These findings demonstrated that the gene content of bacterial genomes could be exceedingly dynamic, even for those symbionts with highly reduced genomes. Moreover, the emergence of novel pathogens may involve extensive remodeling of gene content, rather than acquisition of few virulence genes.Entities:
Mesh:
Year: 2018 PMID: 30102350 PMCID: PMC6117150 DOI: 10.1093/gbe/evy172
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Genome Characteristics of Representative Species in the Families Spiroplasmataceae, Entomoplasmataceae, and Mycoplasmataceae
| Clade | Host | Accession | Genome Size (kb) | % GC | % Coding | Protein-Coding Genes | Pseudogenes | tRNA Genes | rRNA Genes | |
|---|---|---|---|---|---|---|---|---|---|---|
| Citri-Chrysopicola-Mirum | Insect/Plant | NZ_CP010899 | 1,464 | 25.0 | 67.2 | 1,330 | 211 | 32 | 3 | |
| Insect | CP005077 | 1,123 | 28.8 | 89.0 | 1,009 | 6 | 33 | 3 | ||
| Crustacean | CP011856 | 1,366 | 29.8 | 86.0 | 1,180 | 30 | 32 | 3 | ||
| Apis | Insect | CP006682 | 1,161 | 28.3 | 87.8 | 997 | 1 | 29 | 3 | |
| Insect | CP006681 | 1,175 | 26.4 | 92.2 | 1,071 | 0 | 29 | 3 | ||
| Insect | CP006934 | 1,076 | 30.2 | 90.0 | 924 | 7 | 30 | 6 | ||
| Lactucae | Plant (surface) | CP024967 | 837 | 29.8 | 89.4 | 692 | 1 | 32 | 9 | |
| Lucivorax | Insect | CP024963 | 1,032 | 29.8 | 89.3 | 868 | 6 | 32 | 3 | |
| Insect | PHNE00000000 | 1,127 | 30.3 | 88.2 | 961 | 7 | 31 | 3 | ||
| Insect | CP024962 | 838 | 34.6 | 90.2 | 702 | 3 | 33 | 6 | ||
| Insect | CP024965 | 868 | 28.0 | 91.5 | 725 | 2 | 32 | 3 | ||
| Seiffertii | Plant (surface) | GCA_000518725 | 978 | 30.2 | 89.2 | 807 | 18 | 29 | 7 | |
| Insect | GCA_000686545 | 918 | 30.2 | 87.7 | 740 | 15 | 29 | 12 | ||
| Horse/Rabbit | GCF_000483165 | 925 | 30.6 | 89.1 | 814 | 15 | 28 | 8 | ||
| Insect | GCA_000702725 | 779 | 28.1 | 91.4 | 681 | 10 | 28 | 8 | ||
| Florum | Plant (surface) | NC_006055 | 793 | 27.0 | 92.7 | 680 | 0 | 29 | 6 | |
| Insect | GCA_000701525 | 807 | 26.9 | 91.4 | 689 | 13 | 29 | 5 | ||
| Insect | CP024966 | 848 | 27.2 | 91.7 | 714 | 7 | 29 | 6 | ||
| Insect | CP024968 | 800 | 27.2 | 91.3 | 691 | 3 | 29 | 6 | ||
| Insect | CP024969 | 847 | 26.9 | 91.6 | 722 | 7 | 29 | 6 | ||
| Plant (surface) | CP024964 | 845 | 26.6 | 80.3 | 789 | 57 | 29 | 6 | ||
| Insect | GCA_000518825 | 844 | 26.9 | 87.6 | 710 | 25 | 29 | 7 | ||
| Insect | PHND00000000 | 900 | 26.2 | 88.6 | 786 | 7 | 29 | 3 | ||
| Insect | PHNF00000000 | 839 | 26.3 | 89.5 | 701 | 4 | 29 | 3 | ||
| Mycoides | Cattle/Goat | NC_005364 | 1,212 | 24.0 | 81.5 | 1,017 | 0 | 30 | 6 | |
| Goat | NC_007633 | 1,010 | 23.8 | 86.0 | 793 | 35 | 30 | 6 | ||
| Cattle | NC_014751 | 1,009 | 23.8 | 85.8 | 835 | 31 | 30 | 6 | ||
| Goat | NC_015946 | 833 | 26.9 | 83.7 | 654 | 25 | 30 | 6 | ||
| Goat | NZ_CP007520 | 895 | 25.7 | 89.5 | 747 | 8 | 30 | 6 | ||
| Hominis | Goat | NC_009497 | 877 | 29.7 | 79.6 | 666 | 63 | 34 | 6 | |
| Pig | NC_007295 | 897 | 28.5 | 85.5 | 665 | 18 | 30 | 3 | ||
| Fish | NC_006908 | 777 | 25.0 | 90.4 | 649 | 6 | 28 | 3 | ||
| Pneumoniae | Human | NC_004432 | 1,359 | 25.7 | 87.0 | 1,022 | 10 | 30 | 3 | |
| Human | NC_000908 | 580 | 31.7 | 93.7 | 507 | 7 | 36 | 3 | ||
| Bird | NC_004829 | 1,013 | 31.5 | 86.5 | 747 | 24 | 32 | 6 |
Note.—All six species in the genus Entomoplasma and all 11 species in Mesoplasma are included.
Genus name abbreviations: A., Acholeplasma; E., Entomoplasma; Me., Mesoplasma; My., Mycoplasma; S., Spiroplasma.
Genome sequences newly reported in this study.
Fig. 3.—Characteristics of the genes gained and lost. (A) Phylogenetic positions of the six internal branches and two terminal branches analyzed in this study. The species phylogeny is based on the result from figure 1A. (B) Putative gene losses. The number in parentheses above each subpanel indicates the total number of inferred gene losses. The genes are binned by length using 250-bp intervals and color-coded according to their functional category assignments. The circle size is proportional to the number of genes in each bin. (C) Putative gene gains.
Fig. 1.—Molecular phylogeny and gene content dissimilarity of the Mycoplasmatales–Entomoplasmatales clade lineages. (A) Maximum likelihood phylogeny based on 161 shared single-copy genes (71,550 aligned amino acid sites). One node with <98% bootstrap support is labeled with “*”. Bayesian inference based on the same concatenated alignment produced a phylogeny with identical topology and comparable levels of support. (B) Hierarchical clustering based on gene content. Two nodes with <80% support are labeled with “*”. Note the shift in position of the Mycoides clade lineages. (C) Principal coordinates analysis of gene content. The % variance explained by each axis is provided in parentheses.
Fig. 2.—Presence and absence of genes related to substrate transport, metabolism, and pathogenicity. Three sets of genes could be classified as two distinct types based on sequence similarities among homologous genes (i.e., oppA/B/C/D, phnB/D, and lplA) and the gene names are color-coded (green, Entomoplasmatales-type; brown, Mycoplasmatales-type).
Fig. 4.—Organization of horizontally acquired gene islands in Mycoplasma mycoides (dark green) and their homologous genes in other genomes (light green). Annotated pseudogenes are indicated by transparent light green; neighbor genes that were not involved in horizontal transfer are shown in white. The sequence accession numbers and start/end positions of these regions are labeled.