| Literature DB >> 28186559 |
Louis-Marie Bobay1, Howard Ochman1.
Abstract
Entities:
Year: 2017 PMID: 28186559 PMCID: PMC5381558 DOI: 10.1093/gbe/evx026
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FRecognizing biological species through genome analysis. (A) “Scheme used to test for gene flow”. In each species or designated set of genomes composed of n strains (depicted as filled colored circles), nonredundant combinations of i strains (with i ranging from 4 to n–2 strains) were subsampled 100 times for each value of i. At each iteration, the h/m ratio was calculated for a randomly selected 10-kilobase fragment from the alignment of the core genome concatenate common to all strains. Within the bivariate plots, black dots are medians, and the grey-shaded region is the standard deviation, for the indicated number of subsampled combinations of strains. Left panel is the graphical representation observed when there are no barriers to gene flow among strains, and right panel depicts the discontinuity produced by inclusion of a strain that does not participate in gene exchange. (B–D) “Patterns of genetic exchange observed in taxonomically defined bacterial species”. Kingella kingae, in which there is gene flow among the entire set of sequenced strains; C. pseudotuberculosis, in which there is too little gene flow to assess species status; Buckholderia pseudomallei, in which there is a sharp drop in h/m ratios, denoting the presence of a sexually isolated strain or strains. The complete set of graphical representations of gene flow for 93 taxonomically defined bacterial and archaeal species is presented in supplementary fig. S1, Supplementary Material online. (E) Graphical representation of B. pseudomallei after removal of the sexually isolated strain, showing that the remaining strains constitute a biological species. The complete set of graphs is presented for the redefined species in fig. S3 before and after species redefinition. (F–I) Verification of method for recognizing biological species using obligatory sexual animals. Graphical representation of gene flow in Drosophila when including 311 haplotypes of D. melanogaster and one haplotype of D. simulans, and when the analysis is restricted to the 311 haplotypes of D. melanogaster. Graphical representation of gene flow in Homininae when including 311 haplotypes of chromosome 21 in humans and one haplotype of chimpanzees (P. troglodytes), and when the analysis is restricted to the 311 chromosome 21 haplotypes of humans.
FMaximum sequence divergence within species of bacteria. Shown are the average nucleotide sequence identity values for orthologs shared by the two maximally divergent strains within each biological species of bacteria (grey and pink bars). The inset shows these values for those named species (red bars) that were subsequently redefined into biological species (pink bars) based on gene flow (n = 23). Within strictly clonal species (n = 14; not included in figure), maximally divergent strains ranged from 73 to > 99% nucleotide sequence identity (95% on average). Green dots indicate the degree of sequence identity between two maximally divergent haplotypes of D. melanogaster and of H. sapiens.
FDifferential effects of sequence divergence on gene flow in biological species of bacteria. For each species, the h/m ratio for each of the subsampled combination of strains was compared to the average genome-wide sequence identity (supplementary fig. S6, Supplementary Material online). Spearman correlation coefficients Rho were computed independently for all nonredundant combinations of strains (ranging from 4 to n–2 strains) each subsampled 100 times and whose distributions are displayed as box-and-whiskers (first and third quartiles, 1.5 interquartile range) plots for each species. Positive values of Rho indicate a positive correlation between nucleotide sequence identity and gene flow (such that genetic exchange occurs between more similar strains), whereas negative values indicate that genetic exchange occurs preferentially among divergent strains.