| Literature DB >> 27495742 |
Nicolás Toro1, Francisco Martínez-Abarca2, Manuel Fernández-López2.
Abstract
BACKGROUND: Population genetic analyses based on genome-wide sequencing data have been carried out for Sinorhizobium medicae and S. meliloti, two closely related bacterial species forming nitrogen-fixing symbioses with plants of the genus Medicago. However, genome coverage was low or the isolates had a broad geographic distribution, making it difficult to interpret the estimated diversity and to unravel the early events underlying population genetic variations and ecological differentiation.Entities:
Keywords: Diversity; Genome-wide sequencing; Genomic islands; Group II introns; Illumina technology; Insertion sequences; Polymorphism; Population genomics; Recombination
Mesh:
Substances:
Year: 2016 PMID: 27495742 PMCID: PMC4974801 DOI: 10.1186/s12864-016-2878-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Type of nucleotides segregating in the GR4-type isolates
| Total Sites | Chr/sites | pSymB/sites | pSymA/sites | Chr/SNPsa | pSymB/SNPsa | pSymA/SNPsa | Total SNPs | |
|---|---|---|---|---|---|---|---|---|
| sSNPs | 109 | 56 | 34 | 19 | 126 | 56 | 47 | 229 |
| nSNPs | 178 | 80 | 53 | 45 | 201 | 129 | 115 | 445 |
| iSNPs | 83 | 41 | 20 | 22 | 78 | 84 | 78 | 240 |
| All SNPs | 370 | 177 | 107 | 86 | 405 | 269 | 240 | 914 |
| Chr | pSymB | pSymA | Total | |||||
| Size of Replicon 3,620,713 | 1,701,381 | 1,417,906 | 6,740,001 |
aTotal SNPs per replicon in the isolates
Fig. 1Distribution of SNPs on the replicons. The names of the replicons are indicated. sSNPs are indicated in blue, nsSNPs in pink and iSNPs are shown in black. Light green and white background, respectively; indicate island and non-island regions in the most likely global assignment of CpG islands to the sequence. To improve their visualization the SNPs are placed out of the sequence
Fig. 2Phylogeny of GR4-type isolates based on alignments of SNP types. Phylogeny based on all SNPs (370 nt positions). Phylogenetic trees were inferred from Bayesian analyses and posterior probabilities (≥50 %) are indicated at the nodes. The names of the isolates and clades are indicated
Neutrality tests
| Concatenated coding sequences carrying sSNPs and nsSNPs | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| nt positions | Sa | πb | θc | Dd | D(Syn)d | D(nSyn)d | D*e | F*f | Fsg | |
| Chromosome | 146,721 | 123 | 0.000135 | 0.000264 | −2.18** | −2.23** | −2.09** | −2.79** | −3.01** | −3.78** |
| 1st half | 64,742 | 56 | 0.000146 | 0.000272 | −2.03* | −2.10** | −1.76 | −2.54** | −2.76** | −6.85** |
| 2nd half | 81,972 | 67 | 0.000127 | 0.000257 | −2.24** | −2.25** | −2.09** | −2.90** | −3.12** | −6.39** |
| pSymB | 76,977 | 67 | 0.000142 | 0.000274 | −2.12** | −1.83* | −2.19** | −2.60** | −2.84** | −6.13** |
| pSymA | 66,945 | 58 | 0.000136 | 0.000272 | −2.20** | −2.12** | −2.15** | −2.74** | −2.98** | −7.04** |
a Number of segregating sites
b The mean number of pairwise nucleotide differences per site
c The number of segregating mutations per site
d Tajima’s D statistic, Syn (synonymous sites), nSyn (non-synonymous sites)
e Fu and Li’s D* statistic
f Fu and Li’s F* statistic
g Fu’s Fs statistic
* Statically significant result (P-value <0.05)
** Statically significant result (P-value <0.01 or <0.02)
Fig. 3Mismatch distribution analysis for concatenated genes carrying SNPs. Graphs of the mismatch distribution are shown for each replicon and for the two halves of the chromosome. The x-axis shows the observed distribution of pairwise nucleotide differences, with frequency plotted on the y-axis. The values of the moment estimator Tau (τ), and the Raggedness index r and R statistics are shown
Fig. 4The maximum clade credibility tree estimated with BEAST 2, based on concatenated genes carrying SNPs. a Chromosome. b pSymB. c pSymA. d All concatenated genes carrying SNPs. Node bars indicate the 95 % credible intervals (95 % HPD) for node ages, and posterior probabilities (>0.5) are indicated (in italics) on the branches. Relevant node ages (years) and clades are indicated. The root of the unrooted tree was placed at the mid-point of the longest distance between two taxa in the tree
Fig. 5Bayesian skyline plots based on the concatenated genes carrying SNPs for each replicon. a Chromosome. b pSymB. c pSymA. d All concatenated genes carrying SNPs. The median estimates are shown as solid lines, and the blue areas indicate the limits of the 95 % HPD. The vertical line indicates the beginning of the stabilization period. Time (x-axis) is expressed in years ago. The effective population size is indicated on the y-axis
Fig. 6Pyrimidine/purine dinucleotide CT/GA-rich MRI (mid-range inhomogeneity) region. The identified MRI and flanking sequences in the pSymA of GR4-type isolates and that conserved on the chromosomes of S. meliloti and S. medicae are shown. Residues in color are different from the consensus sequence. The location of the replicon and GenBank accession numbers are shown