| Literature DB >> 25155272 |
Sue Sherman-Broyles1, Aureliano Bombarely2, Jane Grimwood3, Jeremy Schmutz3, Jeff Doyle2.
Abstract
Organelle sequences have a long history of utility in phylogenetic analyses. Chloroplast sequences when combined with nuclear data can help resolve relationships among flowering plant genera, and within genera incongruence can point to reticulate evolution. Plastome sequences are becoming plentiful because they are increasingly easier to obtain. Complete plastome sequences allow us to detect rare rearrangements and test the tempo of sequence evolution. Chloroplast sequences are generally considered a nuisance to be kept to a minimum in bacterial artificial chromosome libraries. Here, we sequenced two bacterial artificial chromosomes per species to generate complete plastome sequences from seven species. The plastome sequences from Glycine syndetika and six other perennial Glycine species are similar in arrangement and gene content to the previously published soybean plastome. Repetitive sequences were detected in high frequencies as in soybean, but further analysis showed that repeat sequence numbers are inflated. Previous chloroplast-based phylogenetic trees for perennial Glycine were incongruent with nuclear gene-based phylogenetic trees. We tested whether the hypothesis of introgression was supported by the complete plastomes. Alignment of complete plastome sequences and Bayesian analysis allowed us to date putative hybridization events supporting the hypothesis of introgression and chloroplast "capture."Entities:
Keywords: divergence dates; incongruence; inversions; repetitive sequences
Mesh:
Substances:
Year: 2014 PMID: 25155272 PMCID: PMC4199708 DOI: 10.1534/g3.114.012690
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Partial representation of legume phylogeny. Fourteen genera with sequenced plastomes are named. Asterisks indicate genera with multiple species with plastome sequences.
Summary statistics from complete Glycine chloroplast sequences
| Plastome Group | Genome Group | Total | LSC | SSC | IR | Duplicated | %GC | ||
|---|---|---|---|---|---|---|---|---|---|
| G | G | 152218 | 83174 | 17896 | 25574 | 478 | 0.34 | ||
| A | A | 152794 | 83844 | 17840 | 25555 | 463 | 61 | 0.353 | |
| A | A/D | 152804 | 83815 | 17807 | 25591 | 490 | 61 | 0.353 | |
| A | A | 152533 | 83559 | 17844 | 25565 | 463 | 61 | 0.353 | |
| A | D | 152728 | 83773 | 17829 | 25563 | 463 | 61 | 0.353 | |
| A | F | 153023 | 84027 | 17846 | 25575 | 463 | 59 | 0.353 | |
| B | B | 152618 | 83937 | 17817 | 25432 | 463 | 61 | 0.353 | |
| C | C | 152381 | 89368 | 17801 | 25505 | 463 | 61 | 0.353 |
Length in base pairs of the chloroplast and the three major divisions. Extent of border genes within the IR. GC content in each chloroplast genome.
Saski .
Figure 2Gene map of Glycine syndetika chloroplast genome. Gray bars on inner circle represent the extent of the inverted repeats (IRa and IRb). Yellow bar on middle circle represents 51-kb inversion found in papillionoid legumes. Genes on the outside of the circle are transcribed in a clockwise direction, and genes on the inside of the circle are transcribed in a counter-clockwise direction.
Figure 3Sequence similarity plot generated by VISTA tools. Base sequence is G. syndetika. Intergenic regions are pink. Shuffle LAGAN option used to align Phaseolus and Vigna that have 78-kb inversions. Hypervariable region identified by Magee is demarcated by red line. Gene with low sequence similarity to other angiosperms, ycf4 is indicated, along with five other genes for orientation. Only one inverted repeat is shown.
Number and types of repeated sequences
| RePuter | Dispersed Repeats | RepeatMasker/Repeat Finder | Low Complexity | ||||
|---|---|---|---|---|---|---|---|
| Species | Repeat Sequences | Unique Locations | Unique Sequences | Tandem Repeats | Repeat Sequences | Low Complexity | SSR |
| 57 | 19 | 9 | 18 | ||||
| 104 | 22 | 10 | 28 | 83 | 56 | 8 | |
| 103 | 20 | 5 | 32 | 95 | 69 | 7 | |
| 104 | 20 | 5 | 29 | 94 | 69 | 7 | |
| 86 | 16 | 5 | 29 | 95 | 71 | 5 | |
| 108 | 28 | 8 | 30 | 92 | 69 | 4 | |
| 85 | 28 | 7 | 27 | 89 | 64 | 5 | |
| 96 | 19 | 6 | 31 | 82 | 58 | 5 | |
| 101 | 20 | 6 | 34 | 78 | 54 | 6 | |
| 62 | 9 | 3 | 21 | ||||
| 68 | 10 | 3 | 24 | ||||
| 46 | 10 | 3 | 21 | ||||
Repeated sequences were detected by RePuter, RepeatMasker, and Repeat Finder. Further analysis of dispersed repeats showed that many of the sequences shared similarity, reducing the number of unique repeat sequences and locations.
Saski .
Figure 4Locations of repetitive sequences in Glycine plastomes. Black horizontal lines represent coding regions in each plastome. Light green squares mark the location of dispersed repeat sequence 1 from REPuter. Dark green squares mark the location of 10 additional dispersed sequences. The locations of low complexity repeats are shown by blue lines. SSRs are shown by red lines as determined by RepeatScout. Asterisks represent repeats within coding sequence. Black diamonds represent repeats with sequence similarity to hAT repeat transposons.
Figure 5Comparison of Bayesian inference trees based on whole plastome alignment (A) and histone H3D alignment (B). Divergence dates calculated using BEAST. Bars represent the 95% HPD (highest posterior density) interval.
Pairwise SNPs and substitution rates
| Rate/SNPs | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 2066 | 2044 | 2074 | 2104 | 2094 | 2022 | 2384 | 8234 | ||
| 2.49E−09 | 175 | 210 | 307 | 638 | 1608 | 2026 | 8353 | ||
| 2.47E−09 | 2.22E−09 | 213 | 303 | 635 | 1596 | 2032 | 8341 | ||
| 2.50E−09 | 1.04E−09 | 2.70E−09 | 322 | 660 | 1627 | 2054 | 8331 | ||
| 2.54E−09 | 687 | 1640 | 2066 | 8368 | |||||
| 2.53E−09 | 1640 | 2074 | 8365 | ||||||
| 2.44E−09 | 2.74E−09 | 2.72E−09 | 2.77E−09 | 2.79E−09 | 2.79E−09 | 1284 | 8351 | ||
| 2.88E−09 | 3.70E−09 | 3.71E−09 | 3.75E−09 | 3.77E−09 | 3.78E−09 | 2.34E−09 | 8616 | ||
| 2.72E−09 | 2.76E−09 | 2.76E−09 | 2.75E−09 | 2.77E−09 | 2.77E−09 | 2.76E−09 | 2.85E−09 |
The number of SNPs between species appears above the diagonal, and the substitution rate was calculated using histone H3D divergence dates. Substitution rates in bold represent A plastome taxa with substitution rates that appear slower because of inferred chloroplast capture.