| Literature DB >> 25780758 |
Susan R Strickler1, Aureliano Bombarely2, Jesse D Munkvold1, Thomas York1, Naama Menda1, Gregory B Martin3, Lukas A Mueller1.
Abstract
Background. Studies of ancestry are difficult in the tomato because it crosses with many wild relatives and species in the tomato clade that have diverged very recently. As a result, the phylogeny in relation to its closest relatives remains uncertain. By using the coding sequence from Solanum lycopersicum, S. galapagense, S. pimpinellifolium, S. corneliomuelleri, and S. tuberosum and the genomic sequence from S. lycopersicum 'Heinz', an heirloom line, S. lycopersicum 'Yellow Pear', and two of cultivated tomato's closest relatives, S. galapagense and S. pimpinellifolium, we have aimed to resolve the phylogenies of these closely related species as well as identify phylogenetic discordance in the reference cultivated tomato. Results. Divergence date estimates suggest that the divergence of S. lycopersicum, S. galapagense, and S. pimpinellifolium happened less than 0.5 MYA. Phylogenies based on 8,857 coding sequences support grouping of S. lycopersicum and S. galapagense, although two secondary trees are also highly represented. A total of 25 genes in our analysis had sites with evidence of positive selection along the S. lycopersicum lineage. Whole genome phylogenies showed that while incongruence is prevalent in genomic comparisons between these genotypes, likely as a result of introgression and incomplete lineage sorting, a primary phylogenetic history was strongly supported. Conclusions. Based on analysis of these genotypes, S. galapagense appears to be closely related to S. lycopersicum, suggesting they had a common ancestor prior to the arrival of an S. galapagense ancestor to the Galápagos Islands, but after divergence of the sequenced S. pimpinellifolium. Genes showing selection along the S. lycopersicum lineage may be important in domestication or selection occurring post-domestication. Further analysis of intraspecific data in these species will help to establish the evolutionary history of cultivated tomato. The use of an heirloom line is helpful in deducing true phylogenetic information of S. lycopersicum and identifying regions of introgression from wild species.Entities:
Keywords: Genome; Incomplete lineage sorting; Introgression; Phylogeny; Selection; Solanum; Tomato
Year: 2015 PMID: 25780758 PMCID: PMC4358695 DOI: 10.7717/peerj.793
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Feature density of Yellow Pear, S. galapagense, and S.pimpinellifolium in comparison to H1706.
Red arrow points to putative introgression. (A) SNP density on chromosome 4 of sequenced genotypes. (B) SNP density on chromosome 5 of sequenced genotypes. (C) Read depth on chromosome 4 of sequenced genotypes. (D) Read depth on chromosome 5 of sequenced genotypes (E) Gene density on chromosome 4 based on H1706 annotations (F) Gene density on chromosome 5 based on H1706 annotations.
Figure 2Putative deletion size distribution in combined assemblies.
Pairwise estimates of nonsynonymous (dN), synonymous (dS) mean substitution rate.
Calculations are in comparison with S. lycopersicum ‘Yellow Pear’ and are based on 8,578 orthologous coding sequences for numbers not in parenthesis. Numbers in parenthesis are based on all usable coding sequences.
| Species | dN | dS |
|
|---|---|---|---|
|
| 0.0012 ± 0.0019 | 0.0037 ± 0.0059 | 0.3535 ± 2.0205 |
|
|
|
| |
|
| 0.0013 ± 0.0022 | 0.0043 ± 0.0065 | 0.4305 ± 2.9802 |
|
|
|
| |
|
| 0.0037 ± 0.0041 | 0.0151 ± 0.0123 | 0.3219 ± 1.0219 |
|
| 0.0332 ± 0.4361 | 0.1306 ± 1.3060 | 0.2386 ± 0.3127 |
Notes.
Maximum likelihood estimate, values >99 removed.
Figure 3Gene trees inferred from coding sequence of 8,796 Solanum species genes.
Phylogenetic trees were derived using maximum likelihood and were supported in at least 75 of 100 bootstrap replicates.
Divergence time estimates of selected species.
Calculations for Global Clock Method based on pairwise silent site substitutions for 3,611 genes. Calculations for coalescence method were performed with 8 genes. All genes used in calculations fit gene tree topology 1. Divergence date estimates are in reference to H1706.
| Species | dS | Divergence Date (MYA) |
|
|---|---|---|---|
|
| 0.0024 ± 0.0038 | 0.19 | 0.25 |
|
| 0.0053 ± 0.0066 | 0.44 | 0.45 |
|
| 0.0166 ± 0.0126 | 1.38 | 1.54 |
|
| 0.1335 ± 1.2383 | 11.07 | NA |
Notes.
Based on global clock method.
Based on coalescence method.
million years ago
Figure 4Tree topologies across selected chromosomes of H1706.
Coordinates are based on the H1706 reference genome. Posterior probabilities are shown for each tree. (A) Chromosome 4. (B) Chromosome 5. (C) Predominant tree topologies. 1 = YP; 2 = H1706; 3 = S. galapagense; 4 = S. pimpinellifolium; 5 = S. tuberosum.