| Literature DB >> 26048340 |
Jonathan Brassac1, Frank R Blattner2.
Abstract
Polyploidization is an important speciation mechanism in the barley genus Hordeum. To analyze evolutionary changes after allopolyploidization, knowledge of parental relationships is essential. One chloroplast and 12 nuclear single-copy loci were amplified by polymerase chain reaction (PCR) in all Hordeum plus six out-group species. Amplicons from each of 96 individuals were pooled, sheared, labeled with individual-specific barcodes and sequenced in a single run on a 454 platform. Reference sequences were obtained by cloning and Sanger sequencing of all loci for nine supplementary individuals. The 454 reads were assembled into contigs representing the 13 loci and, for polyploids, also homoeologues. Phylogenetic analyses were conducted for all loci separately and for a concatenated data matrix of all loci. For diploid taxa, a Bayesian concordance analysis and a coalescent-based dated species tree was inferred from all gene trees. Chloroplast matK was used to determine the maternal parent in allopolyploid taxa. The relative performance of different multilocus analyses in the presence of incomplete lineage sorting and hybridization was also assessed. The resulting multilocus phylogeny reveals for the first time species phylogeny and progenitor-derivative relationships of all di- and polyploid Hordeum taxa within a single analysis. Our study proves that it is possible to obtain a multilocus species-level phylogeny for di- and polyploid taxa by combining PCR with next-generation sequencing, without cloning and without creating a heavy load of sequence data.Entities:
Keywords: Evolution; Hordeum; in silico cloning; multispecies coalescent; nuclear single-copy genes; phylogeny; polyploidy; systematics
Mesh:
Substances:
Year: 2015 PMID: 26048340 PMCID: PMC4538882 DOI: 10.1093/sysbio/syv035
Source DB: PubMed Journal: Syst Biol ISSN: 1063-5157 Impact factor: 15.683
Taxa included in the study
| Taxon | Ploidy level (N)a | Haploid genome | Distribution area |
|---|---|---|---|
| | |||
| | |||
| subsp. | 2x (2) | SW Asia | |
| | 2x (1), 4x (3) | Mediterranean to C Asia | |
| | |||
| subsp. | 2x (2) | Mediterranean to C Asia | |
| subsp. | 4x (2) | NW Europe to Caucasus | |
| subsp. | 4x (2), 6x (1) | Mediterranean to C Asia | |
| | |||
| | 2x (2), 4x (2) | Mediterranean to C Asia | |
| | 2x (2) | Mediterranean | |
| | |||
| | 2x (3) | C Asia | |
| | 2x (5), 4x (4), 6x (3) | C Asia | |
| | 2x (2) | C Asia | |
| | 2x (3) | SW California | |
| | 2x (2) | Chile and W Argentina | |
| | 2x (3) | S Argentina | |
| | 2x (2) | C Argentina | |
| | 2x (1) | C Argentina | |
| | 2x (3) | C Argentina, Uruguay | |
| | 2x (1) | E+C Argentina | |
| | 2x (3) | SW California, NW Mexico | |
| | 2x (2) | C to N Andes | |
| | 2x (3) | S Argentina | |
| | 2x (2) | S Argentina | |
| | 2x (2) | C+E USA | |
| | 2x (2) | C Argentina | |
| | 4x (2) | W USA | |
| Interserial allopolyploids of series | |||
| | 4x (2) | W North America, Kamchatka, Newfoundland | |
| | 4x (2) | S Argentina, S Chile | |
| | 4x (1) | Guatemala, S Mexico | |
| | 4x (2) | NE Asia, NW+W North America, C Argentina | |
| | 4x (4) | C Argentina | |
| | 6x (3) | SW USA | |
| | 6x (3) | C+S Argentina | |
| | 6x (3) | C Argentina | |
| | 6x (2) | S Argentina | |
| | 6x (1) | C California | |
| | 4x (2) | S Africa | |
| | 4x (2) | Mediterranean to W Europe | |
| | 2x (1) | ||
| | 2x (2) | ||
| | 2x (1) | ||
| | 2x (1) | ||
| | 2x (1) | ||
| | 2x (1) | ||
| | 2x (1) | ||
aNumber of individuals included per species or cytotype.
bSpecies with subspecies not further detailed here.
Information on individual data sets and setting used in the analyses
| Locus | Alignment length | Parsimony-informative sites | Variable sites | Model of evolutiona | Clock for *BEAST |
|---|---|---|---|---|---|
| 752 (753) | 123 (184) | 147 (250) | HKY+G (1) | RLC | |
| 4408 (4603) | 238 (387) | 310 (524) | HKY+G (2) | RLC | |
| 915 (1121) | 138 (307) | 199 (448) | HKY+G (1) | RLC | |
| 1631 (1631) | 260 (409) | 293 (584) | HKY+G (2) | RLC | |
| 844 (889) | 99 (176) | 128 (268) | K80+G | b | |
| 2094 (2707) | 268 (367) | 327 (562) | HKY+G (3) | RLC | |
| 4913 (5163) | 519 (993) | 838 (1333) | HKY+G (4) | RLC | |
| 1490 (1495) | 119 (211) | 249 (412) | HKY+G (1) | RLC | |
| 1647 (1736) | 104 (236) | 183 (379) | HKY+G (3) | RLC | |
| 1092 (1101) | 129 (211) | 164 (303) | HKY+G (1) | RLC | |
| 1256 (2134) | 142 (225) | 165 (325) | HKY+G (1) | RLC | |
| 1487 (1509) | 207 (309) | 254 (465) | HKY+G (1) | RLC | |
| 2606 (2628) | 95 (100) | 203 (210) | HKY+G | RLC | |
| Supermatrix | 25135 (24996) | 2852 (3633) | 4915 (5469) | c | b |
Notes: Values in brackets correspond to alignments including polyploids; RLC, random local clock.
aModels used for single locus analyses, in brackets models linked in the *Beast analysis.
bData set not included in the *Beast analysis.
cData set consisting of all loci and divided in five partitions.
Loci initially explored in the study and location on barley chromosomes
| Locus name | RAP2 description | Chromosomea | No. of reads | Coverage mean ± 1 SD |
|---|---|---|---|---|
| Kinesin, motor region domain containing protein | 1H | 62,836 | 518 ± 340.0 | |
| Similar to COP9 signalosome complex subunit 5b | 2H (?) | 51,409 | 305 ± 214.4 | |
| Xylose isomerase | 2HS | 87,926 | 547 ± 329.8 | |
| Ubiquitin domain containing protein | 3HL | 53,286 | 334 ± 243.1 | |
| Nucellin | 4HL | 31,916 | 260 ± 275.4 | |
| Similar to SAC domain protein 1 (FIG4-like protein AtFIG4) | 4HL | 151,385 | 490 ± 334.9 | |
| Proteasome subunit beta type 2 | 4HS | 97,610 | 380 ± 389.5 | |
| Barley leucin zipper | 5HL | 56,064 | 376 ± 236.8 | |
| Peptidase S16, ATP-dependent protease La family protein | 5HL | 71,133 | 374 ± 342.0 | |
| Topoisomerase VI subunit B | 5HL | 27,416 | 226 ± 159.1 | |
| Conserved hypothetical protein | 5HL (?) | Na | Na | |
| Similar to Nucleoside diphosphate kinase II, chloroplast precursor | 5HS | 71,439 | 502 ± 327.2 | |
| Heat shock protein Hsp70 family protein | 6HL | 49,843 | 297 ± 241.7 | |
| Beta 5 subunit of 20S proteasome | 7HS (?) | Na | Na | |
| Maturase K | cp-LSC | 179,382 | 426 ± 284.2 | |
| Subunit 6 of NADH-dehydrogenase | cp-SSC | Na | Na |
aChromosome locations were checked by blasting sequences obtained against the barley genome, no significant result was obtained from TNAC1142, locations for this locus and the other loci missing were inferred from synteny with rice and wheat. Questionmarks refer to not safely determined positions.
bndhF was excluded because of large amount of missing data and TNAC1577 and TNAC1781 were not single-copy loci and difficult to sequence (large homopolymer regions).
FPhylogenetic tree derived from the concatenated supermatrix consisting of 13 loci of the diploid Hordeum taxa and six out-group species calculated with Bayesian inference. Posterior probability values of the clades are indicated close to the nodes. Infrageneric treatment of the genus and genome denominations (bold letters along branches) in Hordeum follow Blattner (2009).
FCalibrated multispecies coalescent of the diploid Hordeum species as calculated with *Beast from all loci excluding NUC. Numbers along the branches are posterior probability values/concordance factors of the clades. Ages (in millions of years) were inferred from the crown clade age of B. distachyon + Triticeae at 44.4 Ma and from the divergence between Hordeum and Triticum lineages at 15.32 Ma (Marcussen et al. 2014); calibration points are depicted by asterisks, divergence dates are reported in Table 4.
Ages of the clades in the MSC analysis and CF
| Clade | (MSC/BI/MP) | Ages Myr (95% HPD) | Sample-wide CF (95% CI) |
|---|---|---|---|
| 0.43 (0.17,0.68) | 0.230 (0.083,0.417) | ||
| 0.73 (0.51,0.97) | 0.176 (0.000,0.333) | ||
| 0.19 (0.09,0. 30) | 0.586 (0.417,0.667) | ||
| 0.082 (0.000,0.250) | |||
| 0.87 (0.66,1.09) | 0.448 (0.333,0.583) | ||
| 0.44 (0.23,0.65) | 0.127 (0.000,0.250) | ||
| 0.69 (0.44,0.95) | 0.003 | ||
| 0.43 (0.17,0.68) | 0.596 (0.500,0.667) | ||
| 0.97 (0.74,1.23) | 0.088 (0.000,0.167) | ||
| 0.82 (0.51,1.14) | 0.255 (0.167,0.333) | ||
| 1.12 (0.92,1.35) | 0.134 (0.083,0.167) | ||
| New World I-clade taxa (0.98/1/94) | 1.46 (1.22,1.72) | 0.186 (0.083,0.250) | |
| 0.058 (0.000,0.083) | |||
| 0.072 (0.000,0.167) | |||
| 0.131 (0.000,0.250) | |||
| Old World I-clade taxa (0.66/1/94) | 0.89 (0.42,1.51) | 0.161 (0.083,0.250) | |
| I clade (1/1/100) | 1.71 (1.38,2.07) | 0.176 (0.083,0.333) | |
| Xa clade (1/1/100 | 1.37(0.71,2.06) | 0.746 (0.583,0.833) | |
| I plus Xa clade (1/1/100) | 5.01 (4.01,6.05) | 0.220 (0.083,0.333) | |
| H clade (1/1/100) | 3.74 (2.67,4.80) | 0.643 (0.500,0.833) | |
| H plus Xu clade (1/1/100) | 8.13 (6.96,9.40) | 0.373 (0.250,0.500) | |
| 9.23 (8.07,10.45) | 0.919 (0.833,1.000) | ||
| Triticeae | 14.79 (14.13,15.45) | e | |
| 56.35 (51.08,61.49) | e | ||
Notes: MSC, multispecies coalescence; BI, Bayesian inference; MP, maximum parsimony; Myr, millions of years; HPD, highest posterior density; CF, concordance factor; CI, credibility interval.
aNode present only in the 26 loci supermatrix (Supplementary Fig. S17).
bNode present only in the *Beast analysis (Fig. 2).
cNode present in the 13 loci supermatrix MP analysis (Supplementary Fig. S16).
dNode present in the 13 loci supermatrix analyses (Fig. 1, Supplementary Fig. S16).
eNo CF available.
FScheme summarizing phylogenetic relationships of species and cytotypes in the genus Hordeum based on 12 loci and one chloroplast region. Diploid taxa were drawn directly at the tree obtained from MSC analysis (Fig. 2) modified to reflect differences with BI analysis based on 13 (Fig. 1) and 26 loci (Supplementary Fig. S17) and position of extinct taxa inferred from the BI tree including diploid and polyploids (Supplementary Fig. S19). Tetra- and hexaploids were connected to their inferred progenitors. Double lines indicate autopolyploid origin, dashed lines mark uncertainties and colors refer to the direction of crosses resulting in allopolyploid taxa (blue for paternal, red for maternal parent, black for both observed) based on the chloroplast data (Supplementary Fig. S14). Asterisks indicate incongruent relationships of polyploids recovered by chloroplast sequences (Jakob and Blattner 2006) compared to nuclear sequences that probably indicate polyphyletic origins.