| Literature DB >> 28900436 |
Lua Lopez1, Eva M Wolf1, J Chris Pires2, Patrick P Edger3,4, Marcus A Koch1.
Abstract
The rapidly falling costs and the increasing availability of large DNA sequence data sets facilitate the fast and affordable mining of large molecular markers data sets for comprehensive evolutionary studies. The Brassicaceae (mustards) are an important species-rich family in the plant kingdom with taxa distributed worldwide and a complex evolutionary history. We performed Simple Sequence Repeats (SSRs) mining using de novo assembled transcriptomes from 19 species across the Brassicaceae in order to study SSR evolution and provide comprehensive sets of molecular markers for genetic studies within the family. Moreover, we selected the genus Cochlearia to test the transferability and polymorphism of these markers among species. Additionally, we annotated Cochlearia pyrenaica transcriptome in order to identify the position of each of the mined SSRs. While we introduce a new set of tools that will further enable evolutionary studies across the Brassicaceae, we also discuss some broader aspects of SSR evolution. Overall, we developed 2012 ready-to-use SSR markers with their respective primers in 19 Brassicaceae species and a high quality annotated transcriptome for C. pyrenaica. As indicated by our transferability test with the genus Cochlearia these SSRs are transferable to species within the genus increasing exponentially the number of targeted species. Also, our polymorphism results showed substantial levels of variability for these markers. Finally, despite its complex evolutionary history, SSR evolution across the Brassicaceae family is highly conserved and we found no deviation from patterns reported in other Angiosperms.Entities:
Keywords: Brassicaceae; RNAseq/transcriptome; SSRs; data mining; molecular resources; sequencing platform
Year: 2017 PMID: 28900436 PMCID: PMC5581910 DOI: 10.3389/fpls.2017.01488
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Species' information table.
| Oreophytoneae | I | 1 | none | HiSeq2000 | |
| Alyssopsideae | I | 1 | none | HiSeq2000 | |
| Calepineae | Exp-II | 1 | none | HiSeq2000 | |
| Cochlearieae | Exp-II | 1 | none | HiSeq2000 | |
| Kernereae | Exp-II | 1 | none | HiSeq2000 | |
| Buniadeae | III | 1 | none | HiSeq2000 | |
| Dontostemoneae | III | 1 | none | HiSeq2000 | |
| Stevenieae | Exp-II | 1 | none | HiSeq2000 | |
| Coluteocarpeae | Exp-II | 1 | none | HiSeq2000 | |
| Coluteocarpeae | Exp-II | 1 | none | HiSeq2000 | |
| Arabideae | Exp-II | 2 (TZWR) | PPE-2012-1000-0003 (UMO) | HiSeq | |
| Brassiceae | II | 2 (IPWB) | PPE-2012-1000-0001 (UMO) | HiSeq | |
| Cleomaceae | outgroup | 2 (HELY) | PPE-2012-1000-0004 (UMO) | HiSeq | |
| Brassiceae | II | 2 (VMNH) | none | HiSeq | |
| Arabideae | Exp-II | 2 (HABV) | none | HiSeq | |
| Arabideae | Exp-II | 2 (GTSV) | none | HiSeq | |
| Arabideae | Exp-II | 2 (UVQL) | none | HiSeq | |
| Arabideae | Exp-II | 2 (LIQF) | none | HiSeq | |
| Arabideae | Exp-II | 2 (BXBF) | none | HiSeq |
Tribe, tribe which the species belongs to; Lineage, phylogenetic lineage to which the species belong within the Brassicaceae phylogeny as in Hohmann et al. (.
Figure 1Brassicaceae lineages included in the present study. Phylogeny adapted from Franzke et al. (2011). Purple stars marks tribes with samples included in this study. Genome size (1 Cx-value) indicates mean tribal value (whole bar = 4.33pg). Percentage of polyploids indicates neopolyploids (auto- and allopolyploids; Hohmann et al., 2014).
Species from the genus Cochlearia used for the polymorphism study.
| Caes0741 | RBGE Gill Stelle s.n. | Asturias, Spain. | |
| Calp0759 | HEID BNr. 504202 | Teesdale, UK. | |
| Cexc1253 | HEID BNr. 404206 | Mt. Seckauer Zinken, Austria. | |
| Cgro0474 | O 894768 | Svalbard, Norway. | |
| Cgro1038 | O 990848 | Nome Census Area, Alaska. | |
| Cisla1233 | HEID BNr.503296 | Stokkseyri, Iceland. | |
| Cpyr0260 | HEID BNr.501471 | Kelmis, Belgium. | |
| Cpyr0310 | HEID BNr.480392 | Türnitz, Austria. | |
| Cpyr0456 | HEID BNr.503877 | Carpathian Mountains, Slovakia. | |
| Cpyr0560 | HEID BNr.921491 | Allgäu, Germany. | |
| Cpyr0699 | HEID RBGE Gill Stelle GS8 | Gordale Scar, Scotland. | |
| Cpyr1222 | HEID BNr.503293 | Asturias, Spain. | |
| Cses1285 | HEID BNr.503294 | Kodiak Island, Alaska. | |
| Ctri1287 | MT MT00127822 | L'Anse Amour, Canada. |
ID, individual ID used to identify each sample in this study; Voucher, herbarium acronym and number identifying the herbarium voucher, Species, species name and Location, name of the location and country where the individual was collected.
Results' summary for each species.
| 0.2 | HiSeq2000 | 14771107 | 13824138 | 40433 | 87.55 | 92.26 | 1005 | 1/21.99 | 128 | |
| 0.19 | HiSeq2000 | 19418002 | 18035058 | 44010 | 82.25 | 94.25 | 1808 | 1/13.51 | 111 | |
| 0.25 | HiSeq2000 | 16020733 | 14949117 | 51415 | 73.73 | 93.62 | 1516 | 1/16.08 | 100 | |
| 0.4 | HiSeq2000 | 110411546 | 102682738 | 51572 | 83.91 | 94.46 | 2783 | 1/12.25 | 160 | |
| 0.2 | HiSeq2000 | 14995455 | 13932451 | 41932 | 86.53 | 92.78 | 1316 | 1/35.28 | 115 | |
| 2.63 | HiSeq2000 | 20013688 | 18704854 | 60941 | 82.92 | 94.67 | 1533 | 1/20.71 | 112 | |
| 3.97 | HiSeq2000 | 26930913 | 25034919 | 71021 | 77.97 | 94.25 | 1418 | 1/23.51 | 96 | |
| 0.53 | HiSeq2000 | 23012277 | 21347707 | 97523 | 72.75 | 95.19 | 2524 | 1/15.22 | 128 | |
| 0.26 | HiSeq2000 | 16766557 | 15823266 | 35086 | 90.13 | 89.44 | 1294 | 1/15.01 | 135 | |
| 0.33 | HiSeq2000 | 11364197 | 11364197 | 49653 | 86.40 | 92.47 | 2095 | 1/11.77 | 130 | |
| 0.38 | HiSeq | 10881467 | 10713549 | 44322 | 90.27 | 90.27 | 795 | 1/19.87 | 110 | |
| 0.64 | HiSeq | 12275029 | 12118888 | 56228 | 80.33 | 691 | 1/20.58 | 130 | ||
| NA | HiSeq | 9196502 | 9048135 | 40529 | 82.42 | 89.33 | 1448 | 1/11.67 | 97 | |
| NA | HiSeq | 9645203 | 9335024 | 40118 | 80.73 | 61.19 | 415 | 1/18.80 | 81 | |
| NA | HiSeq | 10148457 | 9768317 | 36938 | 82.19 | 62.45 | 418 | 1/18.97 | 72 | |
| 0.66 | HiSeq | 11335649 | 11131989 | 53209 | 58.16 | 434 | 1/22.11 | 64 | ||
| NA | HiSeq | 9270733 | 8915641 | 39175 | 77.89 | 62.76 | 448 | 1/18.69 | 61 | |
| 0.42 | HiSeq | 15233675 | 14973184 | 63087 | 78.77 | 717 | 1/20.89 | 80 | ||
| 0.53 | HiSeq | 16401105 | 16186125 | 60288 | 84.00 | 768 | 1/21.51 | 102 |
Seq., type of Illumina sequencing technology used; 1C, holoploid genome size in pg (.
Figure 2SSR's types distribution across species. In the x-axis, each type of motif is indicated while the y-axis refers to the percentage of each type of motif in each species. Each species is denoted by a color as shown in the legend.
Figure 3SSR's motifs distribution across species. The x-axis indicates species and the y-axis abundance in percentage. Each motif is denoted by a color as shown in the legend. Bars represents those motifs with ≥5% abundance. These motifs represent, depending on the species, between 55 and 75% of the total motifs' frequency.
ORF location and type distribution of SSRs used for the in silico polymorphism test in Cochlearia species.
| In-ORF | 5 | 63 | 4 | 0 | 2 | 74 |
| Out-ORF | 34 | 18 | 28 | 4 | 2 | 86 |
| Total | 39 | 81 | 32 | 4 | 4 | 160 |
In-ORF indicates SSR located within coding region while out-ORF denotes SSR outside coding region.
SSRs' information for the in silico polymorphism study in Cochlearia sp.
| c12574_g3_i1 | in | 379 | 399 | CAA | Uncharacterized protein. |
| c13603_g4_i1 | in | 417 | 437 | GGT | NADH-ubiquinone oxidoreductase 20.9 kDa subunit-like. |
| c13427_g2_i1 | in | 482 | 502 | TTC | Glucose-6-phosphate isomerase 1, chloroplastic. |
| c13710_g1_i1 | in | 227 | 247 | TCT | Protein with domain DUF314 -unknown function. |
| c13881_g1_i2 | in | 1844 | 1864 | CCA | Phosphatidylinositol-4-phosphate 5-kinase 1. |
| c147_g2_i1 | in | 159 | 179 | GTG | Transcription factor TCP15-like. |
| c15301_g1_i1 | in | 330 | 371 | CAAAAC | Uncharacterized protein. |
| c7760_g1_i1 | in | 276 | 296 | GGT | Uncharacterized protein with domain DUF616. |
| c13912_g1_i1 | out | 278 | 298 | TCA | No hit in homology search. |
| c13000_g1_i1 | out | 1112 | 1132 | GTA | No hit in homology search. |
| c14456_g1_i1 | out | 112 | 131 | AT | No hit in homology search. |
| c16762_g1_i1 | out | 248 | 267 | AC | No hit in homology search. |
| c25249_g1_i1 | out | 2453 | 2472 | AG | No hit in homology search. |
| c28231_g1_i1 | out | 135 | 155 | ACG | No hit in homology search. |
| c10839_g2_i2 | out | 762 | 781 | ATGG | No hit in homology search. |
| c13423_g2_i1 | out | 193 | 212 | TGCT | No hit in homology search. |
ID, as contig identifier; ORF, “in” designates SSR located in ORF while “out” points to SSR outside ORF; P.
SSR polymorphism in species from the genus Cochlearia. Alleles scored in each sample per SSR (indicated by length in bp).
| c12574_g3_i1 | 18–21 | 21 | 21 | 18 | 15–18 | 15–18 | 15 | 21 | 18 | 18–21 | 18 | 21–24 | 18 | 18 |
| c13603_g4_i1 | 9–12 | 12–24 | 12–15 | 21 | 12 | 24 | 24 | 12 | 12 | 15 | 18–24 | 12 | 21 | 21 |
| c13427_g2_i1 | 18–21 | 21 | 21–36 | 21–27 | 21 | 21–27 | 27–33 | 21–32 | 21 | 21 | 12 | 21 | 21 | 27 |
| c13710_g1_i1 | 18 | 18 | 18 | 18 | 18 | 18 | 24 | 18 | 18 | 21 | 15–18 | 12 | 18 | 18 |
| c13881_g1_i2 | 18–21 | 15–18 | 18–21 | 18 | 21 | 18 | 21 | 21 | 21 | 21 | 9–18 | 21 | 18 | 9–18 |
| c147_g2_i1 | 21 | 12 | 12 | 21 | 12 | 21 | 15 | 12 | 12 | 12 | 12 | 12 | 21 | 12–24 |
| c15301_g1_i1 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 |
| c7760_g1_i1 | 18–21 | 21 | 27–30 | 18–21 | 15 | 21 | 27 | 18–27 | 18 | 21 | 18–21 | 18 | 21 | 15–27 |
| c13912_g1_i1 | 12–18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 18 | 21 | 15–18 |
| c13000_g1_i1 | 15 | 15–21 | 15–18 | 15 | 15 | 15 | 12 | 18 | 15 | 12 | 18–21 | 12–15 | 15 | 18–21 |
| c14456_g1_i1 | 10 | 16 | 14 | 18 | 20 | 10 | 18 | 14 | 16 | 16 | 10–16 | 16 | 16 | 14–16 |
| c16762_g1_i1 | 20–22 | 18 | 24 | 18–26 | 18 | 18 | 14 | 20 | 20–22 | 18 | 12–26 | 14 | 20 | 18–24 |
| c25249_g1_i1 | 16–18 | 18 | 16 | 16 | 16 | 16 | 22 | 24 | 10 | 14 | NA | 18 | 16 | 14 |
| c28231_g1_i1 | 15 | 15 | 18 | 9 | 9 | 15 | 12–18 | 12–24 | 15 | 18 | NA | 15 | 9 | 9–15 |
| c10839_g2_i2 | 16 | 20–24 | 24 | 20–24 | 24 | 24 | 16 | 16 | 20 | 16 | 24–28 | 16–28 | 24 | 12–20 |
| c13423_g2_i1 | 20 | 20 | 20 | 20 | 20 | 12–20 | 16–20 | 20 | 20 | 20 | 20 | 20 | 20 | 16 |
Figure 4Standardized distance PCoA for Cochlearia samples. Each color represents a species from the genus Cochlearia and the country of origin is displayed next to the sample symbol. Both coordinates explain 51.24% of the variability (coord.1 accounted for 32.27% and coord.2 18.97%).