| Literature DB >> 25954285 |
Carole F S Koning-Boucoiran1, G Danny Esselink1, Mirjana Vukosavljev1, Wendy P C van 't Westende1, Virginia W Gitonga1, Frans A Krens1, Roeland E Voorrips1, W Eric van de Weg1, Dietmar Schulz2, Thomas Debener2, Chris Maliepaard1, Paul Arens1, Marinus J M Smulders1.
Abstract
In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.Entities:
Keywords: EST; Rosa; SNP array; assembly; genotyping; transcriptomics
Year: 2015 PMID: 25954285 PMCID: PMC4404716 DOI: 10.3389/fpls.2015.00249
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Overview of the strategy used to assemble the rose EST data, to mine SNPs, and to develop the WagRhSNP Axion SNP array. (A) Pictures of the plant material used to isolate RNA. (B) Sequencing of the three sets. (C) Data analysis: (i) SNP mining in the three sample sets (GR, garden roses; MC, Morden Centennial; RND, Red New Dawn), (ii) development of the WagRhSNP Axiom® genotyping array, (iii) transcriptome assembly, and (iv) identification and annotation of sequences orthologous with F. vesca. All SNPs are in Supplementary file ESM2. The SNPs identified in set 1) are coded with a ‘K’, those in 2a) with a ‘G’, in 2b) with a ‘M’ and in 3) with a ‘D’.
Sequencing results and analysis.
*Number of paired-end reads and single reads per sample after quality trimming and merging during pre-processing of the data.
Number of SNPs mined in the three rose datasets.
| K5 | 19080 | 1134.5 | 952.2 | 643.0 | 2017.6 | 1–123 | 0.6 | 0.41 |
| GR | 51106 | 1342.9 | 994.0 | 787.9 | 2700.3 | 1–96 | 0.5 | 0.40 |
| Rh88 | 5493 | 1296.2 | 740.2 | 48.5 | 34.7 | 1–57 | 0.4 | 0.53 |
STD = standard deviation.
Transcripts containing at least one reliable SNP.
Average of the number of reads per transcript.
GR: All 12 garden rose cultivars included.
Figure 2Distribution of reliable SNPs per transcript of K5 (blue bars), GR (red bars), and Rh88 (green bars).
Figure 3Number of proteins (blue bars) and their respective transcripts (red dots) in which they were identified for relevant protein functions in the sampled tissues.
List of the top 20 gene ontology (GO) terms the most represented among the annotated transcripts from cut and garden roses, for each of three main GO categories.
| GO:0005515 | Protein binding | 1871 | Molecular function 38.2% |
| GO:0003677 | DNA binding | 1077 | |
| GO:0004674 | Protein serine/threonine kinase activity | 978 | |
| GO:0003676 | Nucleic acid binding | 881 | |
| GO:0003700 | Transcription factor activity | 828 | |
| GO:0005488 | Binding | 613 | |
| GO:0003824 | Catalytic activity | 588 | |
| GO:0016491 | Oxidoreductase activity | 553 | |
| GO:0003723 | RNA binding | 532 | |
| GO:0005524 | ATP binding | 464 | |
| GO:0000166 | Nucleotide binding | 410 | |
| GO:0009055 | RNA binding Electron carrier activity | 376 | |
| GO:0043565 | Sequence-specific DNA binding | 316 | |
| GO:0004888 | Transmembrane receptor activity | 265 | |
| GO:0016301 | Kinase activity | 262 | |
| GO:0004497 | Monooxygenase activity | 251 | |
| GO:0016758 | Transferase activity | 242 | |
| GO:0020037 | Heme binding | 241 | |
| GO:0004553 | Hydrolase activity | 218 | |
| GO:0005215 | Transporter activity | 205 | |
| GO:0006468 | Protein amino acid phosphorylation | 1027 | Biological process 38.3% |
| GO:0055114 | Oxidation reduction | 1012 | |
| GO:0006355 | Regulation of transcription, DNA-dependent | 644 | |
| GO:0008152 | Metabolic process | 553 | |
| GO:0055085 | Transmembrane transport | 534 | |
| GO:0006508 | Proteolysis | 466 | |
| GO:0006915 | Apoptosis | 423 | |
| GO:0007165 | Signal transduction | 396 | |
| GO:0006952 | Defense response | 340 | |
| GO:0005975 | Carbohydrate metabolic process | 336 | |
| GO:0006412 | Translation | 329 | |
| GO:0045087 | Innate immune response | 260 | |
| GO:0006457 | Protein folding | 258 | |
| GO:0009651 | Response to salt stress | 244 | |
| GO:0046686 | Response to cadmium ion | 232 | |
| GO:0006629 | Lipid metabolic process | 186 | |
| GO:0009793 | Embryo development ending in seed dormancy | 176 | |
| GO:0009737 | Response to abscisic acid stimulus | 170 | |
| GO:0006886 | Intracellular protein transport | 165 | |
| GO:0006810 | Transport | 162 | |
| GO:0005634 | Nucleus | 1401 | Cellular compound 20.8% |
| GO:0016020 | Membrane | 1175 | |
| GO:0005886 | Plasma membrane | 1172 | |
| GO:0016021 | Integral to membrane | 634 | |
| GO:0009507 | Chloroplast | 566 | |
| GO:0005737 | Cytoplasm | 462 | |
| GO:0005622 | Intracellular | 452 | |
| GO:0005739 | Mitochondrion | 333 | |
| GO:0005773 | Vacuole | 306 | |
| GO:0005840 | Ribosome | 281 | |
| GO:0031224 | Intrinsic to membrane | 259 | |
| GO:0005829 | Cytosol | 243 | |
| GO:0005783 | Endoplasmic reticulum | 205 | |
| GO:0009941 | Chloroplast envelope | 199 | |
| GO:0009570 | Chloroplast stroma | 188 | |
| GO:0005618 | Cell wall | 177 | |
| GO:0009505 | Plant-type cell wall | 173 | |
| GO:0005730 | Nucleolus | 112 | |
| GO:0005794 | Golgi apparatus | 98 | |
| GO:0009535 | Chloroplast thylakoid membrane | 96 | |
| Others 3% | |||