| Literature DB >> 27206485 |
Nikol Voutsina1, Adrienne C Payne1, Robert D Hancock2, Graham J J Clarkson3, Steve D Rothwell3, Mark A Chapman1, Gail Taylor4.
Abstract
BACKGROUND: Consuming watercress is thought to provide health benefits as a consequence of its phytonutrient composition. However, for watercress there are currently limited genetic resources underpinning breeding efforts for either yield or phytonutritional traits. In this paper, we use RNASeq data from twelve watercress accessions to characterize the transcriptome, perform candidate gene mining and conduct differential expression analysis for two key phytonutritional traits: antioxidant (AO) capacity and glucosinolate (GLS) content.Entities:
Keywords: Antioxidant capacity; Brassicaceae; Differential expression; Gluconasturtiin; Glucosinolates; Nasturtium officinale; Phenylpropanoid pathway; RNASeq; Watercress; de novo assembly
Mesh:
Substances:
Year: 2016 PMID: 27206485 PMCID: PMC4875719 DOI: 10.1186/s12864-016-2704-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Phenotypic data describing the watercress sequenced in this project. Antioxidant (AO) capacity was assessed using the FRAP antioxidant assay. Gluconasturtiin concentration, the primary glucosinolate (GLS) in watercress, was assessed using HPLC-MS. Concentration of gluconasturtiin was then quantified for this study as the ratio of the compounds peak area over the peak area of the internal standard (sinigrin). NA specifies that no data or classification is available for that sample
| Sample | Antioxidant capacitya | AO group | Gluconasturtiin | GLS group |
|---|---|---|---|---|
| (mmol Fe2+ equivalent/g fresh weight) | (Peak area ratio) | |||
| NAS080 | 501 | Control | 11.1 | Control |
| NAS081 | 837 | High | 7.4 | Low |
| NAS057 | 808 | High | 15.9 | High |
| NAS092 | 803 | High | 12.9 | NA |
| NAS095 | 841 | High | 12.0 | NA |
| NAS058 | 903 | High | 15.1 | High |
| NAS061 | 373 | Low | 14.5 | High |
| NAS068 | 185 | Low | 9.4 | Low |
| NAS066 | 405 | Low | 11.4 | NA |
| NAS070 | 271 | Low | 11.0 | NA |
| NAS093 | 327 | Low | 7.0 | Low |
| NAS065 | NA | NA | NA | NA |
aAntioxidant data modified from Payne et al. [33]
Per sample returns from RNA sequencing on an Illumina Hiseq2500 of 12 samples. This table indicates the total number of fragments sequenced per sample, the number of reads remaining after removal of poor quality reads (Q <15), and the percentage of total reads removed
| Sample | Total fragments sequenced | Reads with Q >15 | % Reads removed |
|---|---|---|---|
| NAS080 | 28128352 | 27988115 | 0.5 |
| NAS081 | 25972028 | 25863143 | 0.4 |
| NAS057 | 24014626 | 23897299 | 0.5 |
| NAS092 | 23467409 | 23383732 | 0.4 |
| NAS095 | 24974526 | 24847043 | 0.5 |
| NAS058 | 28504130 | 28299106 | 0.7 |
| NAS061 | 27430238 | 27263143 | 0.6 |
| NAS065 | 30038254 | 29802658 | 0.8 |
| NAS068 | 30021260 | 29843121 | 0.6 |
| NAS066 | 26151350 | 25992461 | 0.6 |
| NAS070 | 28834483 | 28627104 | 0.7 |
| NAS093 | 26291267 | 26205166 | 0.3 |
Descriptors of the assemblies completed using differing settings to assess the nature of the data. The assemblies use RNASeq data from a commercially active watercress accession, NAS080. Underlined assembly k2g15d2 (k-mer overlap: 2, maximum gap permitted within path: 15 bases, maximum differences allowed within a path: 2) was taken forward as the reference transcriptome for watercress
| Assembly | Min k-mer coverage | Max gap allowed | Max differences allowed | Total transcripts | Total components | % GC | N50 |
|---|---|---|---|---|---|---|---|
| k2g10d2 | 2 | 10 | 2 | 91102 | 48635 | 41.12 | 1587 |
| k2g15d0 | 2 | 15 | 0 | 87823 | 48709 | 41.09 | 1574 |
|
|
|
|
|
|
|
|
|
| k2g15d4 | 2 | 15 | 4 | 87945 | 48717 | 41.09 | 1574 |
| k2g15d8 | 2 | 15 | 8 | 87923 | 48701 | 41.08 | 1575 |
| k1g15d2 | 1 | 15 | 2 | 119516 | 79201 | 40.72 | 1534 |
| k1g15d4 | 1 | 15 | 4 | 119564 | 79225 | 40.74 | 1532 |
Fig. 1Assembled transcript length distribution. Frequency histogram showing the distribution of transcript length in the watercress reference transcriptome
Fig. 2Gene ontology description of the watercress transcriptome. Histogram illustrating the number of genes in the reference watercress transcriptome belonging to GO terms for Biological Process, Molecular Function or Cellular Component categories
Fig. 3Highlighted gene ontology categories in high antioxidant watercress. Barplot depicting standardized gene count (ratio of gene count in that category over total gene count) of significantly overrepresented GO terms in the AO DE genes in comparison the reference transcriptome
Fig. 4Expression levels throughout the shikimate pathway in high and low GLS watercress. Representation of the shikimate biosynthesis pathway with expression levels, as standardized mean counts (± standard error of the mean), of the best match transcript for high and low glucosinolate accessions. Chorismate synthase (AT1G48850) did not have a BLAST hit to the watercress transcriptome
Fig. 5Expression levels throughout phenylalanine biosynthesis in high and low GLS watercress. Representation of the most common phenylalanine biosynthesis pathway in plants with expression levels, as standardized mean counts (± standard error of the mean), of the best match transcript for high and low glucosinolate accessions. Prephenate aminotransferase did not have an available consensus sequence at this time
Fig. 6Expression levels of genes in GLS biosynthesis in high and low GLS watercress. Mean expression levels (± standard error of the mean), as standardized mean counts, of watercress transcripts similar to known glucosinolate biosynthesis genes in high and low glucosinolate accessions