| Literature DB >> 26635144 |
Chenxi Xu1, Chen Jiao2,3, Yi Zheng2, Honghe Sun2,4, Wenli Liu2, Xiaofeng Cai1, Xiaoli Wang1, Shuang Liu1, Yimin Xu2, Beiquan Mou5, Shaojun Dai1, Zhangjun Fei2,6, Quanhua Wang1.
Abstract
Spinach (Spinacia oleracea L.) is an economically important green leafy vegetable crop. In this study, we performed deep transcriptome sequencing for nine spinach accessions: three from cultivated S. oleracea, three from wild S. turkestanica and three from wild S. tetrandra. A total of approximately 100 million high-quality reads were generated, which were de novo assembled into 72,151 unigenes with a total length of 46.5 Mb. By comparing sequences of these unigenes against different protein databases, nearly 60% of them were annotated and 50% could be assigned with Gene Ontology terms. A total of 387 metabolic pathways were predicted from the assembled spinach unigenes. From the transcriptome sequencing data, we were able to identify a total of ~320,000 high-quality single nucleotide polymorphisms (SNPs). Phylogenetic analyses using SNPs as well as gene expression profiles indicated that S. turkestanica was more closely related to the cultivated S. oleracea than S. tetrandra. A large number of genes involved in responses to biotic and abiotic stresses were found to be differentially expressed between the cultivated and wild spinach. Finally, an interactive online database (http://www.spinachbase.org) was developed to allow the research community to efficiently retrieve, query, mine and analyze our transcriptome dataset.Entities:
Mesh:
Year: 2015 PMID: 26635144 PMCID: PMC4669492 DOI: 10.1038/srep17706
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of spinach transcriptome sequences.
| sample ID | Accession No. | Species | No. raw reads | No. high quality reads | No. rRNA reads | No. final cleaned reads | Final cleaned nucleotides |
|---|---|---|---|---|---|---|---|
| Sp40 | PI 608712 | 11,220,670 | 10,823,210 | 130,619 | 10,692,591 | 1,037,741,475 | |
| Sp42 | PI 647860 | 9,301,545 | 9,035,669 | 128,237 | 8,907,432 | 859,510,808 | |
| Sp43 | PI 647861 | 11,968,303 | 11,705,229 | 255,255 | 11,449,974 | 1,117,491,750 | |
| Sp49 | PI 647864 | 9,580,222 | 9,126,379 | 54,604 | 9,071,775 | 872,975,381 | |
| Sp50 | PI 647865 | 15,485,184 | 15,125,268 | 331,496 | 14,793,772 | 1,438,908,213 | |
| Sp51 | PI 662295 | 16,640,543 | 16,276,742 | 401,807 | 15,874,935 | 1,547,778,458 | |
| Sp78 | S13–32 | 11,174,738 | 10,645,123 | 93,995 | 10,551,128 | 1,025,072,557 | |
| Sp82 | JQSZ13-3 | 10,560,339 | 10,039,569 | 154,651 | 9,884,918 | 959,870,221 | |
| Sp90 | JQ13-1 | 8,445,922 | 8,255,054 | 198,762 | 8,056,292 | 789,521,055 | |
| Total | 104,377,466 | 101,032,243 | 1,749,426 | 99,282,817 | 9,648,869,918 |
aReads left after removing adaptor and low quality sequences.
bReads left after removing rRNA reads from the high quality reads.
Figure 1Length (A) and GC content (B) distribution of spinach unigenes.
Figure 2Neighbor-joining phylogenetic tree of spinach accessions on the basis of SNPs.
Summary of SNPs identified in spinach.
| category | No. SNPs |
|---|---|
| Three | 565 |
| Three | 1,143 |
| Two | 3,124 |
| Three | 1,690 |
| Three | 75,599 |
| Three | 75,924 |
| All eight accessions (excluding Sp40) | 76,352 |
| All nine accessions | 76,433 |
Out of 319,838 SNPs identified in spinach, only 76,433 homozygous SNPs with genotype information in all nine accessions were included in the summary.
Figure 3Venn diagram showing the number of genes which displayed higher (A) or lower (B) expression in three different comparisons: S. oleracea vs S. turkestanica, S. oleracea vs S. tetrandra, and S. turkestanica vs S. tetrandra.