| Literature DB >> 25287394 |
Pengshan Zhao1,2, Salvador Capella-Gutiérrez3,4,5, Yong Shi6,7, Xin Zhao8,9, Guoxiong Chen10,11, Toni Gabaldón12,13,14, Xiao-Fei Ma15,16.
Abstract
BACKGROUND: Sand rice (Agriophyllum squarrosum) is an annual desert plant adapted to mobile sand dunes in arid and semi-arid regions of Central Asia. The sand rice seeds have excellent nutrition value and have been historically consumed by local populations in the desert regions of northwest China. Sand rice is a potential food crop resilient to ongoing climate change; however, partly due to the scarcity of genetic information, this species has undergone only little agronomic modifications through classical breeding during recent years.Entities:
Mesh:
Year: 2014 PMID: 25287394 PMCID: PMC4459065 DOI: 10.1186/1471-2164-15-872
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Distribution of Sand rice in northern China. The map was carried out based on altitude by ArcGIS software (http://www.esri.com/software/arcgis/). Red dots represented our sampling sites during wild survey. The latitude and longitude of each dot were listed in Additional file 1.
Overview of sequence assembly for sand rice
| Length range (bp) | Contigs | Transcripts | Unigenes |
|---|---|---|---|
| 200-300 | 1,351,306 (96.03%) | 21,597 (20.74%) | 19,711 (29.10%) |
| 300-500 | 24,424 (1.74%) | 23,915 (22.97%) | 20,837 (30.76%) |
| 500-1000 | 15,609 (1.11%) | 19,725 (18.94%) | 12,693 (18.74%) |
| 1000-2000 | 10,162 (0.72%) | 21,023 (20.19%) | 8,679 (12.81%) |
| >2000 | 5,688 (0.40%) | 17,858 (17.15%) | 5,821 (8.59%) |
| Total number | 1,407,189 | 104,118 | 67,741 |
| Total length | 126,635,518 | 115,271,030 | 51,729,464 |
| N50 length | 136 | 1,950 | 1,343 |
| Mean length | 89.99 | 1107.12 | 763.63 |
Figure 2Analysis of the sand rice trancriptome. (A) Histogram of the length distribution of assembled unigenes. (B) Taxonomic distribution of the top BLAST hits for each unigene. (C) Length distribution of annotated and un-annotated unigenes was inferred with log(Length). (D) Expression level of annotated and un-annotated unigenes was inferred by log(RPKM).
Summary of sequence annotation for sand rice
| Annotated databases | Unigene | Percentage (%) |
|---|---|---|
| Nr | 28,920 | 42.69 |
| Swissprot | 20,730 | 30.60 |
| GO | 22,270 | 32.88 |
| COG | 9,728 | 14.36 |
| KEGG | 5,848 | 8.63 |
| Total | 29,048 | 42.88 |
Figure 3Species phylogeny of ten fully-sequenced plant species plus the two new caryophyllales with transcriptomic data. The phylogeny was based on the concatenation of 221 sets of one-to-one orthologous genes among the species used in this study. All aLRT branch support values were equal to 1.0 and, therefore, not shown in the tree. Different colour backgrounds highlights the relationships of the three main groups studied here: caryophyllales (light blue), asterids (Beige), and rosids (green). The placement of caryophyllales as basal to the branching of asterids and rosids was significantly more supported than an alternative placement as sister group of rosids.
Figure 4Comparative analysis of sand rice unigenes versus genes. (A) Histogram showing length distribution of sand rice unigenes with RBH. (B) Distribution of frequency versus percentage similarity (positive amino acid identity) of sand rice unigenes versus a Beta vulgaris peptide. (C) The 20 most highly represented BP GO terms of unigenes with RBH were shown.
Figure 5Information of candidate genes in sand rice transcriptome. (A) Histogram showing length distribution of the 1000 most highly expressed unigenes. (B) The 50 highly represented BP GO terms of unigenes were shown.