| Literature DB >> 23555859 |
Haikuo Fan1, Yong Xiao, Yaodong Yang, Wei Xia, Annaliese S Mason, Zhihui Xia, Fei Qiao, Songlin Zhao, Haoru Tang.
Abstract
BACKGROUND: Cocos nucifera (coconut), a member of the Arecaceae family, is an economically important woody palm grown in tropical regions. Despite its agronomic importance, previous germplasm assessment studies have relied solely on morphological and agronomical traits. Molecular biology techniques have been scarcely used in assessment of genetic resources and for improvement of important agronomic and quality traits in Cocos nucifera, mostly due to the absence of available sequence information. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2013 PMID: 23555859 PMCID: PMC3612046 DOI: 10.1371/journal.pone.0059997
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of RNA-Seq and de novo assembly of Cocos nucifera results.
| Number | Mean size | N50 size | Total nucleotides | |
| Read | 54,931,406 | 90 | 90 | 4,943,826,540 |
| Contig | 127,952 | 344 | 594 | 43,994,141 |
| Unigene | 57,304 | 752 | 1,219 | 43,090,665 |
Figure 1Statistical analysis of a de novo assembly of Cocos nucifera short reads generated by Illumina Hiseq™ 2000 sequencing.
The distribution of assembled contigs and unigene lengths is shown (127,952 contigs and 57,304 unigenes were generated in the study).
Figure 2Summary of the nucleotide BLAST results between the de novo-assembled unigenes and EST sequences from Cocos nucifera and Elaeis guineensis.
(A) Percentage of unigenes matched with EST sequences from Cocos nucifera. (B) Percentage of unigenes matched with EST sequences from Elaeis guineensis. (C) Percentage of EST sequences from Cocos nucifera matched with unigenes. (D) Percentage of EST sequences from Elaeis guineensis matched with unigenes. (E) E-values of unigene matches with cDNA sequences from Cocos nucifera. (F) E-values of unigene matches with cDNA sequences from Elaeis guineensis. (G) Unigene similarities to cDNA sequences from Cocos nucifera. (H) Unigene similarities to cDNA sequences from Elaeis guineensis.
Summary of annotations of assembled Cocos nucifera unigenes.
| Category | Account | Percentage (%) |
| Nr | 39,109 | 68.2% |
| Unique Nr proteins | 29,425 | 51.3% |
| Swissprot | 25,293 | 44.1% |
| GO classified unigenes | 15,178 | 26.5% |
| COG classified unigenes | 14,741 | 25.7% |
| KEGG classified unigenes | 23,128 | 40.4% |
Nr: NCBI non-redundant sequence database.
Percentage of annotated unigenes in total 57,307 assembled unigenes of coconut.
Statistical analysis of Cocos nucifera unigenes with sequence matches against public protein databases.
| Database | E–value range | No. of matched Unigenes | No. of matched proteins in Nr |
|
| (0–1e−5) | 33,120 (59%) | 145,825 (32%) |
| (457 052) | (1e−10–1e−5) | 4,149 (7%) | 18,695 (4%) |
| (1e−50–9e−11) | 16,122 (28%) | 71,257 (16%) | |
| (0–1e−51) | 12,851 (22%) | 55,872 (12%) | |
|
| (0–1e−5) | 19,643 (34%) | 45,620 (38%) |
| (119 770) | (1e−10–1e−5) | 2,915 (5%) | 4,945 (4%) |
| (1e−50–9e−11) | 9,968 (17%) | 21,559 (18%) | |
| (0–1e−51) | 9,239 (16%) | 19 117 (16%) | |
|
| (0–1e−5) | 34,708 (61%) | 122,467 (44%) |
| (279 003) | (1e−10–1e−5) | 9,571 (17%) | 17,540 (6%) |
| (1e−50–9e−11) | 23,888 (42%) | 61,620 (22%) | |
| (0–1e−51) | 17,886 (31%) | 43,307 (16%) | |
|
| (0–1e−5) | 17,430 (30%) | 33,512 (45%) |
| (74 331) | (1e−10–1e−5) | 2,629 (5%) | 3,882 (5%) |
| (1e−50–9e−11) | 9,248 (16%) | 16,511 (22%) | |
| (0–1e−−51) | 7,783 (14%) | 13,120 (18%) |
Total protein sequences of the corresponding species from Nr database.
Percentage of matched sequences in total unigenes of coconut.
Percentage of matched sequences in total protein sequences used in this species.
Figure 3Histogram of GO classifications of assembled Cocos nucifera unigenes.
Results are summarized for three main GO categories: biological process, cellular component and molecular function.
Figure 4Histogram of cluster of COG classification of Cocos nucifera sequences deposited in the NCBI database and de novo assembled unigenes.
14,741 assembled unigenes were annotated, falling into 25 clusters.
Classification of 347 unigenes involved in fatty acid biosynthesis and metabolism.
| Pathway | Genenumber | Types of Genes |
| Fatty acid biosysthsis | 81 | acetyl CoA:ACP carboxylase (21) |
| beta-ketoacyl-ACP synthase (22) | ||
| beta-ketoacyl-ACP reductase (13) | ||
| enoyl-ACP reductase (4) | ||
| fatty acyl-ACP thioesterase (20) | ||
| beta-hydroxyacyl-ACP dehydratase (1) | ||
| Unsaturated fatty acid | 41 | Oxidoreductases (14) |
| fatty acid desaturase (9) | ||
| cytochrome P450 (9) | ||
| stearyl-ACP desaturase (9) | ||
| Citrate cycle | 127 | citrate synthase (17) |
| malate dehydrogenase (20) | ||
| isocitrate dehydrogenase (21) | ||
| pyruvate dehydrogenase (23) | ||
| oxoglutarate dehydrogenase (16) | ||
| succinate dehydrogenase (8) | ||
| phosphoenolpyruvate carboxykinase (10) | ||
| fumarate hydratase (1) | ||
| Acon5itate hydratase (8) | ||
| succinate-CoA ligase (3) | ||
| Fatty acid metabolism | 94 | long-chain-fatty-acid-CoA synthetase (25) |
| acyl-CoA oxidase (16) | ||
| acetyl-CoA C-acetyltransferase (5) | ||
| acetyl-CoA C-acyltransferase (7) | ||
| acyl-CoA dehydrogenase (1) | ||
| alcohol dehydrogenase (26) | ||
| aldehyde dehydrogenase(8) | ||
| enoyl-CoA hydratase (6) | ||
| Fatty acid elongation | 4 | quinone reductase (1) |
| trans-2-enoyl-CoA reductase (2) | ||
| palmitoyl-protein thioesterase (1) |
Number of unigenes with annotated functions.