| Literature DB >> 25268521 |
Xin Chen1, Jin Zhang2, Qingzhong Liu1, Wei Guo3, Tiantian Zhao2, Qinghua Ma2, Guixi Wang2.
Abstract
BACKGROUND: The genus Corylus is an important woody species in Northeast China. Its products, hazelnuts, constitute one of the most important raw materials for the pastry and chocolate industry. However, limited genetic research has focused on Corylus because of the lack of genomic resources. The advent of high-throughput sequencing technologies provides a turning point for Corylus research. In the present study, we performed de novo transcriptome sequencing for the first time to produce a comprehensive database for the Corylus heterophylla Fisch floral buds.Entities:
Mesh:
Year: 2014 PMID: 25268521 PMCID: PMC4182504 DOI: 10.1371/journal.pone.0108604
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of Illumina transcriptome sequencing for C. heterophylla Fisch.
| Sample | Total Raw Reads | Total Clean Reads | Total Clean Nucleotides (nt) | GC (%) | Q20 (%) | N (%) |
|
| 28,930,890 | 25,221,054 | 1,853,747,469 | 47.77 | 97.38 | 0.01 |
Summary of Illumina transcriptome assembly for C. heterophylla Fisch.
| Length | Total Length (Percentage) | |
| Contigs | Unigenes | |
| 100–500 | 69,816 (84.44%) | 23,184 (56.63%) |
| 500–1000 | 6,660 (8.05%) | 8,434 (20.60%) |
| 1000–1500 | 3,259 (3.94%) | 4,746 (11.59%) |
| 1500–2000 | 1,626 (1.97%) | 2,482 (6.06%) |
| ≥2000 | 1,323 (1.60%) | 2,095 (5.12%) |
| Total Length | 26,907,332 | 27,778,721 |
| Count | 82,684 | 40,941 |
| N50 Length | 642 | 1128 |
| Mean Length | 325 | 679 |
Figure 1Overview of the C. heterophylla Fisch transcriptome sequencing and assembly.
(A) Length distribution of C. heterophylla Fisch contigs.(B) Size distribution of C. heterophylla Fisch unigenes.(C) Log-Log plot showing the dependence of unigene length on the number of reads assembled into each unigene.
Functional annotation of the C. heterophylla Fisch transcriptome.
| Annotated databases | All sequences | ≥300 bp | ≥500 bp | ≥1000 bp | ≥2000 bp |
| Nr Annotation | 30,549 | 22,862 | 17,076 | 9,259 | 2,089 |
| Swiss-Prot Annotation | 18,581 | 14,707 | 11,477 | 6,752 | 1,668 |
| COG Annotation | 10,514 | 9,338 | 7,903 | 5,066 | 1,319 |
| KEGG Annotation | 17,207 | 13,808 | 10,885 | 6,587 | 1,646 |
| GO Annotation | 25,311 | 19,453 | 14,842 | 8,267 | 1,879 |
| Total Annotation | 31,844 | 22,889 | 17,080 | 9,260 | 2,089 |
| All | 40,941 | 25,779 | 17,757 | 9,323 | 2,095 |
Figure 2Comparative analysis of C. heterophylla Fisch unigenes with different species.
Homology analysis of C. heterophylla Fisch unigenes with multiple species.
Figure 3Functional annotation of assembled sequences based on gene ontology (GO) categorization.
Results are summarized for three main Go categories: Biological Process, Molecular Function, and Cellular Component. Detail information of GO terms for all unigenes were listed in Table S2.
Figure 4GO terms under response to stress and signal transduction.
(A) Summarized of GO terms in Biological Process (BP), Molecular Function (MF), and Cellular Component (CC). (B) GO terms related to signal transduction. (C) GO terms related to stress response.
Figure 5Clusters of orthologous groups (COG) classification.
In total, 10,514 sequences were grouped into 25 COG classifications.
Figure 6Distribution of C. heterophylla Fisch unigenes among Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways.
The top 30 most highly represented pathways are shown. Analysis was performed using Blast2GO and the KEGG database.
Figure 7C. heterophylla Fisch unigenes (red rectangle) involved in spliceosome assembly pathway.
Some C. heterophylla Fisch unigenes, such as Prp5, Prp2, Prp16, Prp12, and Prp43 etc, directly participated in the process of spliceosome assembly, while others made up the key components in spliceosome assembly including U1, U2, U4, U5, and U6 etc.
Candidate genes involved in cold tolerance in C. heterophylla Fisch floral buds.
| Unigene ID | BLASTn Hit |
| Gene Name | RPKM | References |
| Unigene24785 | Beta-amylase [ | 0.00E+00 |
| 42.38 |
|
| CL4405.Contig2 | Potassium channel AKT1-like [ | 0.00E+00 | KT | 2.00 |
|
| CL3300.Contig1 | Asparagine synthetase 1 [ | 0.00E+00 | ASNS | 33.28 |
|
| Unigene23215 | Betaine-aldehyde dehydrogenase [ | 0.00E+00 | BADH | 53.50 |
|
| CL1670.Contig2 | Sucrose-phosphate synthase [ | 0.00E+00 | SPS | 181.59 |
|
| Unigene11928 | Delta 1-pyrroline-5-carboxylate synthetase [ | 0.00E+00 | P5CS | 26.10 |
|
| CL28.Contig4 | Sensitive to freezing [ | 0.00E+00 | SFR6 | 0.57 |
|
| Unigene26540 | CBL-interacting serine/threonine-protein kinase 5-like [ | 0.00E+00 | CIPK | 30.94 |
|
| CL1657.Contig1 | Mitochondrial alternative oxidase 1A [ | 0.00E+00 | AOX1A | 1.11 |
|
| CL371.Contig1 | Enolase [ | 0.00E+00 | ENO | 49.23 |
|
| CL2001.Contig2 | MAP kinase [ | 0.00E+00 | MAPK | 2.07 |
|
| Unigene24663 | Omega-6 fatty acid desaturase, endoplasmic reticulum isozyme 2-like [ | 0.00E+00 | FAD2 | 368.94 |
|
| CL569.Contig5 | Calcium-dependent protein kinase 4-like [ | 0.00E+00 | CDPK | 1.70 |
|
| CL3389.Contig1 | Phosphoinositide phospholipase C 2 isoform 2 [ | 4.00E−179 | PLC | 11.89 |
|
| CL1380.Contig2 | Cold-induced protein [ | 1.00E−169 |
| 5.53 |
|
| Unigene12569 | Spermidine synthase 1-like [ | 1.00E−169 | SPDS | 61.34 |
|
| CL3248.Contig2 | Protein phosphatase 2C [ | 2.00E−165 | PP2C | 7.57 |
|
| CL539.Contig1 | Phospholipase d beta, putative [ | 1.00E−156 | PLD | 15.21 |
|
| Unigene515 | Glycerol-3-phosphate acyltransferase [ | 5.00E−150 |
| 23.50 |
|
| CL382.Contig1 | Class IV chitinase [ | 3.00E−142 | CHI | 22.23 |
|
| Unigene3155 | Nitrate Reductase (NADH) [ | 8.00E−125 |
| 6.51 |
|
| Unigene23517 | Superoxide dismutase [Mn] [ | 1.00E−114 | Mn-SOD | 110.73 |
|
| CL3124.Contig4 | Galactinol synthase 3 [ | 1.00E−109 | GOLS | 3.77 |
|
| Unigene23440 | Heat shock protein 22 [ | 3.00E−103 |
| 22.52 |
|
| Unigene117 | 14-3-3 [ | 1.00E−102 | 14-3-3 | 10.62 |
|
| CL3168.Contig1 | CBF/DREB1 transcription factor 1 [ | 6.00E−94 | CBF1 | 77.68 |
|
| Unigene21005 | Early-responsive to dehydration protein [ | 6.00E−92 |
| 11.82 |
|
| CL100.Contig2 | NAC family protein [ | 4.00E−82 | NAC | 19.627 |
|
| Unigene18268 | COR414-TM [ | 2.00E−71 | COR414 | 160.5406 |
|
| CL2732.Contig3 | Transcription factor hy5, putative [ | 1.00E−69 | HY | 11.5457 |
|
| CL1909.Contig1 | Fatty acid desaturase 3 [ | 6.00E−61 | FAD3 | 7.6633 |
|
| CL2284.Contig2 | Homeobox protein, putative [ | 4.00E−54 |
| 2.516 |
|
| Unigene22531 | bZIP transcription factor [ | 1.00E−51 |
| 17.6696 |
|
| Unigene6581 | MYB transcription factor [ | 1.00E−46 | MYB | 9.0585 |
|
| Unigene21495 | AP2/ERF domain-containing transcription factor [ | 1.00E−41 | ERF | 48.4576 |
|
| Unigene1447 | Choline monooxygenase [ | 1.00E−40 | CMO | 6.152 |
|
| CL1909.Contig2 | Omega-3 fatty acid desaturase [ | 1.00E−35 | FAD8 | 7.5632 |
|
| Unigene5353 | Beta-ketoacyl-ACP synthase [ | 3.00E−33 | KAS | 10.0582 |
|
| Unigene10730 | ICE-like [ | 1.00E−31 | ICE | 7.3351 |
|
| Unigene12179 | Actin depolymerizing factor [ | 3.00E−14 |
| 57.5651 |
|
| CL4024.Contig1 | Lipid transfer protein precursor [ | 4.00E−14 | LTP | 11.4457 |
|
| Unigene5522 | COR413-PM1 [ | 4.00E−11 |
| 9.2351 |
|
Figure 8qRT-PCR analysis of 10 cold tolerance candidate unigenes in C. heterophylla Fisch.
(A) Time course expression pattern of 10 cold tolerance candidate unigenes under 4°C cold stress (0 h, 2 h, 4 h, 8 h, 24 h) in C. heterophylla Fisch leaves. (B) Expression pattern of 10 cold tolerance candidate unigenes during four stages (NA, CA, MW, and DA) of overwintering in C. heterophylla Fisch floral buds. The gene names and the primers used for qRT-PCR analysis are shown in Table S4. Standard error of the mean for three biological replicates (nested with three technical replicates) is represented by the error bars.