| Literature DB >> 25836368 |
Yunqing Cheng1, Jianfeng Liu1, Huidi Zhang1, Ju Wang1, Yixin Zhao1, Wanting Geng1.
Abstract
BACKGROUND: A high ratio of blank fruit in hazelnut (Corylus heterophylla Fisch) is a very common phenomenon that causes serious yield losses in northeast China. The development of blank fruit in the Corylus genus is known to be associated with embryo abortion. However, little is known about the molecular mechanisms responsible for embryo abortion during the nut development stage. Genomic information for C. heterophylla Fisch is not available; therefore, data related to transcriptome and gene expression profiling of developing and abortive ovules are needed. METHODOLOGY/PRINCIPAL <br> FINDINGS: In this study, de novo transcriptome sequencing and RNA-seq analysis were conducted using short-read sequencing technology (Illumina HiSeq 2000). The results of the transcriptome assembly analysis revealed genetic information that was associated with the fruit development stage. Two digital gene expression libraries were constructed, one for a full (normally developing) ovule and one for an empty (abortive) ovule. Transcriptome sequencing and assembly results revealed 55,353 unigenes, including 18,751 clusters and 36,602 singletons. These results were annotated using the public databases NR, NT, Swiss-Prot, KEGG, COG, and GO. Using digital gene expression profiling, gene expression differences in developing and abortive ovules were identified. A total of 1,637 and 715 unigenes were significantly upregulated and downregulated, respectively, in abortive ovules, compared with developing ovules. Quantitative real-time polymerase chain reaction analysis was used in order to verify the differential expression of some genes. <br> CONCLUSIONS/SIGNIFICANCE: The transcriptome and digital gene expression profiling data of normally developing and abortive ovules in hazelnut provide exhaustive information that will improve our understanding of the molecular mechanisms of abortive ovule formation in hazelnut.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25836368 PMCID: PMC4383543 DOI: 10.1371/journal.pone.0122072
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Unigene size distribution.
All of the unigene sizes were calculated.
Fig 2Species distribution of the BLASTX search results.
This figure shows the species distribution of unigene BLASTX results against the NR protein database with a cutoff E-value < 10−5 and also shows the proportions of each species. Different colors represent different species. Species with proportions of less than 1% are not shown.
Fig 3Gene Ontology (GO) categories of unigenes.
Unigenes were annotated in three categories: cellular components, molecular functions, and biological processes.
Summary of mapping results.
| Sample ID | Total reads | Total base pairs | Total mapped reads | Perfect match | < = 2 bp Mismatch | Unique match | Multi-position match | Total unmapped reads |
|---|---|---|---|---|---|---|---|---|
| Empty | 7,456,236 | 365,355,564 | 6,565,172 | 5,323,902 | 1,241,270 | 5,187,188 | 1,377,984 | 891,064 |
| -100% | -100% | -88.05% | -71.40% | -16.65% | -69.57% | -18.48 | -11.95% | |
| Full | 7,422,728 | 363,713,672 | 6,461,102 | 5,029,053 | 1,432,049 | 5,021,287 | 1,439,815 | 961,626 |
| -100% | -100% | -87.04% | -67.75% | -19.29% | -67.75% | -19.4 | -12.96% |
1 Total Reads: all reads included in this study.
2 Total BasePair: all nucleotides in this study.
3 Total Mapped Reads: number of reads that are similar in sequence to part of reference.
4 Perfect Match: portion of total mapped reads that can be perfectly mapped to reference.
5 < = 2 bp Mismatch: portion of total mapped reads that can be mapped to reference with < = 2 bp mismatches.
6 Unique Match: portion of total mapped reads that have only one mapped site in reference.
7 Multi-position Match: portion of total mapped reads that have multiple mapped sites in reference.
8 Total Mapped Reads: number of reads that have no similar sequences as any part of reference.
9 Empty: cDNA library generated from abortive ovules.
10 Full: cDNA library generated from developing ovules.
Summary of unigenes mapped to transcriptome reference database.
| Sample | NUMT | Average length | Average coverage | Average RPKM |
|---|---|---|---|---|
| Empty | 45,641 | 797.68 | 56.52 | 19.38 |
| Full | 44,863 | 802.76 | 54.47 | 19.69 |
1 NUMT: Number of unigenes mapped to transcriptome reference database.
2 Average length: average length of gene.
3 Average coverage: average coverage of gene.
4 Average RPKM: average reads per kb per million reads.
5 Empty: cDNA library generated from empty ovules.
6 Full: cDNA library generated from developing ovules.
Fig 4Genes’ coverage distribution of empty and full ovules in hazelnut.
The numbers preceding the parentheses indicate the percentage of unique reads in each category, and the data in parentheses indicate the number of unique reads.
Fig 5Overview of differential expression.
Upregulated (red) and downregulated (green) unigenes were quantified.
Downregulated differentially expressed genes that are probably involved in embryo development.
| Gene ID | Gene length | Full RPKM | Empty RPKM | log2 Ratio | Up-Down | P-value | FDR | Annotation |
|---|---|---|---|---|---|---|---|---|
| CL97.Contig2_D2 | 727 | 52 | 1 | -5.75 | Down | 2.60E-15 | 1.06E-13 | Late embryogenesis abundant protein |
| CL97.Contig1_D2 | 760 | 21 | 1 | -4.44 | Down | 4.05E-06 | 6.10E-05 | Late embryogenesis abundant protein [ |
| Unigene22699_D2 | 933 | 298 | 38 | -3.02 | Down | 2.75E-53 | 4.28E-51 | Transformer-2 protein ( |
| CL2945.Contig1_D2 | 450 | 30 | 4 | -2.95 | Down | 2.21E-06 | 3.51E-05 | Transcription elongation factor SPT5 ( |
| CL3163.Contig3_D2 | 1041 | 161 | 28 | -2.57 | Down | 4.35E-25 | 3.02E-23 | Histone deacetylase 1/2 ( |
| Unigene17483_D2 | 1258 | 69 | 24 | -1.57 | Down | 1.01E-06 | 1.70E-05 | Brassinosteroid insensitive 1-associated receptor kinase 1 ( |
| Unigene18327_D2 | 629 | 68 | 25 | -1.49 | Down | 3.01E-06 | 4.64E-05 | Amidophosphoribosyltransferase ( |
| Unigene19803_D2 | 1748 | 131 | 53 | -1.35 | Down | 1.53E-09 | 3.82E-08 | Protein-serine/threonine kinase transport inhibitor response 1 ( |
| Unigene18238_D2 | 953 | 1282 | 537 | -1.3 | Down | 1.18E-75 | 2.62E-73 | Superoxide dismutase, Fe-Mn family ( |
| CL3630.Contig1_D2 | 1710 | 237 | 111 | -1.14 | Down | 1.08E-12 | 3.60E-11 | polypeptide N-Acetylglucosaminyltransferase ( |
| Unigene18285_D2 | 951 | 890 | 435 | -1.08 | Down | 7.83E-40 | 9.01E-38 | DNA polymerase epsilon subunit 3 ( |
1 Gene length: length of all exons in gene
2 Full RPKM: reads per kb per million reads of cDNA library generated from developing ovules
3 Empty RPKM: reads per kb per million reads of cDNA library generated from abortive ovules
4 log2 Ratio (empty RPKM / full RPKM), log2 (Fold Change)
5 Up-/down-regulation (empty/full): empty is up-/down-regulation in empty library relative to full library
6 P-value: p-value for hypothesis testing
7 FDR: false discovery rate.
Fig 6(A) Gene expression data for digital gene expression analysis.
The fold changes of the genes were calculated as the log2 value of each empty/full comparison and are shown on the y-axis. (B) The quantitative real-time PCR analysis of gene expression data. Quantitative variation of ABF, ERF1, CTR1, ERF2, EIN3, PYL, SUPT5H and TRA2 in Empty were compared with Full, and β-Actin (GenBank: AB298788) was used as the internal standard.
The DEGs involved in abscisic acid and ethylene signal pathways.
| Gene ID | Gene length | Full RPKM | Empty RPKM | log2Ratio | Up-Down | P-value | FDR | Annotation |
|---|---|---|---|---|---|---|---|---|
| Unigene16189_D2 | 854 | 0.01 | 3.16 | 8.30 | Up | 7.77E-05 | 8.93E-04 | ABA-responsive element binding factor |
| Unigene4733_D2 | 380 | 5.76 | 20.29 | 1.82 | Up | 5.87E-05 | 6.96E-04 | Abscisic acid receptor PYR/PYL family |
| CL7150.Contig2_D2 | 1572 | 4.56 | 14.35 | 1.65 | Up | 8.33E-11 | 2.35E-09 | Protein phosphatase 2C |
| Unigene31203_D2 | 388 | 0.51 | 9.94 | 4.28 | Up | 1.52E-05 | 2.06E-04 | Ethylene-responsive transcription factor 1 |
| Unigene15326_D2 | 573 | 0.70 | 9.76 | 3.81 | Up | 3.87E-07 | 6.97E-06 | Ethylene-responsive transcription factor 1 or 2 |
| Unigene19967_D2 | 991 | 1.00 | 9.34 | 3.22 | Up | 7.93E-10 | 2.03E-08 | Serine/threonine-protein kinase CTR1 |
| Unigene26530_D2 | 2242 | 19.99 | 72.83 | 1.87 | Up | 3.62E-81 | 8.69E-79 | Ethylene-insensitive protein 3 |
1 Gene length: length of all exons in gene.
2 Full RPKM: reads per kb per million reads of cDNA library generated from developing ovules.
3 Empty RPKM: reads per kb per million reads of cDNA library generated from abortive ovules.
4 log2Ratio (empty RPKM /full RPKM), log2 (Fold Change).
5 Up-/downregulation (empty/full): empty is up-/downregulation in empty library relative to full library.
6 P-value: p-value for hypothesis testing.
7 FDR: false discovery rate.