| Literature DB >> 28068432 |
Suzana Tiemi Ivamoto1,2, Osvaldo Reis3, Douglas Silva Domingues4, Tiago Benedito Dos Santos2, Fernanda Freitas de Oliveira2, David Pot5, Thierry Leroy5, Luiz Gonzaga Esteves Vieira6, Marcelo Falsarella Carazzolle3, Gonçalo Amarante Guimarães Pereira3, Luiz Filipe Protasio Pereira2,7.
Abstract
Coffea arabica L. is an important crop in several developing countries. Despite its economic importance, minimal transcriptome data are available for fruit tissues, especially during fruit development where several compounds related to coffee quality are produced. To understand the molecular aspects related to coffee fruit and grain development, we report a large-scale transcriptome analysis of leaf, flower and perisperm fruit tissue development. Illumina sequencing yielded 41,881,572 high-quality filtered reads. De novo assembly generated 65,364 unigenes with an average length of 1,264 bp. A total of 24,548 unigenes were annotated as protein coding genes, including 12,560 full-length sequences. In the annotation process, we identified nine candidate genes related to the biosynthesis of raffinose family oligossacarides (RFOs). These sugars confer osmoprotection and are accumulated during initial fruit development. Four genes from this pathway had their transcriptional pattern validated by quantitative reverse transcription polymerase chain reaction (qRT-PCR). Furthermore, we identified ~24,000 putative target sites for microRNAs (miRNAs) and 134 putative transcriptionally active transposable elements (TE) sequences in our dataset. This C. arabica transcriptomic atlas provides an important step for identifying candidate genes related to several coffee metabolic pathways, especially those related to fruit chemical composition and therefore beverage quality. Our results are the starting point for enhancing our knowledge about the coffee genes that are transcribed during the flowering and initial fruit development stages.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28068432 PMCID: PMC5221826 DOI: 10.1371/journal.pone.0169595
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of C. arabica de novo transcriptome assembly.
| Assembly Information | Values |
|---|---|
| High-quality Reads | 41,881,572 |
| Percentage of Mapped Reads | 65% |
| GC Content | 41,13% |
| N50 | 2,118 bp |
| Total of Contigs | 127,600 |
| Number of Unigenes (>200 bp) | 65,364 |
| Number of Coding Protein Unigenes | 24,548 |
| Number of Full-Length Coding Protein Unigenes | 12,560 |
| Unigenes Average Size | 1,264 bp |
Similarity analysis of coffee and plant database sequences.
| Reference Database | Hits | No Hits |
|---|---|---|
| 26,176 | 39,188 | |
| 24,798 | 40,566 | |
| 20,542 | 44,822 |
A summary of up- and down-regulated DGEs among coffee libraries*.
| Coffee Libraries | Flower | Leaves | Perisperm | ||||
|---|---|---|---|---|---|---|---|
| 30 DAF | 60 DAF | 90 DAF | 120 DAF | 150 DAF | |||
| - | 234 | 288 | 428 | 1132 | 1394 | ||
| 599 | - | 363 | 895 | 1108 | |||
| 1311 | 680 | - | 1123 | 863 | 1425 | 1349 | |
| 588 | - | 614 | 1558 | ||||
| 919 | 356 | 799 | - | 1569 | 1426 | ||
| 828 | 381 | 1159 | 790 | - | 1433 | ||
| 822 | 219 | 1111 | 568 | 1232 | - | ||
*Up-regulated unigenes are placed at the botton of the table (under—mark), and down-regulated unigenes are placed at the top of the table (above—mark). Number in bold and italic are high and low values, respectively, as observed between their respective libraries. Numbers underlined are the lowest and highest values among all libraries comparisons for up- and down-regulated genes.
Fig 1Venn diagrams showing unigenes up- (a) and down-regulated (b) among perisperm development stages (60, 90, 120 and 150 DAF) compared with perisperm at 30 DAF.
A total of 3,130 unigenes classified as down-regulated (a) and 3,412 unigenes classified as up-regulated (b) were used in this analysis.
Transcriptionally active transposable elements in C. arabica transcriptome.
| Number of Contigs | TE Name | TE Class |
|---|---|---|
| 50 | Gypsy | I |
| 10 | Copia | I |
| 10 | LINE | I |
| 29 | MuDR | II |
| 19 | Helitron | II |
| 13 | hAT | II |
| 2 | EnSPM | II |
| 1 | Harbinger | II |
Raffinose family oligosaccharide candidate genes.
| Gene name | Enzymatic activity | TAIR database | CDD database | Pfam Entry | Protein length | |
|---|---|---|---|---|---|---|
| galactinol synthase | At1G56600 | Cc03_g00450 | PLN00176 | pfam01501 | 345 aa | |
| galactinol synthase | At1G09350 | Cc02_g35350 | PLN00176 | pfam01501 | 335 aa | |
| galactinol synthase | At1G60470 | Cc11_g15250 | PLN00176 | pfam01501 | 339 aa | |
| galactinol synthase | At3G28340 | Cc11_g14010 | PLN00176 | pfam01501 | 389 aa | |
| galactinol synthase | At3G06260 | Cc11_g10580 | PLN00176 | pfam01501 | 350 aa | |
| raffinose synthase | At1G55740 | Cc05_g15530 | PLN02355 | pfam05695 | 678 aa | |
| raffinose synthase | At5G40390 | Cc07_g01840 | PLN02355 | pfam05695 | 782 aa | |
| raffinose synthase | At5G20250 | Cc06_g08070 | PLN02355 | pfam05695 | 870 aa | |
| stachyose synthase | At4G01970 | Cc01_g21600 | PLN02355 | pfam05695 | 879 aa |
Fig 2GO term categorization analysis performed by Blast2GO tools for RFO biosynthetic genes.
GO categories annotated for RFOs candidate genes are indicated in blue; the global dataset, used as a reference, is indicated in red. GO categorization analyses were performed using the Blast2GO software with the default parameters.
Fig 3DGE comparison of raffinose biosynthesis-related genes in leaves and perisperm between C. arabica and C. canephora.
RPKM values are represented in Log10 scale. Leaves noted in green, and perisperm in red. Ca = C. arabica. Cc = C. canephora. C. canephora RPKM values were obtained from the Coffee Genome Hub database [50].
Fig 4qRT-PCR analysis of selected RFO-related genes.
Leaves are represented in green and perisperm in yellow (90 DAF), red (120 DAF) and blue (120 DAF). Relative expression values are represented in Log10 scale. Calibrator tissue is always the minimal gene expression value. Lower-case letters, from a to d, represent statistically significant differences for each RFO gene among coffee tissues (leaves and perisperm from 90 to 150 DAF).