| Literature DB >> 26109289 |
Andreia Miguel1,2, José de Vega-Bartol3,4,5, Liliana Marum6,7,8, Inês Chaves9,10, Tatiana Santo11, José Leitão12, Maria Carolina Varela13, Célia M Miguel14,15.
Abstract
BACKGROUND: Cork oak (Quercus suber L.) has a natural distribution across western Mediterranean regions and is a keystone forest tree species in these ecosystems. The fruiting phase is especially critical for its regeneration but the molecular mechanisms underlying the biochemical and physiological changes during cork oak acorn development are poorly understood. In this study, the transcriptome of the cork oak acorn, including the seed, was characterized in five stages of development, from early development to acorn maturation, to identify the dominant processes in each stage and reveal transcripts with important functions in gene expression regulation and response to water.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26109289 PMCID: PMC4479327 DOI: 10.1186/s12870-015-0534-1
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Fig. 1Developmental stages established for the cork oak acorn. a Cork oak fruits collected at different developmental stages (S1-S8). The scale bar corresponds to 1 mm in S1 to S3 and to 5 mm in S4 to S8. b Cork oak fruits at stages S3-S5 after removal of the cupule (above), or cupule and pericarp (below) exposing the seed, and acorn measurement parameters (S7) used for acorn staging. D, maximum diameter of the acorn; Pi, portion of the acorn outside the cupule; P, acorn portion covered by the cupule. The scale bar corresponds to 1 mm in S3 to S5 and to 5 mm in S7
Criteria used for categorizing the cork oak acorn into different developmental stages. Representation of each stage in the normalized (N) and non-normalized (nN) cDNA libraries. Embryo tissues isolated from acorns belonging to stages S3-S5 and S8 were classified as EM3-EM5 and EM8
| Developmental stage | Max Ø with cupule (mm) | Max Ø without cupule (mm) | Isolation of embryos | Other observations; library type |
|---|---|---|---|---|
| S1 | 2 - 3 | nd | no | no endosperm visible; N and nN |
| S2 | 3 - 5 | nd | no | multiple fertilized ovules, some aborted; N and nN |
| S3 | 5 - 8 | nd | yes (EM3) | dominant embryo; N and nN |
| S4 | 8 - 12 | nd | yes (EM4) | acorn completely covered by the cupule; N and nN |
| S5 | 12 - 17 | 7 - 11 | yes (EM5) | acorn already visible out of the cupule; N and nN |
| S6 | 17 - 20 | 11 - 16 | no | ~1/2 of the acorn out of the cupule |
| S7 | >20 | >16 | no | Green acorn mostly out of the cupule; N |
| S8 | >20 | >16 | yes (EM8) | Brown acorn; N and nN |
nd: not determined
Read statistics from libraries of cork oak acorn and embryos before and after pre-processing. Embryo tissues isolated from acorns belonging to stages S3-S5 and S8 were termed EM3-EM5 and EM8, respectively
| Non-normalized cDNA libraries | Normalized cDNA libraries | |||||||
|---|---|---|---|---|---|---|---|---|
| S1 | S2 | S3 + S4 | S5 | S8 | S1 to S8 | EM3 to EM5 + EM8 | ||
| Raw reads | Total | 111,703 | 373,962 | 200,862 | 302,253 | 102,250 | 738,266 | 705,151 |
| Range (bp) | 51-1,200 | 52-1,200 | 47-1,200 | 50-1,200 | 55-1,200 | 50-1,201 | 52-1,201 | |
| Mode (bp) | 520 | 521 | 519 | 522 | 519 | 516 | 516 | |
| Mean (bp) | 503.7 | 515.8 | 510.7 | 524.8 | 513 | 538.5 | 538.7 | |
| Size (Mbp) | 47.5 | 162.5 | 86.1 | 133.6 | 44.3 | 314.7 | 298.8 | |
| Valid reads | Total | 100,081 | 332,674 | 179,588 | 267,171 | 88,261 | 572,665 | 547,895 |
| Range (bp) | 40-568 | 40-579 | 40-571 | 40-565 | 40-546 | 40-576 | 40-616 | |
| Mode (bp) | 209 | 213 | 220 | 217 | 214 | 409 | 380 | |
| Mean (bp) | 254.1 | 255.2 | 253.6 | 253.5 | 258.1 | 304.3 | 289.2 | |
| Size (Mbp) | 27.6 | 92.7 | 49.5 | 74.4 | 25.5 | 201.0 | 183.0 | |
De novo transcriptome assemblies and classification of the assembled cork oak transcripts
| MIRA | Newbler 2.6 | Final (Merged) | |
|---|---|---|---|
| Contigs | 104,862 | 33,034 | 80,357 |
| Contigs > 500 bp | 54,764 (52.2 %) | 26,313 (79.6 %) | 50,197 (62.5 %) |
| Contigs < 200 bp | 6,306 (6 %) | 481 (1.5 %) | 4,510a |
| Contigs with homologous in Uniprot | 73,103 (69.7 %) | 26,951 (81.6 %) | 56,517 (70.3 %) |
| Unique Uniprot IDs | 28,154 (38.5 %) | 16,047 (48.6 %) | 24,474 (43.3 %) |
| Complete contigs | 16,149 (15.4 %) | 12,953 (39.2 %) | 19,146 (23.8 %) |
| C-terminus contigs | 15,818 (15.1 %) | 5,342 (16.2 %) | 11,410 (14.2 %) |
| N-terminus contigs | 15,368 (14.7 %) | 4,064 (12.3 %) | 10,108 (12.6 %) |
| Internal contigs | 25,432 (24.2 %) | 4,575 (13.8 %) | 15,509 (19.3 %) |
| Misassembled contigs | 336 (0.32 %) | 17 (0.05 %) | 344 (0.43 %) |
| Contigs without UNIPROT homologous | 31,759 (30.3 %) | 6,083 (18.40 %) | 23,840 (29.6 %) |
| Novel genes | 5,388 (5.14 %) | 1,320 (4 %) | 4,658 (5.8 %) |
| Complete ORF | 2,758 (2.6 %) | 664 (2.01 %) | 2,318 (2.9 %) |
| Partial ORF | 2,630 (2.5 %) | 656 (1.99 %) | 2,340 (2.9 %) |
| Unknown contigs | 26,346 (25.1 %) | 4,757 (14.4 %) | 19,163 (23.8 %) |
| Unknown contigs < 200 bp | 5,471 (5.2 %) | 407 (1.2 %) | 0a |
| Putative ncRNAs | 25 (0.02 %) | 6 (0.02 %) | 19 (0.02 %) |
| Reads mappedb | 2,020,921 (96.8 %) | 1,909,842 (91.5 %) | 2,009,759 (96.2 %) |
| Unique mapped reads | 1,703,996 (84.3 %) | 1,549,803 (74,2 %) | 1,491,131 (71.4 %) |
| Duplicated mapped reads | 273,619 (13.1 %) | 270,027 (14.1 %) | 367,486 (17.6 %) |
| Mapping more than two times | 43,306 (2.1 %) | 90,012 (4.7 %) | 151,142 (7.2 %) |
| Average coverage | 7.8 | 14.2 | 8 |
aContigs shorter than 200 bp were filtered out before analyzing
bThe percentage of reads that mapped over a possible total of 2,088,230 reads
Fig. 2Heatmap of the expression levels of the KEGG level 2 pathways. The expression levels were normalized in Z scores, with signals from red (higher expression) to green (lower expression)
Fig. 3Venn diagram illustrating the number of transcripts differentially expressed between two consecutive stages of development
Fig. 4Clustering analysis of differentially expressed genes (DEGs) according to their expression profiles
Shortlist of differentially expressed transcripts annotated as involved in response to water. Transcripts with a known Arabidopsis homolog were selected from a total of 211 DEGs in this category, based on their specificity to a single cluster and higher expression level in a given stage as determined by a stage expression factor (SEFa) higher than 0.85. To have transcripts specific of cluster D in the short list, the stage expression factor considered in this case was 0.57. Expression in each stage is represented as normalized counts
| Cluster | Transcript name |
|
| Annotation | counts | SEFa | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| S1 | S2 | S3S4 | S5 | S8 | ||||||
| A | Contig16112 | AT2G47000 | MDR4, PGP4, ABCB4 | Multidrug resistance 4, P-glycoprotein 4, ATP binding cassette subfamily B4 | 19.8 | 0 | 0.6 | 0.4 | 0 | 0.95 |
| A | Qs-dev_rep_c84235 | AT3G17000 | UBC32 | Ubiquitin-conjugating enzyme 32 | 25 | 0 | 1.2 | 0 | 1.2 | 0.91 |
| B | Contig18768 | AT3G23920 | BAM1 | β-Amylase 1 | 1.2 | 29.6 | 0.6 | 1.2 | 2.2 | 0.85 |
| Qs-dev_rep_c76744 | 0.4 | 54.2 | 0.2 | 8.4 | 0.8 | 0.85 | ||||
| C | Qs-dev_rep_c77033 | AT5G25610 | RD22 | Responsive to desiccation 22 | 0.4 | 0 | 34.4 | 0 | 3.6 | 0.90 |
| D | Contig17288 | AT3G30775 | ERD5, PRODH, ATPOX, ATPDH, PRO1, PDH1 | Methylenetetrahydrofolate reductase family protein | 0 | 0 | 0 | 22 | 15 | 0.59 |
| D | Qs-dev_rep_c76415 | AT3G29320 | PHS1 | Glycosyl transferase | 0.4 | 0.6 | 0.2 | 25.4 | 16.6 | 0.59 |
| D | Contig19318 | AT3G24650 | ABI3 | AP2/B3-like transcriptional factor family protein | 0 | 0 | 7.2 | 66.8 | 42.2 | 0.57 |
| E | Contig19981 | AT2G41430 | ERD15, LSR1, CID1 | Dehydration-induced protein (ERD15) | 0 | 2.2 | 0.6 | 25.8 | 37.2 | 0.96 |
| F | Contig18471 | AT5G59320 | LTP3 | Lipid transfer protein 3 | 0 | 0.2 | 0 | 5.6 | 38.6 | 0.87 |
| F | Contig16935 | AT4G15910 | DI21 | Late embryogenesis abundant protein Lea5; drought-induced 21 | 1.2 | 0 | 3.6 | 18.8 | 179.2 | 0.88 |
| Qs-dev_c42419 | 0 | 0 | 0 | 0 | 16.4 | 1 | ||||
| F | Contig18808 | AT5G06760 | LEA4-5 | Late embryogenesis abundant 4-5 | 1.2 | 2.2 | 0.6 | 0.4 | 53 | 0.92 |
| F | Contig17917 | AT3G50980 | XERO1 | Dehydrin xero1 | 3 | 0.8 | 0.6 | 3.4 | 50.8 | 0.87 |
| F | Contig19993 | AT1G05680 | UGT74E2 | Uridine diphosphate glycosyltransferase 74E2 | 0 | 0.8 | 0 | 4.2 | 33.8 | 0.87 |
| F | Qs-dev_rep_c73281 | AT3G15353 | MT3 | Metallothionein 3 | 0.4 | 3 | 0.2 | 4.6 | 75 | 0.90 |
| Qs-dev_rep_c79475 | 0.4 | 0 | 0.2 | 0.8 | 18.8 | 0.93 | ||||
a
Shortlist of differentially expressed transcripts putatively coding for transcription factors and transcriptional regulators. Transcripts with a known Arabidopsis homolog were selected according the following criteria: transcript is either specific of a single cluster and is stage-specific or belongs to TF families with well characterized roles in plant development
| Cluster | Transcript name | Transcription factor | Domain /Family |
|
| Annotation | counts | SEFa | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| S1 | S2 | S3S4 | S5 | S8 | ||||||||
| A | Contig20395 | Vitis_vinifera_ GSVIVP00000629001 | MYB_DNA binding/MYB | AT5G57620 | MYB36 | MYB domain protein 36 | 24.2 | 0 | 0 | 0 | 0 | 1 |
| A | Contig14583 | Vitis_vinifera_ GSVIVP00024505001 | B3/ARF | AT5G60450 | ARF4 | Auxin response factor 4 | 21.6 | 0 | 0 | 0 | 0 | 1 |
| C | Contig20996 | Oryza_sativa_subsp._Indica_OsIBCD010288 | MYB_DNA binding/MYB-related | AT5G66390 | PRX72 | Peroxidase superfamily protein | 0 | 0 | 16.2 | 0 | 0 | 1 |
| F | Contig14504 | Oryza_sativa_subsp._japonica_LOC_Os11g45530.1 | FAR1/FAR1 | AT1G76320 | FRS4, CPD25 | FAR1-related sequence 4 | 0 | 0 | 0 | 0 | 23.2 | 1 |
| A | Contig23988 | Carica_papaya_emv.TU.supercontig_187.28 | KNOX1/HB | AT5G25220 | KNAT3 | KNOTTED1-like homeobox gene 3 | 35.6 | 0.8 | 3.8 | 11.6 | 13.8 | 0.54 |
| B | Contig17627 | Vitis_vinifera_ GSVIVP00005632001 | AT5G11060 | KNAT4 | KNOTTED1-like homeobox gene 4 | 1.2 | 34 | 27.4 | 5.6 | 2 | 0.48 | |
| E | Contig24728 | Carica_papaya_emv.TU.supercontig_996.1 | AP2/AP2-AREBP DUF260/LOB | AT4G25140 | OLEO1 | Oleosin 1 | 0 | 0 | 3 | 69.2 | 58.5 | 0.98 |
| E | Contig20473 | Vitis_vinifera_ GSVIVP00016010001 | 0 | 0 | 8.2 | 428.2 | 192.4 | 0.99 | ||||
| E | Contig23293 | Vitis_vinifera_ GSVIVP00016010001 | DUF260/LOB | AT3G27660 | OLEO4 | Oleosin 4 | 0 | 0 | 0 | 71 | 25.8 | 1 |
| F | Qs-dev_rep_c103605 | Vitis_vinifera_ GSVIVP00016010001 | 0.4 | 0 | 0 | 9.2 | 39.6 | 0.80 | ||||
| F | Qs-dev_c70574 | Vitis_vinifera_ GSVIVP00016010001 | AT5G40420 | OLEO2 | Oleosin 2 | 0.2 | 0 | 0 | 2.2 | 49.8 | 0.95 | |
| B | Contig1681 | Carica_papaya_evm.TU.supercontig_3.229 | HLH/bHLH | AT5G08130 | BIM1 | 0 | 22.8 | 0 | 1.2 | 8.8 | 0.70 | |
| D | Contig25637 | Vitis_vinifera_GSVIVP00008627001 | AT1G01260 | JAM2 | Jasmonate associated MYC2 like 2 | 5.2 | 16.6 | 7.4 | 21.6 | 0 | 0.43 | |
| E | Contig18945 | Populus_trichocarpa_180347 | AT5G41315 | GL3, MYC6.2 | Glabra 3, Glabrous 3, MYC6.2 | 0 | 4.6 | 0.6 | 24.6 | 23.8 | 0.90 | |
| A | Contig4505 | Populus_trichocarpa_218417 | NAM/NAC | AT1G56010 | NAC1 | NAC domain containing protein 1 | 34.2 | 2.2 | 0.6 | 0 | 0 | 0.92 |
| A | Contig23086 | Populus_trichocarpa_714847 | AT1G34190 | NAC017, ANAC017 | NAC domain containing protein 17 | 20.2 | 0 | 0 | 11 | 4.6 | 0.56 | |
| B | Contig23085 | 0 | 27.8 | 22.8 | 18.4 | 7 | 0.37 | |||||
| A | Contig16452 | Vitis_vinifera_ GSVIVP00027622001 | AT4G27410 | RD26, ANAC072 | Responsive to desiccation 26 | 608.8 | 184.2 | 138.6 | 32 | 11 | 0.62 | |
| B | Contig25368 | Vitis_vinifera_ GSVIVP00000471001 | AT4G35580 | NTL9 | NAC transcription factor-like 9 | 18.2 | 42 | 8.2 | 14.8 | 0 | 0.50 | |
| D | Contig26025 | Vitis_vinifera_ GSVIVP00027621001 | AT3G15510 | NAC2 | NAC domain containing protein 2 | 1.2 | 1.6 | 12 | 19.8 | 0 | 0.57 | |
a
Primers used in RT-qPCR for validation of the expression profile obtained by 454 sequencing.
| Clustera | Gene abbreviationb | Gene description | Transcript name |
| Primer sequences (forward / reverse) |
|---|---|---|---|---|---|
| A | MRP10, ABCC14 | ABC transporter C family member 14 | Contig3296 | AT3G62700 | GCTGCCTTTGCCCCACACT/ TGGAAGAGCCTTGAACGCTGCC |
| A | CPK10, CDPK1 | Calcium-dependent protein kinase | Contig4133 | AT1G18890 | GCTCACCTCCCTCGCACAACT/ AGGCGCGAGGTGGCGATAAT |
| B | ASPG1 | Protein aspartic protease in guard cell | Contig19387 | AT3G18490 | ATGCAGCTCGCTCGACGTGT/ TGTCGTGTCCGCAACCCAGA |
| B | ABF2 | Abscisic acid responsive elements-binding factor 2 | Contig22999 | AT1G45249 | CGGCCTGCTTTTTCCGCAACT/ AGTCAGCTGCAAGGTCACGAGC |
| B | AVP1 | Pyrophosphate-energized vacuolar membrane proton pump | Contig 8793 | AT1G15690 | CGCACTTGAGAACGACGCT/ TGCGCGGTCGTCGGAATCAT |
| B | HSP90-7 | Chaperone protein htpG family protein | Qs-dev_rep_c72408 | AT4G24190 | CAGCATCAGCTTCAGCCTCGGT/ AGCGGCTTCACGCCTTCTGA |
| B | BAM1, BMY7, TR-BAMY | β-amylase 1 | Contig 25491 | AT3G23920 | AACGCGAGTTTGCAGGCGCT/ CGACGTTTCCGCCGCATTGA |
| B | GolS2 | Galactinol synthase 2 | Contig 17943 | AT1G56600 | TGACGCCATAACATGGCCAGCA/ TTGCCAGCAGTGCCCTGACA |
| C | ALDH7B4 | Aldehyde dehydrogenase 7B4 | Contig 9124 | AT1G54100 | AGGGTGCTCCAACGACTCCA/ ACGCACAGCTAGGCCGATG |
| C | OPR3 | Oxophytodienoate-reductase 3 | Contig 16616 | AT2G06050 | TTGGTCTCTAACACAGCCGCCG/ CACGGCCGACATGCCATA |
| C | PAL1 | Phenylalanine ammonia-lyase 1 | Contig 9321 | AT2G37040 | TGCTAACTGGCCGCCCCAAT/ GCCAGAACCAACAGCTGTGCCA |
| D | ABI3, SIS10 | Abscisic acid insensitive 3 | Contig 19318 | AT3G24650 | GCAGTGGCCGTGGTGCAAT/ ATGGCGAGGCAAAGGCGGTT |
| D | PHS1 | Glycosyl transferase, family 35 | Contig 10143 | AT3G29320 | TGGCTTGAGATGGGCAACCCT/ TTTGGTGTGCTCCCCGGCAT |
| D | SIR | Sulfite reductase | Contig 19883 | AT5G04590 | TGCAATGGCATGCCCAGCCT/ ACTGGGGTTTCCACCAAGCCA |
| D | PIP2B | Aquaporin PIP2-4 | Contig 24062 | AT2G37170 | GCTTGGGCCTTTGGTGGCA/ TGGCTCCACCACCGTACTTGCT |
| F | COR414-TM1 | Cold-regulated 314 inner membrane 1 | Contig 19349 | AT1G29395 | TGGTGAGCTTGAGCTGCCGT/ TGCAAGCTGCCTGCTCGTGA |
| F | DI21 | Late embryogenesis abundant protein | Contig 16865 | AT4G15910 | AGCGTGCTCTCACCTTGTGGT/ TGCGGCTGCATCACATGCCT |
| F | JMT | Jasmonic acid carboxyl methyltransf. | Contig 17776 | AT1G19640 | TGGCTATCTTCGCCGCCCTT/ TGCCGTGGTTTGCACCTACTTCG |
| F | GSTU25 | Glutathione S-transferase TAU 25 | Contig 17981 | AT1G17180 | ACCACCTCGTCCGCCATTGT/ AGCCTGACCGACTCCATGGCAA |
| F | LTP3 | Lipid transfer protein 3 | Contig 19993 | AT5G59320 | TGCCCAGGCCACCATAACATGC/ TGTGGTCTTGGCCATGCCGTT |
a According to their expression profiles each transcript belongs to a different cluster
b Corresponds to the homologous in Arabidopsis thaliana
Fig. 5Validation of the RNA-Seq transcript profiles. Comparison of transcripts expression patterns from RNA-Seq data and from reverse transcription quantitative real-time PCR (RT-qPCR). In the y-axis it is represented the Log2 of the relative expression level in each developmental stage and the five acorn developmental stages are represented in the x-axis. The numbers above the graphics correspond to the values obtained with the Pearson correlation