| Literature DB >> 25393112 |
Sara Torre1, Massimiliano Tattini1, Cecilia Brunetti2, Silvia Fineschi1, Alessio Fini3, Francesco Ferrini3, Federico Sebastiani4.
Abstract
Quercus pubescens Willd., a species distributed from Spain to southwest Asia, ranks high for drought tolerance among European oaks. Q. pubescens performs a role of outstanding significance in most Mediterranean forest ecosystems, but few mechanistic studies have been conducted to explore its response to environmental constrains, due to the lack of genomic resources. In our study, we performed a deep transcriptomic sequencing in Q. pubescens leaves, including de novo assembly, functional annotation and the identification of new molecular markers. Our results are a pre-requisite for undertaking molecular functional studies, and may give support in population and association genetic studies. 254,265,700 clean reads were generated by the Illumina HiSeq 2000 platform, with an average length of 98 bp. De novo assembly, using CLC Genomics, produced 96,006 contigs, having a mean length of 618 bp. Sequence similarity analyses against seven public databases (Uniprot, NR, RefSeq and KOGs at NCBI, Pfam, InterPro and KEGG) resulted in 83,065 transcripts annotated with gene descriptions, conserved protein domains, or gene ontology terms. These annotations and local BLAST allowed identify genes specifically associated with mechanisms of drought avoidance. Finally, 14,202 microsatellite markers and 18,425 single nucleotide polymorphisms (SNPs) were, in silico, discovered in assembled and annotated sequences. We completed a successful global analysis of the Q. pubescens leaf transcriptome using RNA-seq. The assembled and annotated sequences together with newly discovered molecular markers provide genomic information for functional genomic studies in Q. pubescens, with special emphasis to response mechanisms to severe constrain of the Mediterranean climate. Our tools enable comparative genomics studies on other Quercus species taking advantage of large intra-specific ecophysiological differences.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25393112 PMCID: PMC4231058 DOI: 10.1371/journal.pone.0112487
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of RNA-Seq and de novo sequence assembly for Q. pubescens.
| Total raw reads | 310,521,410 |
| Total clean reads | 254,265,700 |
| Number of contigs | 96,006 |
| Mean lenght of contigs (bp) | 618 |
| N75 | 401 |
| N50 | 910 |
| N25 | 1,889 |
N75 length is defined as the length N for which 75% of all bases in the sequences are in a sequence of length L
N50 length is defined as the length N for which 50% of all bases in the sequences are in a sequence of length L
N25 length is defined as the length N for which 25% of all bases in the sequences are in a sequence of length L
Summary of annotations of assembled Q. pubescens contigs.
| Database | Number of trascripts annotated | Percentage of trascripts annotated |
| Viridiplantae Nr | 68,285 | 71.12% |
| RefSeq | 77,623 | 80.85% |
| UniProt | 73,148 | 76.19% |
| InterPro | 48,622 | 50.27% |
| KOG | 19,146 | 19.94% |
| Pfam | 20,448 | 21.29% |
| GO | 8,536 | 12.54% |
| KEGG | 4,050 | 4.22% |
Figure 1Hit species distribution of BLASTX matches of Q. pubescens contigs.
Proportion of Q. pubescens contigs with similarity to sequences from Nr Viridiplantae protein database.
Figure 2Histogram of the 30 most abundant InterPro domains revealed by the InterProScan annotation of the high quality Quercus pubescens transcript set.
KOG functional classification of all Q. pubescens transcripts.
| KOG Classification | Sequences (n) | Percentage |
| Signal transduction mechanisms | 3356 | 17.0 |
| Multiple classes | 2259 | 11.4 |
| General function prediction only | 1928 | 9.8 |
| Posttranslational modification, protein turnover, chaperones | 1776 | 9.0 |
| Function unknown | 1228 | 6.2 |
| Translation, ribosomal structure and biogenesis | 1065 | 5.4 |
| Transcription | 821 | 4.2 |
| Carbohydrate transport and metabolism | 753 | 3.8 |
| Cytoskeleton | 750 | 3.8 |
| Intracellular trafficking, secretion, and vescicular transport | 747 | 3.8 |
| RNA processing and modification | 723 | 3.7 |
| Amino acid transport and metabolism | 635 | 3.2 |
| Secondary metabolites biosynthesis, transport and catabolism | 604 | 3.1 |
| Energy production and conversion | 603 | 3.1 |
| Lipid transport and metabolism | 540 | 2.7 |
| Inorganic ion transport and metabolism | 445 | 2.3 |
| Replication, recombination and repair | 402 | 2.0 |
| Coenzyme transport and metabolism | 196 | 1.0 |
| Cell cycle control, cell division, chromosome partitioning | 192 | 1.0 |
| Cell wall/membrane/envelope biogenesis | 179 | 0.9 |
| Defense mechanisms | 175 | 0.9 |
| Nucleotide transport and metabolism | 167 | 0.8 |
| Chromatin structure and dynamics | 147 | 0.7 |
| Extracellular structures | 38 | 0.2 |
| Nuclear structure | 10 | 0.1 |
Figure 3Histogram of GO classifications of assembled Quercus pubescens transcripts.
Results are summarized for three main GO categories: biological process, cellular component and molecular function.
Figure 4Catalytic activity distribution in annotated Q.pubescens transcripts.
Figure 5Top 30 metabolic pathways in Q. pubescens.
This table shows the KEGG metabolic pathways of plants that were well represented by unique sequences of Quercus pubescens. The number of sequences and enzymes involved are described
Q. pubescens contigs related to genes involved in drought avoidance in Arabidopsis Thaliana.
| Gene name | Sequence similarity | Contig ID | Forward and Reverse Primers 5′-3′ | Amplicon length (bp) |
| Auxin response factor 1 (ARF1) | Arabidopsis thaliana (AT1G59750) | Quercus_contig_413 | F: | 706 |
| R: | ||||
| Mitogen-activated protein kinase 6 (MPK6) | Arabidopsis thaliana (AT2G43790) | Quercus_contig_575 | F: | 907 |
| R: | ||||
| Rho GDP-dissociation inhibitor 1 (SCN1) | Arabidopsis thaliana (AT3G07880) | Quercus_contig_781 | F: | 749 |
| R: | ||||
| Glabra 2 (GL2) | Arabidopsis thaliana (AT1G79840) | Quercus_contig_1106 | F: | 2362 |
| R: | ||||
| Phospholipase D deltaPLD delta | Arabidopsis thaliana (AT4G35790) | Quercus_contig_1195 | F: | 949 |
| R: | ||||
| Can Of Wms1 (COW1) | Arabidopsis thaliana (AT4G34580) | Quercus_contig_1527 | F: | 1888 |
| R: | ||||
| GL2-Expression Modulator (GEM) | Arabidopsis thaliana (AT2G22475) | Quercus_contig_4979 | F: | 474 |
| R: | ||||
| Phospholipase D alpha 2 (PLD alpha2) | Arabidopsis thaliana (AT1G52570) | Quercus_contig_7695 | F: | 2360 |
| R: | ||||
| Cyclin D 3;2 (CYCD3;2) | Arabidopsis thaliana (AT5G67260) | Quercus_contig_11597 | F: | 880 |
| R: | ||||
| Transparent Testa Glabra 1 (TTG1) | Arabidopsis thaliana (AT5G24520) | Quercus_contig_11685 | F: | 417 |
| R: | ||||
| Phospholipase D beta2 (PLD beta2) | Arabidopsis thaliana (AT4G00240) | Quercus_contig_15358 | F: | 2523 |
| R: | ||||
| Cyclin D 3;3 (CYCD3;3) | Arabidopsis thaliana (AT3G50070) | Quercus_contig_21908 | F: | 1085 |
| R: | ||||
| Cyclin D 3;1 (CYCD3;1) | Arabidopsis thaliana (AT4G34160) | Quercus_contig_22955 | F: | 1038 |
| R: | ||||
| Phospholipase D alpha1 (PLD alpha 1) | Arabidopsis thaliana (AT3G15730) | Quercus_contig_25616 | F: | 1157 |
| R: | ||||
| Enhancer of Glabra 3 (EGL3) | Arabidopsis thaliana (AT1G63650) | Quercus_contig_32077 | F: | 207 |
| R: | ||||
| Werewolf (WER) | Arabidopsis thaliana (AT5G14750) | Quercus_contig_35143 | F: | 332 |
| R: | ||||
| Root Hair Defective 2 (RHD2) | Arabidopsis thaliana (AT5G51060) | Quercus_contig_39550 | F: | 2451 |
| R: | ||||
| Triptychon (TRY) | Arabidopsis thaliana (AT5G53200) | Quercus_contig_41116 | F: | 211 |
| R: | ||||
| Phospholipase D epsilon (PLD epsilon) | Arabidopsis thaliana (AT1G55180) | Quercus_contig_47871 | F: | 186 |
| R: | ||||
| Glabra 1 (GL1) | Arabidopsis thaliana (AT3G27920) | Quercus_contig_56291 | F: | 351 |
| R: | ||||
| Cyclin A2;3 (CYCA2;3) | Arabidopsis thaliana (AT1G15570) | Quercus_contig_72031 | F: | 248 |
| R: |
Figure 6SSRs distribution in the leaf transcriptome of Q.pubescens.