| Literature DB >> 28253862 |
Stephanie C Kerr1, Federico Gaiti1, Christine A Beveridge1, Milos Tanurdzic2.
Abstract
BACKGROUND: The decision for a bud to grow into a branch is a key regulatory process affecting plant architecture. In order to study molecular processes regulating axillary bud outgrowth in the model plant garden pea (Pisum sativum), we sequenced the axillary bud transcriptome and performed de novo transcriptome assembly.Entities:
Keywords: Axillary buds; Diurnal; Gene expression; Long non-coding RNA; Pisum sativum; RNA-seq; Transcriptome
Mesh:
Substances:
Year: 2017 PMID: 28253862 PMCID: PMC5335751 DOI: 10.1186/s12864-017-3577-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of de novo pea axillary bud transcriptome statistics
| Statistic | Pea bud transcriptome |
|---|---|
| # contigs | 81,774 |
| # isoforms | 194,067 |
| Shortest isoform (bp) | 201 |
| Longest isoform (bp) | 17,155 |
| # large isoforms (>1000 bp) | 108,011 |
| N50 isoforms (bp) | 2170 |
| Ave. isoform length (bp) | 1285 |
| # paired reads used in assembly | ~55 million |
| # nucleotides used in assembly | ~23 Gb |
| Reads mapped in pairs (%) | 58.67% |
| Individual reads mapped (%) | 90.94% |
Estimated size of different Pisum sativum and other plant species transcriptomes
| Species | Source | Estimated transcriptome size (Mb) |
|---|---|---|
|
| - | 57a |
|
| - | 249b |
|
| [ | 37 |
|
| [ | 10 |
|
| [ | 59 |
|
| [ | 81b |
|
| [ | 72b |
|
| Mt4.0v1 | 67c |
|
| ITAG1 | 38c |
|
| TAIR10 | 66c |
areflects the transcriptome size estimated using the longest isoform per contig
breflects the transcriptome size estimated using all isoforms
cbased on whole genome reference
BLAST comparisons (E-value 1E−10) between the pea bud isoforms as the query and Pisum sativum contigs [5–8] or Medicago CDS and proteins as the reference
| Reference | Analysis | # reference matches | # pea bud isoform matches |
|---|---|---|---|
| Pea (Franssen) contigs | BLASTN | 78,692 (93%) | 109,619 (56%) |
| Pea (Kaur) contigs | BLASTN | 13,112 (96%) | 46,411 (24%) |
| Pea (Duarte) contigs | BLASTN | 65,732 (96%) | 112,429 (58%) |
| Pea (Kaspa) contigs | BLASTN | 98,613 (79%) | 143,092 (74%) |
| Pea (Parafield) contigs | BLASTN | 112,836 (77%) | 147,661 (76%) |
|
| TBLASTX | 48,203 (77%) | 117,699 (61%) |
|
| BLASTX | 47,385 (76%) | 115,151 (59%) |
BLASTX (E-value 1E−10) comparisons between the pea bud isoforms as the query and Ultra-Conserved Ortholog (UCO) protein sequences, Swiss-Prot pea or UniProtKB pea protein sequences as the reference
| Reference | Min. subject coverage | # reference matches | # pea bud isoform matches |
|---|---|---|---|
| UCO | 75% | 316 (81%) | 320 |
| Swiss-Prot | 80% | 308 (79%) | 812 |
| UniProtKB | 80% | 1210 (80%) | 2289 |
Fig. 1a Comparison of the sequence length (nt) of unannotated and annotated isoforms from the de novo pea axillary bud assembly, and b the overlap between the number of unannotated isoforms and the number of lncRNA in the pea bud transcriptome
Fig. 2Rfam non-coding RNA families represented in the pea bud transcriptome
Fig. 3Enzyme representation of the top five KEGG pathways ranked by the percentage of enzymes annotated in the pea axillary bud transcriptome
Differentially expressed transcripts (FDR < 0.05) between pairwise comparisons of each time window collected over a time frame of 170 min in Pisum sativum buds
| Pairwise comparison | Down | Up | Total | Annotated | Potential long non-coding RNA |
|---|---|---|---|---|---|
| Time window 1 vs 2 | 0 | 0 | 0 | n/a | n/a |
| Time window 1 vs 3 | 58 | 66 | 124 | 62 (50%) | 28 (23%) |
| Time window 2 vs 3 | 5 | 32 | 37 | 13 (35%) | 5 (14%) |
| Unique transcriptsa | 61 | 81 | 142 | 71 (50%) | 31 (22%) |
arepresents unique transcripts from all three pairwise comparisons
Fig. 4Validation of 14 differentially expressed transcripts using qRT-PCR. Transcript expression in node 2 rms5-3 axillary buds was calculated relative to time window 3 for both RNA-seq data (blue bars) and qRT-PCR data (orange bars). Only time windows that were statistically significantly different in the RNA-seq edgeR analysis were included. Data are means ± SE (n = 3–4 pools of ~60 plants). Statistically significant differences were determined using a one-way ANOVA with a post-hoc Tukey’s test. Different letters (capital letters for RNA-seq values, lower case letters for qRT-PCR values) represent statistically significant differences at FDR<0.05 or p < 0.05, respectively
List of annotated differentially expressed transcripts identified in pairwise comparisons (FDR < 0.05) between the three time windows in Pisum sativum buds collected over a 170 min time frame
| Transcript | AT# |
|
|---|---|---|
| comp103811_c0 | AT5G16080 | Probable carboxylesterase 15 |
| comp34724_c0 | AT4G30110 | Cadmium/zinc-transporting ATPase HMA2 |
| comp34931_c0 | AT1G15760 | Sterile alpha motif domain-containing protein |
| comp35134_c1 | AT5G02570 | Histone h2b |
| comp35188_c0 | AT2G43460 | 60s ribosomal protein l38 |
| comp35195_c0 | AT3G62980 | Protein TRANSPORT INHIBITOR RESPONSE 1 |
| comp35285_c0 | AT1G14450 | NADH dehydrogenase |
| comp35297_c0 | AT4G26470 | Calcium-binding protein CML21 |
| comp35867_c0 | AT2G16365 | F-box protein |
| comp36828_c0 | AT3G48770 | ATP/DNA binding protein |
| comp42921_c0 | AT1G68050 | Kelch repeat-containing protein |
| comp54867_c0 | AT1G02070 | Uncharacterized protein |
| comp55051_c0 | AT1G27480 | Lecithin-cholesterol acyltransferase-like 1 |
| comp55149_c0 | AT3G24150 | Uncharacterized protein |
| comp55670_c2 | ATMG00030 | Uncharacterized mitochondrial protein ATMG00030 |
| comp55866_c0 | AT4G00100 | 40s ribosomal protein s13-2 |
| comp55974_c0 | AT3G61610 | Galactose mutarotase-like superfamily protein |
| comp64053_c1 | AT3G12587 | Oligosaccaryltransferase |
| comp68925_c0 | AT3G55340 | Phragmoplastin interacting protein 1 |
| comp69006_c0 | AT3G47570 | Receptor kinase |
| comp70446_c0 | AT5G24930 | Zinc finger protein constans-like 4 |
| comp70806_c0 | AT1G75540 | Constans-like b-box zinc finger protein |
| comp71289_c1 | AT1G07770 | 40s ribosomal protein s15a |
| comp71932_c0 | AT5G25450 | Cytochrome bd ubiquinol oxidase |
| comp72075_c0 | AT1G04400 | Cryptochrome 2 |
| comp73339_c0 | AT3G11050 | Ferritin 2 |
| comp75279_c0 | AT2G03340 | WRKY transcription factor 3 |
| comp75525_c2 | AT2G37620 | Actin 1 |
| comp77858_c0 | AT4G40030 | Histone h3 |
| comp77929_c0 | AT4G29390 | 40s ribosomal protein s30 |
| comp78122_c1 | AT3G54500 | Night light-inducible and clock-regulated 2 |
| comp78315_c0 | AT1G78510 | Solanesyl diphosphate synthase 1 |
| comp79509_c0 | AT1G69180 | Transcription factor crc |
| comp79848_c1 | AT5G24780 | Acid phosphatase VSP1 |
| comp80157_c2 | AT1G07050 | CCT motif family protein |
| comp81803_c0 | AT5G42900 | Cold regulated protein 27 |
| comp82468_c1 | AT2G05960 | Retroelement pol polyprotein |
| comp82517_c0 | AT5G02560 | Histone h2a |
| comp83232_c1 | AT3G15620 | UV repair defective 3 |
| comp83562_c0 | AT4G38960 | B-box type zinc finger-containing protein |
| comp83593_c0 | AT2G25530 | AFG1-like ATPase family protein |
| comp83707_c0 | AT3G49430 | Ser arg-rich protein 34a |
| comp84080_c0 | AT5G35970 | P-loop containing nucleoside triphosphate hydrolases superfamily protein |
| comp84301_c1 | AT2G24820 | Translocon at the inner envelope membrane of chloroplasts 55 |
| comp84585_c0 | AT5G58140 | Phototropin 2 |
| comp85037_c0 | AT4G24290 | Mac perforin domain-containing protein |
| comp85451_c1 | AT2G44740 | Preg1-like negative regulator |
| comp85721_c1 | AT2G42750 | DNAJ heat shock N-terminal domain-containing protein |
| comp86930_c0 | AT2G40130 | Double clp-n motif-containing p-loop nucleoside triphosphate hydrolases superfamily protein |
| comp87035_c5 | AT2G38540 | Nonspecific lipid-transfer protein |
| comp87071_c2 | AT3G61130 | Galacturonosyltransferase 3 |
| comp87121_c0 | AT5G66290 | Uncharacterized protein |
| comp87227_c2 | AT1G18330 | MYB-related transcription factor EPR1 |
| comp87716_c1 | AT3G62330 | CCHC-type zinc knuckle protein |
| comp88021_c0 | AT1G56220 | Dormancy auxin associated protein |
| comp89415_c1 | AT3G20810 | Jumonji-C domain-containing protein 30 |
| comp89617_c0 | AT5G24850 | Cryptochrome 3 |
| comp89772_c0 | AT5G56850 | Uncharacterized protein |
| comp90486_c1 | AT5G24470 | Pseudo-response regulator 5 |
| comp90925_c6 | AT3G19900 | Uncharacterized protein |
| comp91531_c2 | AT3G14050 | RelA-SpoT like protein RSH2 |
| comp91844_c1 | - | Hypothetical protein |
| comp91844_c4 | - | Hypothetical protein |
| comp92821_c0 | AT4G16146 | cAMP-regulated phosphoprotein 19-related protein |
| comp93257_c0 | AT3G20390 | Endoribonuclease L-PSP family protein |
| comp93445_c0 | AT5G04800 | 40s ribosomal protein S17-4 |
| comp93451_c0 | AT1G04220 | Beta-ketoacyl-synthase |
| comp93952_c0 | AT3G55280 | 60s ribosomal protein L23a-2 |
| comp94434_c0 | AT3G06730 | M-type thioredoxin |
| comp95443_c0 | AT5G08180 | Ribosomal protein L7ae-like |
| comp99868_c0 | AT4G29040 | Regulatory particle AAA-ATPase 2A |
Fig. 5GO terms enriched in the differential expression analysis according to Fisher’s Exact Test (FDR < 0.05), and ordered according to % of sequences annotated in the DE gene set (red) and the reference gene set (blue). The longest isoform for each transcript in the transcriptome was used in the analysis
Fig. 6KEGG pathways represented in the differential expression analysis, ordered according to fold change difference between the percentage of genes annotated in the DE gene set (red) and the reference gene set (blue). The longest isoform for each transcript in the transcriptome was used in the analysis