| Literature DB >> 28431015 |
Yuko Makita1, Kiaw Kiaw Ng1,2, G Veera Singham1,3, Mika Kawashima1, Hideki Hirakawa4, Shusei Sato4, Ahmad Sofiman Othman2,3, Minami Matsui1.
Abstract
Natural rubber has unique physical properties that cannot be replaced by products from other latex-producing plants or petrochemically produced synthetic rubbers. Rubber from Hevea brasiliensis is the main commercial source for this natural rubber that has a cis-polyisoprene configuration. For sustainable production of enough rubber to meet demand elucidation of the molecular mechanisms involved in the production of latex is vital. To this end, we firstly constructed rubber full-length cDNA libraries of RRIM 600 cultivar and sequenced around 20,000 clones by the Sanger method and over 15,000 contigs by Illumina sequencer. With these data, we updated around 5,500 gene structures and newly annotated around 9,500 transcription start sites. Second, to elucidate the rubber biosynthetic pathways and their transcriptional regulation, we carried out tissue- and cultivar-specific RNA-Seq analysis. By using our recently published genome sequence, we confirmed the expression patterns of the rubber biosynthetic genes. Our data suggest that the cytoplasmic mevalonate (MVA) pathway is the main route for isoprenoid biosynthesis in latex production. In addition to the well-studied polymerization factors, we suggest that rubber elongation factor 8 (REF8) is a candidate factor in cis-polyisoprene biosynthesis. We have also identified 39 transcription factors that may be key regulators in latex production. Expression profile analysis using two additional cultivars, RRIM 901 and PB 350, via an RNA-Seq approach revealed possible expression differences between a high latex-yielding cultivar and a disease-resistant cultivar.Entities:
Keywords: RNA-Seq; full-length cDNA; latex; rubber biosynthesis; transcription factor
Mesh:
Substances:
Year: 2017 PMID: 28431015 PMCID: PMC5397604 DOI: 10.1093/dnares/dsw056
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Flowchart of the steps in the construction of the FL-cDNA libraries, sequencing and data analysis.
Summary of sequence resources of FL-cDNAs
| No. of sequences | Min. length (bp) | Max. length (bp) | Mean length (bp) | Mapping ratio (by PASA) | ||
|---|---|---|---|---|---|---|
| Sanger FL-cDNA | 5′ ESTs | 19,487 | 102 | 920 | 667.4 | |
| 3′ ESTs | 4,840 | 111 | 876 | 668.7 | ||
| Sanger FL-cDNA contigs | 4,590 | 111 | 2,026 | 868.4 | ||
| Sanger FL-cDNA singletons | 3,570 | 103 | 884 | 655.7 | ||
| Assembled Illumina FL-cDNA contigs | 15,683 | 201 | 11,217 | 966.3 | ||
| Merged FL-cDNA (contigs) | 23,843 | 103 | 11,217 | 893.9 | 91.4% | |
aThe number of reads per contig was 4.5.
Statistics of updated structural gene annotation in H. brasiliensis.
| Categories | RIKEN data | + public ESTs |
|---|---|---|
| 84,440 | ||
| No. of loci detected the expression | 7,704 | 23,790 |
| Total no. of loci updated with the PASA analysis | 5,466 | 17,201 |
| No. of fused genes | 78 | 236 |
| FL-cDNAs split single gene into multiple genes | 6 | 6 |
| No. of genes modified in UTRs | 2,605 | 7,525 |
| No. of genes modified in exon structure | 1,998 | 6,087 |
| No. of gene with newly defined isoforms | 218 | 1,967 |
| Proteins with CDS modification | 2,901 | 7,645 |
| Novel genes | 22 | 22 |
| Novel isoforms in the novel genes | 4 | 5 |
Figure 2(a) Numbers of tissue-specific and co-expressed genes in RRIM 600 latex, bark, petiole and leaf. (b) Numbers of clone-specific and co-expressed genes in PB 350, RRIM 901 and RRIM 600 latexes. We defined expressed genes with FPKM > 0.
Figure 3Latex biosynthesis pathway and heat maps of scaled expression values (log10(FPKM + 1)) for genes associated with latex biosynthesis. The lines of each heat map represent duplicate genes and the columns represent samples: PB 350 latex, RRIM 901 latex, RRIM 600 latex, RRIM 600 bark, RRIM 600 petiole and RRIM 600 leaf from the left. 37 MVA pathway, 21 MEP pathway and 18 prenyl-PP biosynthetic genes, 7 CPTs, 1 CPTL, 9 REFs and 8 SRPP genes are defined in our draft genome article. In the heat maps, genes were sorted by the expression value in RRIM 600 latex. These figures were drawn by an R package of gplots. Abbreviations in the MVA pathway: PD, pyruvate dehydrogenase; PDC, pyruvate dehydrogenase complex; DLD, dihydrolipoamide dehydrogenase; AACT, acetyl-CoA acetyltransferase; HMGS, hydroxymethyglutaryl coenzyme A synthase; HMG-CoA, hydroxymethylglutaryl coenzyme A; HMGR, hydroxymethyglutaryl coenzyme A reductase; MK, mevalonate kinase; mevalonate-OP, mevalonate monophosphate; PMK, phosphomevalonate kinase; mavalonate-OPP, mevalonate diphosphate; MDC, diphosphomevalonate decarboxylase; in the plastidic MEP pathway: DXS, 1-deoxy-D-xylulose 5-phosphate synthase; DXP, 1-deoxy-D-xylulose-5-phosphate; DXR, 1-deoxy-D-xylulose 5-phosphate reductoisomerase; MEP, 2-C-methyl-D-erythritol 4-phosphate; MCT, 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; CDP-ME, 2-C-methyl-D-erythritol-4-phosphate; CMK, 4-(cytidine 5′-diphospho)-2-C-methyl-D-erythritol kinase; CDP-ME2P, 4-(cytidine 5′-diphospho)-2-C-methyl-D-erythritol; MDS, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; MEcPP, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate; HDS, 4-hydroxy-3-methylbut-2-enyl diphosphate synthase; HMB-PP, 4-hydroxy-3-methyl-but-2-en-1-yl diphosphate; HDR, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase. Corresponding gene IDs are as follow: PD#1:Hb_001638_290, PD#2:Hb_006420_080, PD#3:Hb_132840_110, PD#4:Hb_001500_040, PD#5:Hb_116349_120, PD#6:Hb_001123_160, PD#7:Hb_027506_040, PD#8:Hb_003397_030, PDC#1:Hb_000800_090, PDC#2:Hb_005306_180, PDC#3:Hb_002477_290, PDC#4:Hb_000110_310, PDC#5:Hb_003411_090, PDC#6:Hb_000207_300, PDC#7:Hb_002876_240, DLD#1:Hb_001723_060, DLD#2:Hb_002217_100, DLD#3:Hb_000645_180, DLD#4:Hb_012305_100, AACT#1:Hb_000613_200, AACT#2:Hb_000184_040, AACT#3: Hb_000377_090, AACT#4:Hb_000795_060, HMGS#1:Hb_003207_110, HMGS#2:Hb_000107_440, HMGS#3:Hb_004032_440, HMGS#4:Hb_001153_150, HMGS# 5:Hb_014497_120, HMGS#6:Hb_142382_010, HMGS#7:Hb_000116_530, MK#1:Hb_035273_030, MK#2:Hb_127194_010, MK#3:Hb_002042_080, PMK#1:Hb_002759_250, PMK#2:Hb_116349_100, MDC#1:Hb_00 0539_030, MDC#2:Hb_000721_030, DXS#1:Hb_003185_080, DXS#2:Hb_062135_030, DXS#3:Hb_019 153_040, DXS#4:Hb_ 002473_110, DXS#5:Hb_021596_070, DXS#6:Hb_000046_120, DXS#7:Hb_005628_050, DXS#8:Hb_012796_050, DXS#9:Hb_124247_030, DXR#1:Hb_007520_020, DXR#2:Hb_000922_160, MCT#1:Hb_029142_030, MCT#2:Hb_007576_140, CMK#1:Hb_000453_240, CMK#2:Hb_001975_040, MDS#1:Hb_177215_030, MDS#2:Hb_ 002529_090, HDS#1:Hb_000029_350, HDS#2:Hb_000008_410, HDR#1:Hb_159809_090, HDR#2:Hb_000963_090, IPPI#1:Hb_005686_130, IPPI#2:Hb_000608_190, DMAT#1:Hb_001842_100, DMAT#2:Hb_000428_050, DMAT#3:Hb_000130_260, FPPS#1:Hb_000072_070, FPPS#2:Hb_100414_010, FPPS#3:Hb_011224_160, GGPPS# 1:Hb_1 15919_020, GGPPS#2:Hb_000371_100, GGPPS#3:Hb_134789_010, GGPPS#4:Hb_020367_050, GGPPS#5:Hb_000114_150, GGPPS#6:Hb_002768_080, GGPPS# 7:Hb_000009_180, GGPPS#8:Hb_010672_040, GGPPS#9:Hb_009780_060, GGPPS#10:Hb_001948_040.
Gene set enrichment analysis against TF families that are significantly highly expressed in latex.
| TF family | No. of TFs | Highly expressed TFs in latex | Function | |||
|---|---|---|---|---|---|---|
| C2C2-Dof | 49 | 7 | 0.009 | plant specific | Plant-specific phenomena including light, phytohormone and defense responses, seed development and germination | — |
| TAZ | 7 | 2 | 0.027 | plant specific | Responses to different stress stimuli | — |
| AP2 | 28 | 4 | 0.032 | plant specific | A variety of biological processes and response to biotic and environmental stress | 5′-gCAC(A/G)N(A/T) TcCC(a/g)ANG(c/t)-3′ |