| Literature DB >> 28874672 |
R V Sreedhar1,2, P Prasad1,2, L Prasanna Anjaneya Reddy1, Ram Rajasekharan1, Malathi Srinivasan3.
Abstract
Buglossoides arvensis is an emerging oilseed crop that is rich in stearidonic acid (SDA) and has several potential applications in human health and nutrition. The molecular basis of SDA biosynthesis in this plant remains unknown due to lack of genomic information. To unravel key genes involved in SDA-rich triacylglycerol (TAG) biosynthesis, we performed transcriptome sequencing of pooled mRNA from five different developmental stages of B. arvensis seeds using Illumina NextSeq platform. De novo transcriptome assembly generated 102,888 clustered transcripts from 39.83 million high-quality reads. Of these, 62.1% and 55.54% of transcripts were functionally annotated using Uniprot-Viridiplantae and KOG databases, respectively. A total of 10,021 SSR-containing sequences were identified using the MISA tool. Deep mining of transcriptome assembly using in silico tools led to the identification of genes involved in fatty acid and TAG biosynthesis. Expression profiling of 17 key transcripts involved in fatty acid desaturation and TAG biosynthesis showed expression patterns specific to the development stage that positively correlated with polyunsaturated fatty acid accumulation in the developing seeds. This first comprehensive transcriptome analysis provides the basis for future research on understanding molecular mechanisms of SDA-rich TAG accumulation in B. arvensis and aids in biotechnological production of SDA in other oilseed crops.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28874672 PMCID: PMC5585386 DOI: 10.1038/s41598-017-09882-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Lipid content and fatty acid composition. (a) Lipid content of B. arvensis developing seeds and leaves. Lipid content was determined gravimetrically and expressed as % of dry weight. (b) Neutral lipid profile of five developmental stages of B. arvensis seed by TLC. Lipids were extracted and separated on a silica-TLC plate using the solvent system, petroleum ether:diethyl ether:acetic acid (70:30:1, v/v) and developed by manganese chloride charring. (c) Fatty acid composition of developing seeds and leaves. (d) Representative GC-MS chromatogram showing the fatty acid profile of B. arvensis seeds (24 DAF). All values are represented as the mean of biological triplicates ± SD. DAF, days after flowering; SE, steryl ester; TAG, triacylglycerol; DAG, diacylglycerol; FFA, free fatty acid; C16:0, palmitic acid; C18:0, stearic acid; C18:1, oleic acid; C18:2, linoleic acid; C18:3 (n-6), gamma-linolenic acid; C18:3 (n-3), alpha-linolenic acid; C18:4, stearidonic acid.
Transcriptome raw data and de novo assembly statistics of B. arvensis.
| Summary of sequencing reads | ||
| Number of raw reads | 42528726 | |
| Number of processed reads | 39833196 | |
| Total number of nucleotides | 6379308900 | |
| GC content (%) | 44% | |
| Q30 (%) | 82.24% | |
| Summary of | ||
|
|
|
|
| Hash length | 25 | 25 |
| Transcripts generated | 1,20,320 | 1,02,888 |
| Maximum transcript length | 16659 | 16659 |
| Minimum transcript length | 301 | 301 |
| Average transcript length | 954.6 | 922.8 |
| Median Transcript Length | 823 | 450 |
| Total Transcripts Length | 11,48,54,624 | 9,49,41,727 |
| Total Number of Non-ATGC Characters | 0 | 0 |
| Transcripts >= 300 b | 1,20,320 | 1,02,888 |
| Transcripts >= 500 b | 77640 | 64959 |
| Transcripts >= 1 Kb | 40411 | 32509 |
| Transcripts >= 5 Kb | 335 | 259 |
| Transcripts >= 10 Kb | 10 | 9 |
| N50 value | 1318 | 1263 |
| GC% | 39.13252809 | 38.94526376 |
Figure 2Annotation statistics of B. arvensis clustered transcripts against Uniprot-Viridiplantae database. (a) Length distribution of clustered transcripts. (b) Nucleotide content of clustered transcripts. (c) Species distribution of the top BLASTX hits in the Uniprot database.
Figure 3GO and KOG classification. (a) Gene ontology distribution of B. arvensis clustered transcripts according to biological process, cellular component and molecular function. The percentage of transcripts encoding the top ten GO terms in each category is represented. (b) KOG classification of clustered transcripts into metabolism, cellular processes and signalling, poorly characterized and information storage and processing. The percentage of clustered transcripts encoding the KOG terms in each category are represented.
Figure 4KEGG pathway analysis. (a) Classification of B. arvensis clustered transcripts based on the metabolism category. The bars represent the number of clustered transcripts in each sub-category of metabolism. (b) Percentage distribution of lipid metabolism transcripts.
List of enzymes involved in fatty acid biosynthesis, desaturation and TAG biosynthesis in developing seeds of B. arvensis.
| Symbol | Enzymes | Arabidopsis gene ID | Number of Clustered transcripts | RPKM Value of best hit transcript |
|---|---|---|---|---|
|
| ||||
| ACC1 | Acetyl-CoA carboxylase/biotin carboxylase 1 | AT1G36160 | 1 | 17.11 |
| α-CT | Acetyl-CoA carboxylase carboxyl transferase subunit alpha | AT2G38040 | 12 | 34.02 |
| BCCP | Acetyl-CoA carboxylase biotin carboxyl carrier protein | AT5G15530 | 3 | 50.92 |
| BC | Acetyl-CoA carboxylase, biotin carboxylase subunit | AT5G35360 | 1 | 19.46 |
| β-CT | Acetyl-CoA carboxylase carboxyl transferase subunit beta | ATCG00500 | 2 | 56.11 |
| MCMT | Malonyl-CoA- ACP Malonyl transferase | AT2G30200 | 2 | 26.29 |
| KASIII | 3-oxoacyl-[acyl-carrier-protein] synthase III | AT1G62640 | 1 | 13.91 |
| KASII | 3-oxoacyl-[acyl-carrier-protein] synthase II | AT1G74960 | 4 | 32.6 |
| KASI | 3-oxoacyl-[acyl-carrier-protein] synthase I | AT5G46290 | 4 | 16.46 |
| KAR | 3-oxoacyl-[acyl-carrier protein] reductase | AT1G24360 | 5 | 67.56 |
| HAD | 3-hydroxyacyl-[acyl-carrier-protein] dehydratase | AT2G22230 | 1 | 26.91 |
| ER | Enoyl-[acyl-carrier protein] reductase I | AT2G05990 | 4 | 26.44 |
| FATA | Fatty acyl-ACP thioesterase A | AT3G25110 | 2 | 18.24 |
| FATB | Fatty acyl-ACP thioesterase B | AT1G08510 | 21 | 15.06 |
| LACS 8 | Long-chain acyl-CoA synthetase 8 | AT2G04350 | 2 | 27.40 |
| LACS 9 | Long-chain acyl-CoA synthetase 9 | AT1G77590 | 5 | 3.25 |
| LACS 4 | Long-chain acyl-CoA synthetase 4 | AT4G23850 | 11 | 4.03 |
|
| ||||
| SAD | Stearoyl-ACP desaturase | AT2G43710 | 6 | 9.99 |
| FAD2 | Omega-6 fatty acid desaturase (∆12-desaturase), Microsomal | AT3G12120 | 5 | 505.81 |
| FAD6 | Omega-6 fatty acid desaturase (∆12-desaturase), Choloroplastic | AT4G30950 | 2 | 10.12 |
| FAD3 | Omega-3 fatty acid desaturase (∆15-desaturase), Microsomal | AT2G29980 | 2 | 1763.42 |
| FAD7/8 | Omega-3 fatty acid desaturase (∆15-desaturase), Chloroplastic | AT3G11170 | 3 | 14.08 |
| D6D-1 | Delta-6 desaturase | — | 3 | 11.30 |
| D6D-2 | Delta-6 desaturase | — | 1 | 329.42 |
|
| ||||
| GPAT9 | Glycerol-3-Phosphate Acyltransferase 9 | AT5G60620 | 2 | 13.03 |
| LPAAT2 | 1-Acylglycerol-3-Phosphate Acyltransferase | AT3G57650 | 4 | 20.57 |
| PAH2 | Phosphatidate Phosphatase | AT5G42870 | 6 | 6.17 |
| DGAT1 | Acyl-CoA: Diacylglycerol Acyltransferase 1 | AT2G19450 | 2 | 8.60 |
| DGAT2 | Acyl-CoA: Diacylglycerol Acyltransferase 2 | AT3G51520 | 3 | 6.88 |
| PDCT | Phosphatidylcholine:diacylglycerol cholinephosphotransferase | AT3G15820 | 4 | 20.23 |
| CPT | Diacylglycerol Cholinephosphotransferase | AT1G13560 | 3 | 26.18 |
| PDAT1 | Phospholipid: Diacylglycerol Acyltransferase 1 | AT5G13640 | 2 | 22.88 |
| PDAT2 | Phospholipid: Diacylglycerol Acyltransferase 2 | AT3G44830 | 3 | 19.25 |
| LPCAT | 1-Acylglycerol-3-Phosphocholine Acyltransferase | AT1G12640 | 4 | 17.07 |
| OLE1 | Oleosin 1 | AT4G25140 | 2 | 2144.55 |
| WRI | Wrinkled 1 | AT3G54320 | 2 | 35.94 |
Number of clustered transcripts and RPKM value of the best hit for each gene is represented.
Figure 5Multiple sequence alignment and phylogenetic analysis. (a) Amino acid sequence alignment of delta-6 desaturases (D6D) of B. arvensis isoforms (BaD6D-1 and BaD6D-2) identified from our transcriptome data with the reported D6D sequences of B. arvensis (BaD6DES; GenBank ID: AJS13593), Echium plantagineum (EpD6D; GenBank ID: AAZ08559), Echium sabulicola (EsD6D; GenBank ID: AAZ23035), and Borage officinalis (BoD6DES; GenBank ID: ABU51607). Pink shades indicate identical amino acids. Green, blue, violet and yellow shades indicate dissimilar amino acids with different percentages of amino acid conservation. Dashes indicate gaps in alignment. H1, H2 and H3 boxes indicate conserved histidine boxes. The green box indicates a conserved N-terminal cytochrome b5 heme binding motif. (b) Phylogenetic tree showing the relationship of B. arvensis D6D (BaD6D-1 and BaD6D-2) with orthologue proteins from the other related organisms in the Boraginaceae family. The tree was constructed by MEGA6 software using ClustalW and a neighbour-joining method. Bootstrap value = 1000.
Figure 6Schematic representation of a B. arvensis oil biosynthetic pathway rich in stearidonic acid. The relative expression of genes involved in fatty acid desaturation and triacylglycerol biosynthesis in developing seeds(6, 12, 18, 24, 30 DAF) and leaves of B. arvensis were analysed through qRT-PCR. Alpha-tubulin and clathrin adaptorcomplex (CAC) were used as internal reference genes. Fold change is represented as log2 in a heat map. Stage-1(6 DAF) was used as a calibrator and its expression was set as 1.0. ACP: acyl-Carrier protein; GL: galactolipid; G3P: sn-glycerol-3 phosphate; PA: phosphatidic acid; LPA; lysophosphatidic acid; LPC: lysophosphatidylcholine; DAG: diacylglycerol; TAG: triacylglycerol; PC: phosphatidylcholine. For full name of genes, see Table 2.