Literature DB >> 32308267

Mining of miRNAs from EST data in Dendrobium nobile.

Debasish B Krishnatreya1, Pooja Moni Baruah1, Bhaskar Dowarah1, Kuntala Sarma Bordoloi1, Heena Agarwal1, Niraj Agarwala1.   

Abstract

Dendrobium nobile is an orchid species highly popular for its therapeutic properties and is often used as a medicinal herb. Documenting miRNA-target associations in D. nobile is an important step to facilitate functional genomics studies in this species. Therefore, it is of interest to identify miRNA sequences from EST data available in public databases using known techniques and tools. We report 14 potential miRNAs from three ESTs of D. nobile. They belong to 3 miRNA families (miR390, miR528 and miR414) linking to transcription factor regulation, signal transduction, DNA and protein binding, and various cellular processes covering 34 different metabolic networks in KEGG. These results help in the understanding of miRNA-mRNAs functional networks in Dendrobium nobile.
© 2020 Biomedical Informatics.

Entities:  

Keywords:  Dendrobium nobile; Expressed Sequence Tags; in silico; miRNA

Year:  2020        PMID: 32308267      PMCID: PMC7147496          DOI: 10.6026/97320630016245

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Dendrobium nobile is ornamentally and medicinally one of the most important species of flowering plants. It belongs to the Orchidaceae family, which is one of the largest families of the angiosperms and has been used as a first-rate herb in India and China since ancient times [1]. The pattern of flowering of the violet coloured flowers of D. nobile make them more fascinating and attractive [2]. The presence of various active compounds likes Dendrobine, Moscatilin, Gigantol, Nobiline and Dendrophenol in the stems and leaves of D. nobile are known to be responsible for the greatly increased medicinal property of this plant [3, 4]. These compounds are known to have strong anti-mutagenic properties and are anti-carcinogenic against lung carcinoma, ovary adenocarcinoma and promyelocytic leukemia [5]. Moreover genetic diversity studies indicate that D. nobile from Northeast India has a comparatively higher rate of genetic diversity [6,7]. The orchid, being prized for its immense commercial importance, is often subjected to unrestrained anthropogenic pressures, thereby threatening its natural population [8]. In addition to its health benefits and economic value, D. nobile is also a wonderful source of experimental material to expound gene expression and regulation because of its versatile characteristics; the availability of decent numbers of expressed sequence tags of this species also augmented this study. MicroRNAs are a class of endogenous small, non-coding, single stranded RNAs that act as post-transcriptional regulators in eukaryotic organisms [9]. Each miRNA is capable of regulating the expression of many genes - either by translational repression or mRNA cleavage- allowing them to simultaneously regulate multiple cellular signalling and biosynthetic pathways [10]. Plant microRNAs play important roles in plant growth and development including leaf morphology and polarity, organ development, cell differentiation and proliferation, programmed cell death, signal transduction, stress responses, hormone signalling, floral organ identity and maturity, phase transition and reproduction [11-13]. For miRNAs to be reliably distinguishable from other RNAs, Ambros et al. (2003) developed a set of criteria for miRNA identification and annotation and their guidelines for experimental verification [14]. However, those criteria for miRNA annotation have been revised by Axtell and Meyers (2018), which has been followed in this study [15]. The first miRNA to be discovered was lin-4, predicted to be of 22 nucleotides in length and found in the larval form of Caenorhabditis elegans [16]. It is responsible for regulation of the pathway that triggers the transitions of first larval stage cell division to the second [17]. In plants, RNA polymerase II is responsible to transcribe majority of primary miRNA transcripts (pri-miRNAs) from miRNA genes. Processing of pri-miRNAs to precursor miRNAs and then further to mature miRNA-miRNA* duplex is brought about by the DCL1 (Dicer-like 1) enzyme [18]. The duplex is methylated by HUA ENHANCER 1 (HEN1) and transported to the cytoplasm by HASTY, after which the guide miRNA strand is then incorporated into ARGONAUTE (AGO) protein [19]. Once a suit-able pairing event between a miRNA and target mRNA occurs, the RISC (RNA-induced silencing complex) then triggers almost complete inhibition of protein expression by either cleavage of mRNA targets or by inhibiting protein translation [20]. The repressional activity of miRNA is mainly based on the property of regulation of gene expression at the post-transcriptional level either by cleavage mediated mRNA degradation or inhibition of translation [21]. Discovery of genetic modulators in various plants has helped to comprehend their specific regulatory modules involved in complex biological processes. Understanding the biological functions of miRNAs, identification of miRNAs and their target genes is an important step in interpreting the roles of miRNAs in regulation of specific characters. Documentation of miRNAs and their targets have been very effective in a number of plants such as Arabidopsis, rice, maize, wheat, soybean, cotton and tea [22, 23].

Methods:

Reference set of miRNAs and Sequence data:

A total of 38,589 previously identified mature micro-RNAs from different plants were retrieved from the miRBase database (http://www.mirbase.org/) (release 22.1). These sequences were defined as the query sequence set and used for identifying miRNAs in D. nobile Expressed Sequence Tags (ESTs). Publicly available 15,383 ESTs of the species were downloaded from National Centre for Biotechnology Information (NCBI)(https://www.ncbi.nlm.nih.gov/). Local database for BLAST was constructed for D. nobile ESTs by using the locally installed NCBI-Blast+ application (ftp://ftp.ncbi.nlm.nih.gov /blast/ executables/blast+/). Non-redundant protein sequences were used from the NR protein database of NCBI (ftp://ftp.ncbi.nlm.nih.gov/blast/db/).

Identification of putative miRNAs:

Sequence and structural homologies are used for computer based predictions of miRNAs. Computational strategies provide less time consuming, valuable and efficient means for prediction and identification of miRNA genes and their targets (Figure 1). NCBI-BLAST+ program was used to screen the ESTs against the reference miRNAs obtained from miRBase by searching for homologous hits [24]. A maximum of two mismatches,threshold e-value of <0.001 and word-size value of 7 was set for the blast+ analysis. After removing redundancy the ESTs with matched hits were subjected to Blastx analysis with NR protein database, and the non-protein coding sequences were retained for further analysis of RNA secondary structure using Zuker folding algorithm by Mfoldv3.5 (http://unafold.rna.albany.edu/ ?q=mfold/ RNA-Folding-Form) [25]. The following parameters were used in defining the sequences as miRNA homologs: (1) The sequence should fold into an appropriate stem-loop secondary structure. (2) The miRNA should be present in one arm of the hairpin structure. (3) The mature miRNA and its complementary miRNA* sequence should not have more than 5 mismatches. (4) The value of Minimal Folding free Energy Index (MFEI) of precursor miRNA structures should not be less than 0.5 and should have a high Minimal Folding free Energy (MFE) value. MFE is the negative equivalent of the ΔG value [26]. The MFEI value has been calculated by using the following formula proposed by Zhang et al. [27].
Figure 1

Computational pipeline for identification of putative miRNAs of D. nobile and their target genes

AMFE = (MFE x 100)/Length of precursor MFEI = AMFE / (GC) % MFEI = [(MFE/length of the RNA sequence) x 100]/(GC)%

Prediction of putative target genes:

A plant small RNA Target Analysis Server viz. psRNATarget was used for predicting the targets of the newly identified miRNA by using Schema V2 (2017 Release) with the maximum expectation value threshold as 3 and rest of the values set as default [28]. A maximum of two mismatches were allowed in the complementary region of target genes with the miRNAs, whereas mismatch inhibition was maintained at 10th and 11th nucleotide position along the aligned region. Target genes were identified against Arabidopsis thaliana transcript, TAIR V10 as genome or transcriptome sequences of Dendrobiumnobile are not available in public domain.

Gene Ontology, KEGG pathway and Phylogenetic analysis:

Annotations of the target genes were carried out using a Blastx analysis with an e-value of 10 3 against the NCBI non-redundant protein database. Blast2go version 5.2 (https://www.blast2go.com/blast2go-pro/) was used for the gene ontology and KEGG (Kyoto Encyclopaedia of Genes and Genomes) pathway analysis of the annotated target genes in order to assess the phenotypic traits which may get affected by expression of the identified miRNAs of D. nobile [29]. The phylogenetic trees were constructed using MEGA7 – a Windows OS based software. The precursor sequences of family members of the identified miRNAs, belonging to other plant species were downloaded from miRBase and collated with the D.nobile miRNA precursors. Multiple sequence alignment was carried out using MUSCLE algorithm and phylogenetic trees developed using the Maximum likelihood approach.

Results:

miRNA identification and characterization:

From a total of 15,383 published ESTs of D. nobile, 306 of them showed homology with previously deposited miRNAs in miRBase 22.1. Following the criteria given by Axtell and Meyers (2018) for plant miRNA annotation, these were further filtered to retain only the miRNAs ≥ 19 nucleotides in length [15]. As a result only 249 miRNAs were taken from which further removal of redundancies in miRNAs and ESTs yielded 247 potential miRNA sequences. Blastx analysis of these ESTs against the NCBI non-redundant database resulted in identification of 89 sequences as non-coding sequences.

miRNA secondary structure:

The potential miRNAs were subjected to structural validation analysis in Mfold v3.5 for prediction of miRNA secondary structure. The miRNAs which showed valid stem-loop hairpin precursor, presence of complementary miRNA* sequence in the precursor with less than 6 mismatches, and an MFEI value greater than 0.5 were considered for further analysis of their target genes. Fourteen such conserved miRNAs were identified belonging to three miRNA families (Figure 2). miR528 is represented by two members, miR390 represented by 11 members and miR414 by one member. The ΔG values ranged from -47.8 to -35.5 kcal/mol. It is often considered that, lower the value of ΔG, higher is the thermodynamic stability of the miRNA precursor [30]. A lower value of ΔG corresponds to a higher MFEI value as MFE is equivalent to (-ΔG). miRNA characterization indicates that the precursor length of miRNAs varied between 79-169 bases and the mature miRNA length ranged from 19 to 21 nucleotides (Table 1).
Figure 2

Secondary hairpin structure of precursor sequences of three identified miRNA families.

Table 1

Identified putative miRNAs of D.nobile from ESTs

Accession no.miRNAMature miRNA sequencePL*(C+G)%MFEAMFEMFEI
HO190899.1zma-miR528a-5pTGGAAGGGGCATGCAGAGGAG7951.335.546.710.91
zma-miR528b-5pTGGAAGGGGCATGCAGAGGAG7951.335.546.710.91
HO191179.1>cca-miR390AAGCTCAGGAGGGATAGCG1074342.5539.760.92
>lus-miR390aAAGCTCAGGAGGGATAGCGCC10643.442.9540.510.93
>lus-miR390bAAGCTCAGGAGGGATAGCGCC10643.442.9540.510.93
>csi-miR390b-5pAGCTCAGGAGGGATAGCGCC10543.8141.6539.660.9
>lus-miR390cAAGCTCAGGAGGGATAGCGCC10643.442.9540.510.93
>ppt-miR390c-5pAGCTCAGGAGGGATAGCGCC10543.8141.6539.660.9
>lus-miR390dAAGCTCAGGAGGGATAGCGCC10643.442.9540.510.93
>gma-miR390eAGCTCAGGAGGGATAGCGCC10543.8141.6539.660.9
>gma-miR390fAAGCTCAGGAGGGATAGCGCC10643.442.9540.510.93
>gma-miR390gAAGCTCAGGAGGGATAGCGCC10643.442.9540.510.93
>atr-miR390.1TAAAGCTCAGGAGGGATAGCG11141.4445.2540.760.98
HO194934.1>ath-miR414GACGATGATGATGAAGATGA16947.9347.828.280.59
Precursor Length

Target gene prediction and annotation:

It has been demonstrated in several studies that most plant miRNAs bind to their target mRNA sequences with perfect or near-perfect sequence complementarity [31, 32]. This provides an effective approach for discovering probable miRNA targets by comparing and aligning miRNAs with mRNA sequences. In order to identify genes plausibly recognised by the potential miRNAs, psRNA Target - a web-based server was used for searching target genes against A. thaliana transcriptome acquired from TAIR10. A total of 138 genes were identified as target genes of 14 identified miRNAs, where 4 genes having unknown functions were discarded. Out of the 134 retained targets, only 3 genes exhibit translational repression by corresponding miRNAs whereas all the rest of the genes show cleavage mode of regulation (Table 2).
Table 2

Predicted target genes of D. nobile miRNAs.

miRNA_Acc.Target_Acc.ExpectDescriptionInhibition
dno-miR390AT3G17185.13predicted proteinCleavage
dno-miR390AT5G03640.13serine/threonine-protein kinaseTranslation
dno-miR390.1AT1G05500.13synaptotagmin-5Cleavage
dno-miR390.1AT1G78950.13beta-amyrinsynthaseCleavage
dno-miR390.1AT2G41600.53Mitochondrial glycoprotein familyCleavage
dno-miR390.1AT4G12980.13cytochrome b561 and DOMON domain-containing protein At4g12980Cleavage
dno-miR390.1AT5G11700.13ephrin type-B receptorCleavage
dno-miR390.1AT5G39862.13putative non-LTR retroelement reverse transcriptaseCleavage
dno-miR390bAT5G48480.13Lactoylglutathionelyase / glyoxalase I family proteinCleavage
dno-miR390b-5pAT3G52890.23serine/threonine-protein kinaseCleavage
dno-miR390c-5pAT1G47890.13receptor-like protein 12Cleavage
dno-miR390eAT5G05570.12.5transducin family protein / WD-40 repeat family proteinCleavage
dno-miR390eAT5G05570.22.5transducin family protein / WD-40 repeat family proteinCleavage
dno-miR390eAT3G11050.13putative ferritin subunit precursorCleavage
dno-miR390fAT4G32820.13Tetratricopeptide repeat (TPR)-like superfamily proteinCleavage
dno-miR390fAT4G32820.23Tetratricopeptide repeat (TPR)-like superfamily proteinCleavage
dno-miR414AT1G74890.10.5two-component response regulator ARR15-likeCleavage
dno-miR414AT1G80960.21F-box and Leucine Rich Repeat domains containing proteinCleavage
dno-miR414AT3G59220.11PRN1_ARATHRecName: Full=Pirin-1; AltName: Full=AtPirin1Cleavage
dno-miR414AT5G20370.11serine-rich protein-like proteinCleavage
dno-miR414AT5G23240.11DNAJ heat shock N-terminal domain-containing proteinCleavage
dno-miR414AT5G61510.11GroES-like zinc-binding alcohol dehydrogenase family proteinCleavage
dno-miR414AT1G05490.11.5SNF2 domain-containing protein CLASSY 3-likeCleavage
dno-miR414AT1G45160.11.5Protein kinasesuperfamily proteinCleavage
dno-miR414AT1G48970.11.5translation initiation factor eIF-2B subunit deltaCleavage
dno-miR414AT1G53770.21.5O-fucosyltransferase family proteinCleavage
dno-miR414AT1G78270.11.5UDP-glycosyltransferase 85A4Cleavage
dno-miR414AT2G15345.11.5Plant invertase/pectin methylesterase inhibitor superfamily proteinCleavage
dno-miR414AT2G17525.11.5pentatricopeptide repeat-containing protein At2g17525, mitochondrialCleavage
dno-miR414AT2G22000.11.5elicitor peptide 6 precursorCleavage
dno-miR414AT2G30790.11.5oxygen-evolving enhancer protein 2-1, chloroplasticCleavage
dno-miR414AT2G35960.11.5NDR1/HIN1-like protein 12Cleavage
dno-miR414AT2G35970.11.5NDR1/HIN1-like protein 12Cleavage
dno-miR414AT2G36460.21.5Aldolasesuperfamily proteinCleavage
dno-miR414AT3G13730.11.53-epi-6-deoxocathasterone 23-monooxygenaseCleavage
dno-miR414AT3G17100.11.5transcription factor bHLH147-likeCleavage
dno-miR414AT3G27640.11.5denticleless protein homologCleavage
dno-miR414AT3G43590.11.5protein AIR1Cleavage
dno-miR414AT4G16790.11.5glycoprotein homologCleavage
dno-miR414AT1G22850.12SNARE associated Golgi protein familyCleavage
dno-miR414AT1G26780.22transcription factor MYB117Cleavage
dno-miR414AT1G75180.22Erythronate-4-phosphate dehydrogenase family proteinCleavage
dno-miR414AT2G36320.12zinc finger A20 and AN1 domain-containing stress-associated protein 6-likeCleavage
dno-miR414AT3G13930.12dihydrolipoyllysine-residue acetyltransferase component 2 of pyruvatedehydrogenase complexCleavage
dno-miR414AT3G21380.12jacalin-related lectin 36Cleavage
dno-miR414AT4G32300.12G-type lectin S-receptor-like serine/threonine-protein kinase SD2-5Cleavage
dno-miR414AT4G37630.12cyclin d5Cleavage
dno-miR414AT4G39410.12probable WRKY transcription factor 13Cleavage
dno-miR414AT5G11720.12alpha-glucosidaseCleavage
dno-miR414AT1G05310.12.5probable pectinesterase 8Cleavage
dno-miR414AT1G13430.12.5P-loop containing nucleoside triphosphatehydrolasessuperfamily proteinCleavage
dno-miR414AT1G14920.12.5DELLA protein GAICleavage
dno-miR414AT1G15710.12.5Arogenatedehydrogenase 2, chloroplasticCleavage
dno-miR414AT1G21326.12.5Nuclear speckle RNA-binding protein BCleavage
dno-miR414AT1G26390.12.5berberine bridge enzyme-like 4Cleavage
dno-miR414AT1G28450.12.5agamous-like MADS-box protein AGL29Cleavage
dno-miR414AT1G44830.12.5ethylene-responsive transcription factor ERF014Cleavage
dno-miR414AT1G51640.12.5exocyst complex component EXO70A1Cleavage
dno-miR414AT1G52160.12.5tRNase Z TRZ3, mitochondrialCleavage
dno-miR414AT1G54160.12.5nuclear transcription factor Y subunit A-5Cleavage
dno-miR414AT1G60940.12.5serine/threonine-protein kinase SRK2ACleavage
dno-miR414AT1G66090.12.5Disease resistance protein (TIR-NBS-LRR class) familyCleavage
dno-miR414AT1G68720.12.5tRNA(adenine(34)) deaminase, chloroplasticCleavage
dno-miR414AT1G69690.12.5transcription factor TCP15-likeCleavage
dno-miR414AT1G71220.22.5UDP-glucose:glycoproteinglucosyltransferaseCleavage
dno-miR414AT2G01530.12.5MLP-like protein 328Cleavage
dno-miR414AT2G04620.12.5zinc transporter-like proteinCleavage
dno-miR414AT2G06850.12.5xyloglucanendotransglucosylase/hydrolaseCleavage
dno-miR414AT2G21530.12.5SMAD/FHA domain-containing proteinCleavage
dno-miR414AT2G23530.12.5cell division cycle-associated protein 7Cleavage
dno-miR414AT2G25110.12.5stromal cell-derived factor 2-like proteinCleavage
dno-miR414AT2G28610.12.5WUSCHEL-related homeobox 3Cleavage
dno-miR414AT2G32310.12.5CCT motif family proteinCleavage
dno-miR414AT2G35110.22.5protein NAP1 isoform X1Cleavage
dno-miR414AT2G37410.12.5mitochondrial import inner membrane translocase subunit TIM17-2-likeCleavage
dno-miR414AT2G43970.12.5la-related protein 6BCleavage
dno-miR414AT2G43970.22.5la-related protein 6BCleavage
dno-miR414AT2G45880.12.5beta-amylase 7Cleavage
dno-miR414AT2G47350.22.5HIT zinc finger and PAPA-1-like domain-containing proteinCleavage
dno-miR414AT2G47830.12.5metal tolerance protein C1Cleavage
dno-miR414AT3G01770.12.5transcription factor GTE9 isoform X1Cleavage
dno-miR414AT3G01830.12.5probable calcium-binding protein CML40Cleavage
dno-miR414AT3G02150.12.5transcription factor TCP13Cleavage
dno-miR414AT3G02150.22.5transcription factor TCP13Cleavage
dno-miR414AT3G22770.12.5putative F-box protein At3g23420Cleavage
dno-miR414AT3G23270.12.5Regulator of chromosome condensation (RCC1) family with FYVE zinc finger domain-containing proteinCleavage
dno-miR414AT3G24650.12.5B3 domain-containing transcription factor ABI3Cleavage
dno-miR414AT3G45190.12.5serine/threonine-protein phosphatase 6 regulatory subunit 3-like isoform X1Cleavage
dno-miR414AT3G49350.12.5GTPase-activating protein gyp7Cleavage
dno-miR414AT3G54920.12.5probable pectatelyase 13Cleavage
dno-miR414AT3G60790.12.5F-box protein At3g60790-likeCleavage
dno-miR414AT4G03030.12.5F-box/kelch-repeat protein OR23Cleavage
dno-miR414AT4G11600.12.5probable phospholipidhydroperoxide glutathione peroxidase 6, mitochondrialCleavage
dno-miR414AT4G18390.12.5transcription factor TCP2Cleavage
dno-miR414AT4G18780.12.5cellulose synthase A catalytic subunit 8 [UDP-forming]Cleavage
dno-miR414AT4G19830.12.5peptidyl-prolylcis-trans isomerase FKBP17-1, chloroplasticCleavage
dno-miR414AT4G23680.12.5MLP-like protein 328Cleavage
dno-miR414AT4G24340.12.5Phosphorylasesuperfamily proteinCleavage
dno-miR414AT4G27320.12.5universal stress protein PHOS34Cleavage
dno-miR414AT4G28620.12.5ABC transporter B family member 23, mitochondrialCleavage
dno-miR414AT4G29180.12.5root hair specific 16Cleavage
dno-miR414AT4G29180.22.5root hair specific 16Cleavage
dno-miR414AT4G30600.12.5signal recognition particle receptor subunit alpha-likeCleavage
dno-miR414AT4G34390.12.5extra-large GTP-binding protein 2Cleavage
dno-miR414AT4G35900.12.5bZIP transcription factorCleavage
dno-miR414AT5G03340.12.5cell division control protein 48 homolog ECleavage
dno-miR414AT5G03545.12.5expressed in response to phosphate starvation proteinCleavage
dno-miR414AT5G13640.12.5phospholipid:diacylglycerolacyltransferase 1Cleavage
dno-miR414AT5G16830.12.5syntaxin-21Cleavage
dno-miR414AT5G40630.12.5BAG family molecular chaperone regulator 2Cleavage
dno-miR414AT5G41410.12.5homeobox protein BEL1 homologCleavage
dno-miR414AT5G42780.12.5zinc-finger homeodomain protein 13Cleavage
dno-miR414AT5G47220.12.5ethylene responsive element binding factor 2 (ATERF2)Cleavage
dno-miR414AT5G48380.12.5probably inactive leucine-rich repeat receptor-like protein kinase At5g48380Cleavage
dno-miR414AT5G49740.12.5ferric reduction oxidase 7, chloroplasticCleavage
dno-miR414AT5G53730.12.5NDR1/HIN1-like protein 12Cleavage
dno-miR414AT5G56040.12.5probable LRR receptor-like serine/threonine-protein kinase At4g26540Cleavage
dno-miR414AT5G56860.12.5GATA transcription factor 21-likeCleavage
dno-miR414AT5G59030.12.5copper transporter 1Cleavage
dno-miR414AT1G12760.13Zinc finger, C3HC4 type (RING finger) family proteinCleavage
dno-miR414AT1G19770.13probable purinepermease 14Cleavage
dno-miR414AT1G68550.23ethylene-responsive transcription factor ERF118-likeTranslation
dno-miR414AT1G68552.13ethylene-responsive transcription factor ERF118-likeTranslation
dno-miR414AT1G69935.13protein SHORT HYPOCOTYL IN WHITE LIGHT 1Cleavage
dno-miR414AT2G23810.13tetraspanin-8Cleavage
dno-miR414AT2G42710.13Ribosomal protein L1p/L10e familyCleavage
dno-miR414AT4G31180.13aspartate--tRNAligase 2, cytoplasmicCleavage
dno-miR414AT5G50210.13quinolinatesynthase, chloroplasticCleavage
dno-miR414AT5G67520.13adenosine-5'-phosphosulfate (APS) kinase 4Cleavage
dno-miR528a-5pAT4G32770.12.5tocopherolcyclase, chloroplasticCleavage
dno-miR528a-5pAT1G80370.13cyclin-A2-4-likeCleavage
dno-miR528a-5pAT2G40920.13F-box/LRR-repeat proteinCleavage
dno-miR528a-5pAT5G62380.13NAC domain-containing protein 101-likeCleavage
dno-miR528b-5pAT5G17710.23Co-chaperone GrpE family proteinCleavage

GO and KEGG pathway analysis:

To further understand the regulatory functions of miRNAs, the target genes were subjected to Gene Ontology (level 2) and KEGG pathway enrichment analysis, using Blast2Go v5.2. The results suggested that D.nobile miRNAs were involved in regulation of 14 broadly defined biological processes and 3 basic molecular functions. The target genes were also found to be part of 9 different types of cellular components (Figure 3). Pathway enrichment analysis of target genes based on KEGG database demonstrated the participation of identified miRNAs in 34 different metabolism networks (Figure 4). These networks are involved in various important pathways such as purine metabolism, antibiotic synthesis, caffeine metabolism, pentose phosphate pathway and TCA cycle.
Figure 3

GO reports of the identified target genes showing percentage of sequences representing each class in three different categories viz. Biological processes, Cellular component and Molecular Function

Figure 4

KEGG pathway analysis reports of the target genes showing the number of genes belonging to each pathway.

Phylogenetic Analysis:

Phylogenetic analysis was carried out to understand the relationship between the identified miRNAs in D. nobile with the other plant species available in miRNA database for same family identification (Figure 5). No miRNAs have been reported for D. nobile in miRBase. Maximum likelihood method was used for carrying out three different phylogenetic analyses for three identified miRNA families and their representative members. miR390 is a conserved miRNA family and its members have reported in many important species including Arabidopsis, Brassica and rice, whereas miR528a and miR528b have been reported only in Zea mays. miR414 have been reported only in three species in miRBase viz. A. thaliana, Oryza sativa and Physcomitrella patens.
Figure 5

Neighbour joining Phylogenetic trees constructed using stem-loop precursor sequences for three different groups of miRNAs i.e. miR390, miR414 and miR528a and b. Entries marked with green dots have been identified from D. nobile ESTs.

Discussion:

Identification and annotation of genetic modulators help in deciphering the critical roles played by such components in regulation of specific biological processes and their associated cellular properties. miRNA's are considered as one such group of regulatory molecules which inhibit gene expression by cleavage mediated target mRNA degradation or translational repression. Before this study, no comprehensive work was done on identification of putative miRNAs from Expressed Sequence tags of D. nobile. In this research we considered all the important criteria such as the MFEI values, mismatch inhibition and sequence length which have been used for miRNA identification in other angiospermic species. The MFEI values of the 14 identified miRNAs in our work were mostly in the range of 0.5 to 1.0, among which 13 of them have MFEI values even greater than 0.9. As compared to the miRNAs identified in some other plants from EST sequences [33-36],this is a comparatively higher range of MFEI values, and a higher value of MFEI indicates greater thermodynamic stability of the secondary structure of the miRNAs, and hence lesser chance of encountering false positives. The G+C% of most of the miRNAs was found to be in the range of 41-47%, however only the members of miR528 family presented a G+C% value greater than 50. Among the predicted targets 13% genes are sequence specific transcription factors, 33% genes with various catalytic functions and 54% genes act as sequence specific DNA-binding, metal ion binding or protein binding factors. In the gene ontology analysis, the two main categories represented among the biological processes are cellular processes and metabolic processes (18% and 16% genes respectively). 22% of the target proteins have been found to be part of the nucleus, 18% proteins are present in various cell organelles and 13% proteins act as integral part of the cell membrane. Transcription factors (TFs) are the master regulators of gene expression patterns in eukaryotes, and are responsible for facilitation of growth and development in plants [37]. dno-miR414 identified in this study has been shown to target several transcription factors including those from MYB as well as TCP family of TFs. Members of MYB DNA-binding domain superfamily protein are involved in many important biochemical and physiological processes in plants [38]. Furthermore, previous studies have also reported that miR414 can target the MYB family transcription factors in Allium cepa, Solanum tuberosum and Brachypodium distachyon [39-41]. The plant-specific TCP (TEOSINTE BRANCHED 1, CYCLOIDEA, PCF 1 and 2) transcription factor family is involved in plant development throughout its vegetative phase, i.e. from seed germination until the formation of flowers and fruits [42]. Members of a few other families of transcription factors have also been found to be probable targets of dno-miR414, such as ERF, GATA and WRKY family of transcription factors. The ERF (Ethylene responsive) transcription factors are responsible for establishment of floral meristem and tissue repair processes [43]. GATA transcription factors (binding to GATA rich sequences) are the DNA motifs that have been mostly implicated in light-dependent gene regulation in plants [44],and the WRKY family of transcription factors has a significant role in regulation of abiotic stress responses in plants [45]. Our results also show that dno-miR414 and dno-miR528a may also target several genes which encode various F-box proteins. These proteins are characterized as components of the SCF ubiquitin-ligase complexes (Skp I, Cullin, and an F-box protein), in which they bind substrates for ubiquitin-mediated proteolysis [46]. Protein ubiquitination is considered as a critical post-translational modification process that is employed by eukaryotes in order to regulate various types of cellular processes [47]. Another important gene found to be targeted by dno-miR528 family is the co-chaperone that assists in protein folding mediated by HSP70 or HSP90 [48]. The KEGG pathway analysis also reveals involvement of miRNAs in regulation of genes associated with various significant metabolic pathways. Our findings have shown that ESTs can be a major source of functional information similar to previous reports of SSRs identified from ESTs [49].

Conclusions:

We report the mining of miRNAs from EST data in Dendrobium nobile. We describe 14 potential miRNAs from 3 ESTs of D. nobile. They belong to 3 miRNA families (miR390, miR528 and miR414) linking to transcription factor regulation, signal transduction, DNA and protein binding, and various cellular processes covering 34 different metabolic networks in KEGG. These results help in the understanding of miRNA-mRNAs functional networks in Dendrobium nobile.
  1 in total

1.  In silico identification of conserved miRNAs in the genome of fibre biogenesis crop Corchorus capsularis.

Authors:  Milad Ahmed; Foeaz Ahmed; Jamil Ahmed; Mst Rubaiat Nazneen Akhand; Kazi Faizul Azim; Md Abdus Shukur Imran; Syeda Farjana Hoque; Mahmudul Hasan
Journal:  Heliyon       Date:  2021-04-08
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.