Literature DB >> 35570207

Small RNA sequencing and identification of papaya (Carica papaya L.) miRNAs with potential cross-kingdom human gene targets.

Neha Jha1, Naman Mangukia1,2, Harshida Gadhavi1, Maulik Patel1,3, Mansi Bhavsar1, Rakesh Rawal4, Saumya Patel5.   

Abstract

Several studies have demonstrated potential role of plant-derived miRNAs in cross-kingdom species relationships by transferring into non-plant host cells to regulate certain host cellular functions. How nutrient-rich plants regulate host cellular functions, which in turn alleviate physiological and disease conditions in the host remains to be explored in detail. This computational study explores the potential targets, putative role, and functional implications of miRNAs derived from Carica papaya L., one of the most cultivated tropical crops in the world and a rich source of phytochemicals and enzymes, in human diet. Using the next-generation sequencing, -Illumina HiSeq2500, ~ 30 million small RNA sequence reads were generated from C. papaya young leaves, resulting in the identification of a total of 1798 known and 49 novel miRNAs. Selected novel C. papaya miRNAs were predicted to regulate certain human targets, and subsequent annotation of gene functions indicated a probable role in various biological processes and pathways, such as MAPK, WNT, and GPCR signaling pathways, and platelet activation. These presumptive target gene in humans were predominantly linked to various diseases, including cancer, diabetes, mental illness, and platelet disorder. The computational finding of this study provides insights into how C. papaya-derived miRNAs may regulate certain conditions of human disease and provide a new perspective on human health. However, the therapeutic potential of C. papaya miRNA can be further explored through experimental studies.
© 2022. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Entities:  

Keywords:  Carica papaya; Cross-kingdom; Next-generation sequencing; miRNA

Mesh:

Substances:

Year:  2022        PMID: 35570207      PMCID: PMC9107959          DOI: 10.1007/s00438-022-01904-3

Source DB:  PubMed          Journal:  Mol Genet Genomics        ISSN: 1617-4623            Impact factor:   2.980


Introduction

MicroRNAs (miRNAs) are one among the various classes of non-coding RNAs with a length of 18–24 nucleotides (O'Brien et al. 2018; Pirrò et al. 2021). They are evolutionarily conserved, single-stranded and extensively studied in plants, animals, insects, and viruses (Li et al. 2018). The primary function of miRNA is to regulate the expression of target genes by direct binding of miRNA to the complementary sequence in the 3' untranslated regions of the target messenger RNA (mRNA) within the RNA-induced silencing complex (RISC) by a mechanism called miRNA-mediated gene repression (O'Brien et al. 2018). There are two processes involved in this mechanism, viz., site-specific cleavage and translational repression (Gu and Kay 2010). The former occurs at the post-transcriptional level and involves argonaute proteins that cleave target mRNA preferentially at the 5' monophosphate via endonuclease activity, followed by mRNA degradation (Carthew and Sontheimer 2009). While the latter consists of the repression of initiation and post-initiation during translation, ribosome stalling and ribosome drop-off are also observed (Richter and Coller 2015). The process of translational repression is well established in animals, whereas there are only a few reports that indicate miRNAs' role in inhibition of translation initiation in plants (Huntzinger and Izaurralde 2011). It is estimated that miRNAs may regulate about 60% of human protein-coding genes with at least one known miRNA-binding site in their target sequences (Friedman et al. 2009). Plant and animal miRNAs regulate gene expression through subtle biogenesis mechanisms, with DROSHA in animals and DCL1 in plants carrying out the maturation process (Wang et al. 2019). miRNAs play a role in cellular processes like proliferation, differentiation, and apoptosis, as well as development, metabolism, immune response, and hormone signaling in animals (Libri et al. 2013; Bushati and Cohen 2007; Zhang et al. 2007). In contrast, plant-derived miRNAs are involved in stress response, homeostasis, and flowering processes (Dugas and Bartel 2004; Kruszka et al. 2012). The first pioneering work on detection of plant-derived miRNA (miR168a) in sera and tissues stemming from food ingestion in animals, and the subsequent inhibition of lipoprotein receptor adapter protein 1, laid the foundation for the concept of cross-kingdom regulation (Zhang et al. 2012). This study opened the avenue for researchers to investigate plant-derived miRNAs as potential bioactive molecules for therapeutic interventions. Food consumption is a major route for miRNA transfer from plant to animal, allowing exogenous miRNAs from the diet to influence critical cellular processes in the animal (Samad et al. 2021; Li et al. 2021). Emerging evidence also pointed out that such exogenous miRNAs can sustain in the human gastrointestinal tract (Mar-Aguilar et al. 2020). An alternative approach is to synthesize plant-derived functional miRNAs using bioengineering techniques to readily deploy in human disease models to promote the utilization of such miRNAs in human clinical trials. For example, oral administration of plant-derived miRNA159 suppresses breast cancer growth in mice (Chin et al. 2016). Recently, an in vitro study revealed that the plant miRNA156a found in dietary green vegetables can directly regulate junction adhesion molecule-A (JAM-A), one of the up-regulated genes of atherosclerotic lesions. This miRNA in human aortic endothelial cells was found to suppress JAM-A, implying that this miRNA could be used as a clinical intervention in anti-inflammatory treatments for cardiovascular disease (Hou et al. 2018). Li et al. in 2021 published a comprehensive review on plant-derived miRNAs on cross-kingdom regulation of human target genes/diseases covering various therapeutic applications (Li et al. 2021). Research supporting the cross-kingdom miRNA-mediated gene regulation is gaining attention, while some reports challenge the effectiveness and detection of orally taken plant-derived miRNAs. One such study by Dickinson et al. showed that miR168 derived from rice exhibited had no cross-kingdom regulation in the levels of LDLRAP1 found in mouse liver which contradicted Zhang et al.'s work. However, Zhang et al. defended this, saying that the results would have been based on a biased sequencing method between plants and animals (Zhang et al. 2012; Dickinson et al. 2013). Since then, the debate has continued based on the results by different groups of scientists who has provided evidence for or against the concept, of cross-kingdom transfer of miRNAs through various dietary sources and their regulation of mammalian genes over issues, such as methodology, contamination, and detection sensitivity (Li et al. 2021; Chen et al. 2013). Scientists reviewed evidences for and against the diet-derived miRNAs from plants, meat, milk, and exosomes. They concluded that the biological relevance of plant miRNA-mediated gene regulation via cross-kingdom transfer was inconclusive due to lack of reproducibility of experimental results (Samad et al. 2021; Mar-Aguilar et al. 2020). However, there was a study that provided convincing evidence that plant miRNAs in honeybees can regulate caste development and cause similar changes when fed to Drosophila (Zhu et al. 2017). Along these lines, computer-based prediction of human targets using plant-derived miRNAs encourages the narrows down the list of miRNAs from a large pool. Subsequent studies animal cell models can be done to validate these target genes and their possible functional roles in physiological and disease conditions (Jha et al. 2021; Patel et al. 2019a, b; Gadhavi et al. 2020). Carica papaya L. belongs to the Caricaceace family. It is a common and delicious fruit plant due to its nutritional and medicinal value attributed to the vast number of flavonoids, saponins, papain, and several vitamins (Nurowidah 2019; Yogiraj et al. 2014). Globally, it is primarily used as a traditional herbal medicine for treating various diseases (Hossain et al. 2020). Wall (2006) documented the application of C. papaya in treating several conditions, such as stomach disorders, diarrhea, skin diseases, male contraceptives, and home remedies for the common cold (Wall 2006). The extracts from C. papaya leaves, fruits, and seeds were shown to possess anti-cancer activities in colorectal, prostate, cervical, and breast cancers (Aravind et al. 2013; Lohsoonthorn and Danvivat 1995; Shahar et al. 2011; Pandey et al. 2017; Siegel et al. 2010). Moreover, few studies reported the significance of C. papaya leaves juice in increasing the platelet count among patients suffering from Dengue fever (Ahmad et al. 2011; Rajapakse et al. 2019). With the advent of next-generation sequencing technology and advanced bioinformatics methods, research efforts were actively pursued to identify miRNAs from expressed sequence tag (EST) and small RNA sequencing datasets (Jha et al. 2021; Patel et al. 2019a; Kumar et al. 2017). In 2012, Rishi et al. reported 60 miRNAs from leaf tissues, flowers and PSRV infected leaves, by analyzing the annotated miRNA expression revealed that PRSV-infected leaves have a higher accumulation of some miRNA* (Aryal et al. 2012). It has been reported in C. papaya that miRNAs (75 conserved and 11novel) isolated from young leaves and female flowers regulate genes involved in ethylene signaling pathway that implies their role during fruit development and ripening (Liang et al. 2013). However, most studies were performed to study how miRNAs regulate their regulatory pathways, implying an inter-species analysis. Although the inter-species analysis was made to study the role of C. papaya-derived miRNAs in regulating certain physiological conditions (e.g., fruit ripening, seed storage in freeze state), this is the first-such effort addressing how C. papaya-derived miRNAs regulate human target genes with strong associations in human diseases in an intra-species (also known as cross-kingdom) manner (Liang et al. 2013). We utilized the small RNA high-throughput technology to generate sequence reads of C. papaya young leaves and subsequently, identified the known and the novel miRNAs. The predicted novel miRNAs were found to regulate certain human genes involved in platelet activation, cancer, and mental illness. The putative role of targeted human genes in relevance to disease conditions was delineated using gene ontology and pathway enrichment analysis along with the literature-driven data to develop a hypothesis of miRNA-mediated gene regulation. We believe that this computational novel miRNA prediction using the cross-kingdom approach will shed light on the understanding of how C. papaya-derived miRNAs regulate human genes in disease states.

Materials and methods

Small RNA library construction of C. papaya leaves

A group of four C. papaya plants germinated from the seeds obtained from same source were grown on the grounds of Gujarat University Girls Hostel, Ahmedabad, Gujarat, at the ambient temperature of 30−37 °C. Young leaves were sampled from 2-month-old papaya plants in February 2020 and immediately frozen in liquid nitrogen for laboratory studies. The samples were stored at −80 °C until RNA isolation. The total RNA was isolated using ZR plant RNA miniprep Kit (Zymo Research). The quality and quantity of the isolated RNA were checked on 1% denaturing RNA agarose gel and Nanodrop/Qubit fluorometer, respectively. The small RNA sequencing libraries were prepared from the isolated total RNA using the TruSeq Small RNA library preparation kit (Eurofins Scientific, Bangalore, India) as per the manufacturer’s instruction. The protocol includes adapter ligation, reverse transcription, PCR amplification, and pooled gel purification to generate a library product. In this protocol, a RNA 3' adapter is specifically modified to target microRNAs and other small RNAs that have a 3' hydroxyl group resulting from enzymatic cleavage by Dicer or other RNA processing enzymes. The adapters are ligated to each end of the RNA molecule, and a reverse transcription reaction is used to create single-stranded cDNA. The cDNA is then PCR-amplified using a common primer and a primer containing one of the index sequences. The amplified PCR product with index sequences was size-selected and purified on a 6% TBE gel. For quantity and quality control (QC) of the library, the gel area pertaining to the average length of the library was selected and purified libraries were analyzed in the 4200 Tape Station system (Agilent Technologies) using High sensitivity D1000 Screen tape. Finally, the cluster generation and sequencing of the sample were carried out after obtaining the Qubit concentration for the libraries and the mean peak sizes from the Agilent Tape Station profile; the PE Illumina libraries were loaded onto HiSeq2500 for cluster generation and sequencing.

Bioinformatics data analysis and identification of miRNAs from C. papaya

Filtration and quality check of generated sequences

The raw reads generated by Illumina HiSeq2500 were analyzed using FastQC to check the quality of small RNA data (Andrews 2010). Cutadapt tool was utilized to remove adapters and low-quality bases. The high-quality reads were then mapped to the Rfam database (Rfam 13.0) (http://rfam.sanger.ac.uk/) (Martin 2011; Griffiths-Jones et al. 2003). Using the Burrows–Wheeler Aligner (BWA) package resulted in the removal of other non-coding RNAs (rRNA, tRNA, snRNA, SnoRNA) and degraded fragments of mRNA (Li and Durbin 2009). Sequences longer than 18 nt were used as a clean read for the identification of known and novel miRNAs. The schematic workflow is shown (Supplementary Material 5, Figure S1).

Identification of known and novel miRNAs in C. papaya and phylogenetic analysis

For the identification of known miRNAs, the remaining clean reads were aligned to the publicly available miRbase (Release 22.1: October 2018) database with plant species. The 10,414 plant mature miRNA sequences were used as a query in homology searches using Blastn against the clean reads with parameters set as follows “e-value 1000; maximum targets 1000; word size 17 and maximum mismatch 2”. The MIREAP program was used to predict novel miRNAs according to the following parameters: “minimal miRNA reference sequence length of 20 nt and maximal miRNA reference sequence length of 24 nt; minimal depth of Drosha/Dicer cutting site is 3; maximal copy number of miRNAs on the reference is 20; iv) maximal free energy allowed for a miRNA precursor was − 20 kcal/mol; maximal space between the miRNA and miRNA* is 35; minimal base pairs of the miRNA and miRNA* is 14; maximal bulge of the miRNA and miRNA* is 4; maximal asymmetry of the miRNA/miRNA* duplex, 5; and flank sequence length of miRNA precursor, 100” (Li et al. 2012). The selected sequences were then folded into a secondary structure using an RNA folding program UNAfold (Markham and Zuker 2008). Additionally, we have constructed the phylogeny tree to understand the known nature of identified novel miRNAs (Kumar et al. 2016).

Target prediction of known and novel miRNAs in human

To predict the putative target genes of the identified miRNAs, the mature miRNA sequences were queried against human transcripts using the psRNATarget webserver (Dai et al. 2018). Stringent parameters used for verifying complementary and highest homology between the miRNA and complementary human transcript sequences were: “multiplicity of target sites 2; maximum expectation value 3; range of central mismatch for translational inhibition 9–11 nucleotides; maximum mismatches at the complimentary site ≤ 4 without any gaps”.

Functional enrichment and pathway analysis of identified targeted genes

The target sequences identified from the psRNATarget webserver were then annotated through similarity searches against protein databases using the BlastX tool. Further, the result file was loaded the OmicsBox package which resulted in gene ontology annotations with Biological Process (BP), Molecular Function (MF), and Cellular Component (CC) levels based on a cut-off of p value < 0.05 (Conesa et al. 2005). To understand the complex network of regulatory pathways modulated by miRNAs, it is important to study the roles or functions of the identified miRNA target genes. Biological pathways of humans and involvement of genes were retrieved from the REACTOME database and KEGG maps p values < 0.05 were regarded as statistically significant (Fabregat et al. 2018; Kanehisa and Goto 2000).

Network regulatory interaction of targeted genes

To further analyze the function of predicted genes, Cytoscape software (version 3.6.1) was employed to construct and analyze the miRNA-hub gene network. Hub is the top 10% of the nodes in the high-confidence protein interactome ranked by node degree scores (the number of interactions that are associated with a given node), this high-confidence scores based on the experimental evidence (Tripathi et al. 2013). Hub nodes in these complexes were detected with several parameters, such as Degree, Bottleneck and MCC, DMNC, MNC, ECCENTRICITY and CLOSENESS, to assess its property, significance and organization using Cytohubba plugin (Chin et al. 2014).

Results

Sequencing and bioinformatics data analysis of C. papaya

The libraries were prepared from QC-passed RNA samples using Illumina TruSeq Small RNA library preparation kit and generated ~ 30 million reads (Supplementary Material 5, Figure S2). A total of 33,852,518 raw sequence reads were generated using the HiSeq2500 Illumina platform. A total of 25,092,134 (74.12%) clean reads were obtained after removing low-quality reads, short fragments, and adaptor sequences. The sequences of other non-coding RNAs leading to 23,11,591 (9.2%) reads were filtered out. In the remaining clean reads of 22,780,543 (67.3%), a length of 18–50 bp was used to identify known and novel miRNAs. The base frequency distribution of clean reads is shown in (Supplementary Material 5, Figure S3).

Identification of known miRNAs in C. papaya

A total of 1798 known miRNAs belonging to 127 families were identified from the small RNA population using BlastN homology search against mature plant miRNAs in the miRBase repository (version 22). The known miRNA families, miR156, miR166 and miR159 were identified which consisted of a maximum number of 163, 160 and 147 members, respectively while 20 miRNAs families contained only one member. The total read count of each miRNA was studied to obtain an outline of the apparent expression level of miRNA in C. papaya young leaf tissue. Our bioinformatic analysis revealed, the maximum expression was observed for the miR166 family, i.e., 147,868 reads, followed by the miR156 and miR396 families with 123,144 and 70,233 reads, respectively. 25 miRNA families were observed to have less than 10 reads (Fig. 1). Additionally, the length of the miRNAs was found in the range of 18–24 nt wherein the miRNAs containing 21 nt were the most abundant (31.45%) followed by miRNAs of length 20 nt (Supplementary Material 1).
Fig. 1

Reads and member distribution of identified known miRNAs. MiR156 and miR166 shows the highest family members and miR157 shows the lowest

Reads and member distribution of identified known miRNAs. MiR156 and miR166 shows the highest family members and miR157 shows the lowest

Identification of novel miRNAs in C. papaya

A total of 49 novel miRNAs were identified from the high-quality sequence reads. The maximum expression was observed for the Cpa_mir-m2262 i.e., 72,228 reads, followed by the Cpa_mir-m1481 with 72,161 reads respectively (Supplementary Material 1). The length of the putative novel miRNAs varied from 19 to 25 nt within which 51.61% were 20 nt long and the precursor length varied from 74–101 nt. Loong and co-workers have shown that miRNA precursors have lower minimum free energy (MFE) to ensure the thermodynamic stability of the hairpin loop (Loong and Mishra 2007). Here, the MFE value of the identified miRNAs was in the range of −26.3 to −53.4 kcal/mol (Table 1, Supplementary Material 2). To standardize the possible effect of sequence length and to differentiate miRNAs from other non-coding RNAs, the MFEI value was calculated and was found to be in the range of −0.64 to −1.76 indicating that the identified miRNAs were accurate. Further, the AMFE value of the predicted miRNAs was in the range of −40.41 to −64.93 kcal/mol (Table 1). Most miRNAs were located on the 3' arm of the secondary structure compared to the 5' arm. Out of 49, 36 miRNAs were in the 3'arm of secondary structure. Furthermore, (G + C) % and the (A + U) % ranged from 23.81% to 71.43% and 25% to 76.19%, respectively (Supplementary Material 3).
Table 1

Details and characterization of identified novel miRNAs in terms of MFE = minimal folding energy, AMFE = adjusted minimal folding free energy, MEFI = minimal folding free energy index

S. noNovel miRNAs of Carica papayaMature sequencesPrecursor lengthMFE (− Kcal/mol)AMFEMFEI
1Cpa_mir-m0016GGGGCCCAGAACGCUAAAAGGUAGG77 − 26.3 − 34.16 − 0.82
2Cpa_mir-m0063UCACUGCAGAUUUGAAUAUUCAUC101 − 53 − 52.48 − 1.56
3Cpa_mir-m0233GGUUGGUCGGGGCUUCUGAC96 − 41.9 − 43.2 − 0.64
4Cpa_mir-m0310AUCCGUGUGCGCAGGAACGGAUCU87 − 47.2 − 54.25 − 0.8
5Cpa_mir-m0370CACUGAAAAACGGAGCUUACCGGC92 − 42.2 − 45.87 − 0.75
6Cpa_mir-m0749CGCGUUGUGGACACGCUGACAUCG79 − 37.7 − 47.72 − 0.77
7Cpa_mir-m0440AGCGGUCAGAAGUGGCUCCGGCAC95 − 41.3 − 43.02 − 0.66
8Cpa_mir-m0508CGCACGGACGGCCUGAAUGCG96 − 45.2 − 47.58 − 0.7
9Cpa_mir-m0511UGGAUCUUCUUCUCUGGCAAGGG99 − 43.5 − 43.94 − 0.67
10Cpa_mir-m0974CUGCUGUGUGAACGAUGUGUGCAC97 − 41.5 − 42.78 − 0.73
11Cpa_mir-m1108ACGAACUGGAGCACACUGCAUGGC74 − 29.9 − 40.41 − 0.79
12Cpa_mir-m1192CACUGAAAAACGGAGCUUACCGGC92 − 42.2 − 45.87 − 0.75
13Cpa_mir-m1285CUGACGGCGCCAUCUCUCCUG95 − 46.9 − 47.86 − 0.85
14Cpa_mir-m1392AAUCUGGGUCGUUCGUGCGCGCGG84 − 40.6 − 48.33 − 0.69
15Cpa_mir-m1403UGUAUAUGUAUGCACAUGCCA77 − 47 − 61.04 − 1.74
16Cpa_mir-m1414AUAUGUAUGCACAUGCCACA83 − 49.2 − 59.28 − 1.76
17Cpa_mir-m1481AUAGAUCAUGUGGCAGUUUCA98 − 49.2 − 50.2 − 1.14
18Cpa_mir-m1488CUGACCGGGGCUGCUGACGUG73 − 47.4 − 64.93 − 0.91
19Cpa_mir-m1566UCUGACCGGGCCUGCUGACGUGGC75 − 43.1 − 57.47 − 0.83
20Cpa_mir-m1590UGCUGACUGGGCUAUUGACGUGGC101 − 41.6 − 41.19 − 0.64
21Cpa_mir-m1612CUGUCCGAUCGCGCUCACGC83 − 40.7 − 49.04 − 0.74
22Cpa_mir-m1645UGGACCGUCCGAUCGCGUUCACGC83 − 42.4 − 51.08 − 0.8
23Cpa_mir-m1648UUCUCUGGCGACUGGUUUUGC81 − 44.1 − 54.44 − 0.83
24Cpa_mir-m1990GCUUACCGGGCCUGCUGACGUGGC76 − 46.9 − 61.71 − 0.88
25Cpa_mir-m2256UUUCGGGACAGUCCGCGUGCG95 − 48.2 − 50.74 − 0.98
26Cpa_mir-m2262GAUCAUGUGGUAGCUUCACC91 − 41.8 − 45.93 − 0.97
27Cpa_mir-m2314CCCGGGUCGACGUGGUUUUGCC75 − 42.3 − 46.48 − 0.98
28Cpa_mir-m2379AGGACUUCCUGGAAUUCCGGGACU75 − 42.8 − 57.07 − 1
29Cpa_mir-m2511GUCCGAUCGUGCGCACACGG67 − 41.6 − 62.09 − 0.87
30Cpa_mir-m2749UGGGACGUACGCACUAUGUGC83 − 45.7 − 55.06 − 0.88
31Cpa_mir-m3042UGUGCACGCUUGAUCUAGGCCGU88 − 53.4 − 60.68 − 0.83
32Cpa_mir-m3110CUGACCGGGGCUGCUGACGUG72 − 43.2 − 60 − 0.82
33Cpa_mir-m3128UAUGUCUCUGAUUCUGUCAUC93 − 41.6 − 44.73 − 1.39
34Cpa_mir-m3254UUGGGCCAUCCGAUCGCGCGCACG88 − 43 − 48.86 − 0.68
35Cpa_mir-m3258AGAGAUCGAUGAACCGCUGCC98 − 44.4 − 45.31 − 0.99
36Cpa_mir-m3284AGCGACAGAUGACACAGAACGAUAGA81 − 20.8 − 25.68 − 0.55
37Cpa_mir-m3400CACGUGCUCCCCUUCUCCAAC88 − 43.5 − 49.43 − 0.89
38Cpa_mir-m3590UGCCAAAGGAGAGUUGCCCUG94 − 46.5 − 49.47 − 1.01
39Cpa_mir-m3812UGACGUGCCUGCUGACGUGG91 − 43.7 − 48.02 − 0.74
40Cpa_mir-m3953CAAGGCUAUGGAAAUUCUAA80 − 21.2 − 26.5 − 0.56
41Cpa_mir-m3967UGGUCAGGGCUGCUGACGUG77 − 40.2 − 52.21 − 0.82
42Cpa_mir-m4000CUGACCGGGGCUGCUGACGUG72 − 43.2 − 60 − 0.82
43Cpa_mir-m4028CGCACGGACGGCCUGAAUGAG92 − 40.2 − 43.7 − 0.66
44Cpa_mir-m4265UGUCCGUGUGCGUGAGAUCUGG92 − 44.4 − 48.26 − 0.75
45Cpa_mir-m4113CUGACGUGGCUGUUGAUGUGGCGC10141.240.790.76
46Cpa_mir-m4268UGGAUCUUCUUCUCUGGCAAGGG99 − 43.5 − 43.94 − 0.67
47Cpa_mir-m4298CGAAUCUGAGCCGUCCGAUCGCAC101 − 40.7 − 40.3 − 0.65
48Cpa_mir-m4304CUGACCGGGGCUGCUGACGUG71 − 43.6 − 61.41 − 0.82
49Cpa_mir-m4316GCACGCGGAUCGCCUAUGUGGCU96 − 40.8 − 41.63 − 0.68
Details and characterization of identified novel miRNAs in terms of MFE = minimal folding energy, AMFE = adjusted minimal folding free energy, MEFI = minimal folding free energy index

Phylogenetic tree analysis of novel C. papaya miRNAs

Sequence alignment was developed using the T-Coffee multiple sequence alignment (MSA) server and the evolutionary history was inferred using the Neighbor-Joining method (Kumar et al. 2016). The optimal tree was generated with the sum of branch length of 36.6. The evolutionary distances were computed using the maximum composite likelihood method, expressed in units of the number of base substitutions per site. This analysis involved 130 nucleotide sequences and removed all positions containing gaps and missing data. A total of 9 out of 49 novel miRNAs (Cpa_mir-m0016, Cpa_mir-m2379,Cpa_mir-m3128,Cpa_mir-m0440,Cpa_mirm3953,Cpa_mir-m3042, Cpa_mir-m3284, Cpa_mir-m3258, Cpa_mir-m0974 belonging to families Cpa-miR398, Cpa-miR8135, Cpa-miR159a, Cpa-miR8149,Cpa-miR8138, Cpa-miR8134, Cpa-miR8144, Cpa-miR8150, Cpa-miR408 respectively) were observed to be highly conserved in papaya family. The final refined phylogenetic tree is illustrated in Fig. 2.
Fig. 2

Phylogenetic tree of C. papaya shows the relationship between novel miRNA and reported miRNA of C. papaya miRNA family. Purple diamond indicates the identified novel miRNA

Phylogenetic tree of C. papaya shows the relationship between novel miRNA and reported miRNA of C. papaya miRNA family. Purple diamond indicates the identified novel miRNA

Human target prediction of known and novel miRNAs of C. papaya

The human target genes were predicted using the psRNATarget webserver for 1798 known and 49 novel C. papaya miRNAs. Most of the targets indicated the inhibition function as 'cleavage' and very few were classified in the 'translation' category. A total of 1643 and 383 targets were identified from known and novel miRNAs, respectively. The novel miRNA, Cpa_miR-m1403 mapped the maximum number of 233 targets, followed by Cpa-miRm0016 with 30 targets (Table 2). Overall, target genes were predicted for ~ 73% of known and ~ 78% of novel miRNAs.
Table 2

List of potential targets genes and their inhibition type of identified novel miRNAs

S. noNovel miRNAs of Carica papayaType of inhibitionTarget human genes
1Cpa_mir-m1403Cleavage

FAM227A, LRP8, HNF4A, SLC4A8, SCN2A, SLC12A2, BCLAF1, CASZ1, DGKG, PRDM7, ISPD, PLAG1, BEX4,PWWP2A, API5, IMPA1, FKBP5, ZEB2, CSRNP3, SLITRK4, VAMP4, LYPD6, BMPR1B, TMPRSS13, UNC5B, PRMT8, HAL, KCNJ3, AHNAK, CTGF, ELK4, HMGB2, KCNJ3, KIF3C, KPNA4, LAMP2, PRKCA, NR2C2, STAM, DENR, STBD1, BTAF1, CDC23, IRS1, TSPAN2, NFATC1, SPIN1, QKI, WWP2, MTF2, CD2AP, RABGAP1, TIMM10, FHOD1, DIEXF, VENTX, SV2B, SRGAP3, WDTC1, RTF1, SMCHD1, AMMECR1, PI15, FAM46A, ZNF562, SNRK, NRDE2, BLOC1S4, PLCXD1, ASH1L, AJAP1, FAM105A, USP53, PEAK1, KIAA0226L, TAOK1, RAI1, CALD1, WDR5B,NDFIP2,FAM160B1,TSEN34,IRX3,PLA2G4A,SH3TC2,FBXO9,SLC44A1,ZNF618,BMPER,KLHL23,TMEM196, PODN, NFATC1, NLRP8, WDR17, OPN5, WDR72, ADAMTS9, LONRF2, TMEM110, RASSF6, RPAIN, IGF2BP1, C16orf52, ATXN3, CELF5, SCAMP5, ZMYM2, QKI, KIT, GRIK3, SPOPL,TPM1, ITPRIPL2, NRF1, GLRA3, UNK, RELL1, TMEM192, NCOA7, CACFD1, AKIRIN1, SLC25A22, ANKRD44, CLRN1, IFRD1, EGR3, C18orf21, C18orf21, UBE2H, WISP1, CACFD1, FAM47E, STBD1, CLRN1, ZNF248, HOMER1, XTL3, IL1RAP, MATN1, PCK1, RBP3, SORT1, SPN, TFAM, TDGF1, ZNF264, BSN, AGPS, STK17B, HOMER1, CREB5, AQP3, ATXN3, ATF1, ACTR1A, LILRB2, SMC1AWDR3, MALT1, PCGF2, ZNF146, ZNF175, CASP8AP2, PITPNB, TLL2, WDR37, LEMD3, SLC7A11, ZNF585B, PDIK1L, VGLL2, CREBRF, FAM26E, OR2L13, PSME3,ZDHHC21, IL21R, IKZF3, ZBTB41, SRC, ZC3H6, GTF2H5, FAM174B, RAPH1, C4orf40ASCC3, ACSL4, AACS,

WDR82, DNAL1, IRX2, CTTNBP2, DYNLL2, GPR82, NXPE3, ZFF4, KIAA0355, STARD8, DTX4, STK36, SELT,

FAM120C, RIC8B, PARVA, CCDC91, SLC35E3, CCDC93, TM9SF3, NUFIP2, ZNF248, CACNA1I

TranslationEMP2, AGPS, ACSL4, SSH2, PIK3R1, PHLDA1, REEP3, L1CAM, CXADR, HOOK3
2Cpa_mir-m0016CleavageGRIN2A, FLT1, CHM, GSTM3, TTLL3, WIPI2, PHACTR2, PIRT, JAKMIP3, CRY2, HOGA1, AHCY, ARPC4-TTLL3, CALU, POU2F2, CLIC5, C20orf112, KLF7, ATG12, CA5A, EGF, POU2F2, IL18R1, ATG12, TESPA1, GIPC2, CRY2, HTR5A
3Cpa_mir-m0063CleavagePAM, TFEC, ANGPT1, DDHD1, PDK4, ZNF280D, PCDH11Y, PXT1, DNAH12
4Cpa_mir-m0370CleavageZNF99, NEURL1B, TSPAN5
5Cpa_mir-m0233CleavagePRC1, CBFA2T3, SCARA5
6Cpa_mir-m0440CleavageGRM5, PTCH1, ADAM22
7Cpa_mir-m0511CleavageC9orf47, ADCYAP1R1, GABPA, ARL11, SERPINF2
8Cpa_mir-m0749CleavageGPR55
9Cpa_mir-m0974CleavageEIF5AL1, DCC, CCNJL, C17orf103
10Cpa_mir-m1108CleavageKLF9, STAC, WNK1, DSN1
11Cpa_mir-m1285CleavageTRAPPC9, PFKL, RGS14
12Cpa_mir-m1285CleavageRGS14
13Cpa_mir-m1392CleavageADAMTSL1
14Cpa_mir-m1414CleavageGABRA6, DENND2C, SPAG7, ADAMTS4, INSIG2, FKBP14, TEAD1, CHMP4C
15Cpa_mir-m1481CleavageMBL2, CREM, PARG, SLC16A7, GAB2, ORC4, CREM, MYOZ3, PPP3R2
16Cpa_mir-m1590CleavageC1orf173, FASTKD2, FAM178A, RBM27, TXNDC17
17Cpa_mir-m4316CleavagePDCD6IP, DCLK1, COL15A1, ZBTB43, TECPR1, CLN8, FBXL18, ITPRIP
18Cpa_mir-m4268CleavageSFMBT2, NCALD, KCNS1, MSR1, UCP3, ZDHHC9, RALGAPB
19Cpa_mir-m4113CleavagePPM1A
20Cpa_mir-m3953CleavageLYSMD1, PPP1R12A, LOC729020, HOXA7, CRISPLD1, EML5
21Cpa_mir-m3590CleavageITGA4, KLF6, C22orf43
22Cpa_mir-m3400CleavageCACNB3, MID1IP1, KLHL26, PLEKHG1, ZXDC, C17orf107, NOS1, ILF3, ST8SIA4, PCNX, HPCAL4, TTC31, ILF3, GAB1
23Cpa_mir-m3128CleavageTSHZ2, DYNLT3, KIAA1549L, ITIH6
24Cpa_mir-m1648CleavagePKNOX2
25Cpa_mir-m1990CleavageC19orf53
26Cpa_mir-m2262CleavageNAT8L
27Cpa_mir-m2314CleavageTSPAN33
28Cpa_mir-m2379CleavageRBMS3, TGM4, GET4,
29Cpa_mir-m3042CleavageSIM1, PHC3
List of potential targets genes and their inhibition type of identified novel miRNAs FAM227A, LRP8, HNF4A, SLC4A8, SCN2A, SLC12A2, BCLAF1, CASZ1, DGKG, PRDM7, ISPD, PLAG1, BEX4,PWWP2A, API5, IMPA1, FKBP5, ZEB2, CSRNP3, SLITRK4, VAMP4, LYPD6, BMPR1B, TMPRSS13, UNC5B, PRMT8, HAL, KCNJ3, AHNAK, CTGF, ELK4, HMGB2, KCNJ3, KIF3C, KPNA4, LAMP2, PRKCA, NR2C2, STAM, DENR, STBD1, BTAF1, CDC23, IRS1, TSPAN2, NFATC1, SPIN1, QKI, WWP2, MTF2, CD2AP, RABGAP1, TIMM10, FHOD1, DIEXF, VENTX, SV2B, SRGAP3, WDTC1, RTF1, SMCHD1, AMMECR1, PI15, FAM46A, ZNF562, SNRK, NRDE2, BLOC1S4, PLCXD1, ASH1L, AJAP1, FAM105A, USP53, PEAK1, KIAA0226L, TAOK1, RAI1, CALD1, WDR5B,NDFIP2,FAM160B1,TSEN34,IRX3,PLA2G4A,SH3TC2,FBXO9,SLC44A1,ZNF618,BMPER,KLHL23,TMEM196, PODN, NFATC1, NLRP8, WDR17, OPN5, WDR72, ADAMTS9, LONRF2, TMEM110, RASSF6, RPAIN, IGF2BP1, C16orf52, ATXN3, CELF5, SCAMP5, ZMYM2, QKI, KIT, GRIK3, SPOPL,TPM1, ITPRIPL2, NRF1, GLRA3, UNK, RELL1, TMEM192, NCOA7, CACFD1, AKIRIN1, SLC25A22, ANKRD44, CLRN1, IFRD1, EGR3, C18orf21, C18orf21, UBE2H, WISP1, CACFD1, FAM47E, STBD1, CLRN1, ZNF248, HOMER1, XTL3, IL1RAP, MATN1, PCK1, RBP3, SORT1, SPN, TFAM, TDGF1, ZNF264, BSN, AGPS, STK17B, HOMER1, CREB5, AQP3, ATXN3, ATF1, ACTR1A, LILRB2, SMC1AWDR3, MALT1, PCGF2, ZNF146, ZNF175, CASP8AP2, PITPNB, TLL2, WDR37, LEMD3, SLC7A11, ZNF585B, PDIK1L, VGLL2, CREBRF, FAM26E, OR2L13, PSME3,ZDHHC21, IL21R, IKZF3, ZBTB41, SRC, ZC3H6, GTF2H5, FAM174B, RAPH1, C4orf40ASCC3, ACSL4, AACS, WDR82, DNAL1, IRX2, CTTNBP2, DYNLL2, GPR82, NXPE3, ZFF4, KIAA0355, STARD8, DTX4, STK36, SELT, FAM120C, RIC8B, PARVA, CCDC91, SLC35E3, CCDC93, TM9SF3, NUFIP2, ZNF248, CACNA1I

Gene ontology and functional enrichment of C. papaya target genes

Functional annotation was carried out using the Blast2GO program for known and novel miRNA-identified target genes. The Gene ontology results from known miRNA target genes indicated the involvement of 38% of genes in localization (GO: 0051179), response to stimulus (GO: 0050896), immune system process (GO: 0002376) and developmental process (GO: 0032502), 16% in molecular function and 8% in cellular components. On further analysis, majority of genes were associated in the molecular functions (GO: 0008152), translation regulatory activity (GO: 0045182), binding (GO: 0005488) and catalytic activity (GO: 0003824) (Fig. 3, Section A).
Fig. 3

The representative bar graphs illustrate the functional annotation analysis of C. papaya regulated human target genes. A Gene ontology for known miRNAs performed at level 2 for three main categories (cellular component, molecular function, and biological process). B Gene ontology for novel miRNAs performed at level 2 for three main categories (cellular component, molecular function, and biological process)

The representative bar graphs illustrate the functional annotation analysis of C. papaya regulated human target genes. A Gene ontology for known miRNAs performed at level 2 for three main categories (cellular component, molecular function, and biological process). B Gene ontology for novel miRNAs performed at level 2 for three main categories (cellular component, molecular function, and biological process) In addition, gene ontology studies were performed adapting the same procedure used for known miRNA to understand the comprehensive network of genes modulated by novel miRNAs. A total of 383 targets were annotated, out of which only 247 were found to cover various molecular functions, biological processes, and cellular components. Of genes involved in biological process, 120 genes were involved in cellular process (GO: 0009987), 32 in localization (GO: 0051179), 6 in signaling (GO: 0023052) and only one gene each were annotated as genes involved in growth (GO: 0040007), reproduction (GO: 0000003) and immune system response (GO: 0002376). Similarly, transcription regulator activity (GO: 0140110; 8 genes), catalytic activity (GO: 0003824; 99 genes), and binding (GO: 0005488; 93 genes) were the most represented GO terms. Moreover, the genes from the cell part (GO: 0110165) and membrane part (GO: 0032991) were the most represented among cellular component (Fig. 3, Section B).

Pathway analysis of predicted novel miRNAs targets

We further studied the novel miRNA targets and their potential role in functional annotation and pathways. Out of 383 gene targets, eight were related to transcription factors including TFEC, GABPA and ATF1 which are involved in ovarian cancer, breast cancer and carcinoma progression (Jin et al. 2018; Chen et al. 2018; Huang et al. 2016). The details of these targets are provided in Supplementary Material 4. SRC, one among the significant target genes, is a non-receptor tyrosine kinase and proto-oncogene involved in the MAPK signaling pathway and is highly activated in a wide variety of human cancers, such as pancreatic, colon, and breast cancer and Kaposi's sarcoma (Irby and Yeatman 2000). We also observed that 25% of genes were involved in various predominant pathways like WNT pathway, Nanog, ERK signaling, NF-kappa B, cAMP-dependent PKA, GPCR signaling cascade and pathways related to the response to elevated platelet cytosolic Ca2+. Similarly, genes, such as PHACTR2, CALU, SERPINF2, LRP8, DGKG, CXADR, SPN and SLC7A11, participate in platelets activation, aggregation, and response to platelet cytosolic Ca2+.

Network analysis and disease association of novel miRNA targets of C. papaya

Using cytohubba program, we could choose top ten hub genes among 383 target genes, viz. GRIN2A, CTGF, EGF, NFATC1, TAOK1, PPP3R2, PIK3R1, CLIC5, SRC and TPM1 scored 96 to 15 wherein the SRC gene secured the top rank followed by EGF (Fig. 4, Table 3). Nonetheless, these genes were participating in various downstream signaling pathways including PIP3 activation-based AKT signaling and GAB1 signalosome with a p value < 0.05. The identified genes are associated with various diseases categorized as mental illness, cancer, syndrome, diabetes, and other platelets disorder (Supplementary Material 4).
Fig. 4

Gene network analysis showing association between significantly modulated genes. Top ten genes marked with red color have been detected from bottleneck method

Table 3

Top hub nodes calculated through topological Analysis method

MCCDMNCMNCDegreeBottleneckEccentricityCloseness
SRCSRCSRCSRCSRCSRCSRC
EGFEGFEGFEGFNFATC1PIK3R1EGF
CTGFPIK3R1PIK3R1PIK3R1CLIC5NFATC1CTGF
PIK3R1CTGFCTGFCTGFCTGFCLIC5NFATC1
NFATC1NFATC1NFATC1NFATC1PIK3R1EGFCLIC5
CLIC5TPM1TPM1TPM1TPM1CTGFPIK3R1
TPM1CLIC5CLIC5CLIC5EGFGRIN2AGRIN2A
PPP3R2PPP3R2PPP3R2PPP3R2PPP3R2TPM1TPM1
GRIN2AGRIN2AGRIN2AGRIN2AGRIN2APPP3R2PPP3R2
TAOK1TAOK1TAOK1TAOK1TAOK1TAOK1TAOK1
Gene network analysis showing association between significantly modulated genes. Top ten genes marked with red color have been detected from bottleneck method Top hub nodes calculated through topological Analysis method

Discussion

MicroRNAs (miRNAs) are well-known non-coding RNAs that play a major role in the regulation of gene expression in both plants and animals. For the first time in 2012, Zhang et al., reported that consumption of plant-derived miRNA168 regulates the expression gene in mammals (Zhang et al. 2012). Inspired by this finding, the cross-kingdom approach has drawn much attention in the scientific community regarding plant-derived miRNAs in mammals. miR2911 from honeysuckle was reported to inhibit the replication of influenza A virus, besides inhibiting the SARS-CoV-2 replication and accelerates the recovery process in the infected patients (Zhou et al. 2015, 2020). Our study employs the cross-kingdom approach to determine the prospective involvement of C. papaya-derived miRNAs in regulating human target genes that may have functional consequences in physiological and pathological situations, such as platelet disorder, cancer, and mental illnesses. Owing to its nutrient-rich fruit, enzymes, and bio-components, ethno-botanical and gene expression investigations have revealed that C. papaya is well suited for the treatment of different ailments and diseases, such as diabetes, cancer, and immunomodulatory activities (Karunamoorthi et al. 2014). Researchers studied the miRNAs identified from a broad range of foods like fruit, leaves, meat, grains, and milk that regulate the gene expression in human. These miRNAs also have shown to serve as biomarkers for various pathological conditions like cardiovascular disorders, pancreatic cancer, and aid in liver cancer therapy (Khurana et al. 2021). Additionally, the bioavailability of exogenous miRNAs in the host depends on the source (Plant or Animal) and the pre-supplementation level. A number of studies have already reported that green leafy vegetables are rich in fibers, antioxidants, vitamins, and polyphenols. A similar study by Hou et al., in 2018 reported that miR156a and miR164a are abundantly expressed in green leafy veggies like spinach, lettuce and cabbage. These two miRNAs were observed to be stable in the serum when consumed orally (Hou et al. 2018). Henceforth, we hypothesized that papaya leaves which are consumed often due to their therapeutic qualities might have miRNAs that can be transferred in their stable forms into human hosts upon oral consumption. We used next-generation sequencing technology, Illumina HiSeq2500 to perform high-throughput sequencing of C. papaya young leaves, followed by bioinformatics analysis to identify known and novel miRNAs along with their putative targets in humans. Forty-nine novel miRNAs were identified that targeted 383 human genes, among which Cpa_miR-m1403 was found to regulate the highest number of target genes, i.e., 233 followed by Cpa_miR-m0016 with 30 target genes. Target annotation using the Blast2GO program unveiled that most of the targets were significantly enriched in the developmental processes, growth, localization, and signaling. Pathway analysis of these targeted human genes showed their participation in various predominant signaling cascades, including GPCR and ERK signaling, and response to elevated platelet cytosolic Ca2+ (Supplementary Material 4). Additionally, novel miRNAs like Cpa-mir-m0016, Cpa-mir-m0511, Cpa-mir-m1403 and Cpa-mir-m1481 target genes have direct association with pathological conditions like mental illnesses, diabetes, and cancer. In light of the primary functions of human gene targets, we investigated the probable roles of novel miRNAs (Cpa miR-m1403 and Cpa miR-m0016) in two diseases, dengue and cancer, as well as in mental disorders. Interestingly, we observed genes including PHACTR2, CALU, SERPINF2, LRP8, DGKG, CXADR, SPN, SLC7A11, and LAMP2 participate in platelets activation, aggregation, response to platelet cytosolic Ca2+. Thrombosis occurs due to blood clot formation wherein plasminogen activator inhibitor plays a leading role, and the SERPINF2 target gene acts as a potent plasmin inhibitor. It is anticipated that Cpa_miR-m1403 and Cpa_miR-m051 may regulate the expression of SERPINF2 and LRP8 genes in humans. Since SERPINF2 is one of the putative targets of Cpa_miR051, it would be interesting to explore the probability of this interaction in the activation of plasmin and subsequent reduction of circulating fragments. Variations in LRP8 gene can alter thrombosis, which presents a scenario in which clot formation could be affected in humans (Robertson et al. 2009). Similarly, PHACTR2, DGKG and SPN play a pivotal role in platelet secretion, activation and development, respectively (Fig. 5B) (Gorski et al. 2019). While, LAMP2 is a membrane glycoprotein that plays a vital role in activation-dependent platelet surface glycoprotein (Saboor et al. 2013). Che et al., 2017 showed that Mir27a is a critical regulator of autophagy in chronic brain hypo-perfusion (CBH) that affects LAMP2 protein expression at the post-transcriptional level (Che et al. 2017). SLC7A11 is a light chain subunit that can be triggered by drugs and glutamate and can cause ferroptosis (iron-dependent programmed cell death) in platelets and macrophages. However, the activity of SLC7A11 can be influenced by numerous factors that modulate GSH (glutathione), resulting in SLC7A11 suppression, which can cause ferroptosis (Fig. 5B) (Tang et al. 2020). Evidence suggests that Cpa_mir-m1403 can regulate SRC, the most predominant proto-oncogene highly expressed in distinct cancer types, such as breast, colon, pancreatic and gastric cancers (Irby and Yeatman 2000). It was demonstrated earlier that mir-23b mediated the inhibition of the SRC-AKT pathway leading to cisplatin resistance in chondrosarcoma cells (Huang et al. 2017). Further in vitro investigations are needed to better elucidate the role of novel Cpa_mir-m1403 in modulating the SRC gene expression. We also noted that 29% of the human gene targets in this study were implicated in brain diseases, which is consistent with previously established functions of miRNA-mediated gene regulation mechanisms in anxiety, schizophrenia, and psychiatric illnesses (Murphy and Singewald 2019; Hunsberger et al. 2009).
Fig. 5

A Four novel miRNA and their human targets are depicted by circos diagram, and their connection is shown by respective color links. B Novel miRNAs and their targeted genes involved in platelet disorders

A Four novel miRNA and their human targets are depicted by circos diagram, and their connection is shown by respective color links. B Novel miRNAs and their targeted genes involved in platelet disorders One of the noteworthy findings of this study was the top ten hub genes deduced from the interaction and significance of nodes in the network; GRIN2A, CTGF, EGF, NFATC1, TAOK1, PPP3R2, PIK3R1, CLIC5, SRC, and TPM1 constituted influential role in pathway involvement and biological processes (Fig. 4). It was reported that in breast cancer and melanoma, oncogene activation caused by GRIN2A, SRC, and TPM1 resulted in a severe reduction in patient survival (D'mello et al. 2014; Botezatu et al. 2016). It is expected that both novel C. papaya miRNAs, Cpa_mir-m0016 and Cpa_mir-m1403, will likely control the expression of activated oncogenes by inhibiting the progression of malignancy. This cross-kingdom analysis showed that the gene PIK3R1, a prominent regulator of isomer PI3K may govern the PI3K pathway, which has documented evidence of profound implications in breast cancer and as an independent prognostic marker (Cizkova et al. 2013). Few miRNAs have previously been shown to reduce PIK3R1 overexpression responsible for HCC (hepatocellular carcinoma) proliferation. It will be exciting to elucidate how Cpa_mir-m1403 regulates the PIK3R1 expression, which has a significant role in the modulation of tumor growth (Huang et al. 2015), and further in vitro studies are needed in this direction to this hypothesis. Overall, top hub genes are plausible regulators of signaling pathways and responsible for modulating various downstream targets by interacting with various protein complex networks. We earlier reported a cross-kingdom analysis to identify novel C. papaya miRNAs from EST sequences available in the NCBI database and studied their putative role in regulating various types of cancer and platelet regulatory pathways. The novel miRNAs, cpa-miR6034 and cpa-miR3629b belong to two unique miRNA families EX294530.1 and EX271034.1, and targeted 38 human targets (Nurowidah 2019). Unfortunately, these EST sequences are of mixed type and sequenced from C. papaya leaf, root, stem, flower, fruit, and seed, published as one of the sequence entries of papaya draft genome by Alam and his co-workers (2008) (Ming et al. 2008). We could not discern which plant part these two novel miRNAs belong to. Since the young leaf samples of C. papaya are sequenced in this study, we can conclusively claim that the identified novel miRNAs originate exclusively from the plant leaves. A thorough search for C. papaya leaf sequence reads in the NCBI SRA database linked to 3 Bio Projects (as of 28/10/2021) dealing with the study of gene expression patterns in freeze–thaw treated C. papaya leaves, gene expression profiling for immune response infected with Papaya leaf-distortion mosaic virus, and effect of miRNAs in leaf juice for blood platelet count (An et al. 2020). Although numerous C. papaya SRA sequence reads are publicly available, a single study on miRNA identification has been published so far, which reported C. papaya-specific miR-c4, which was predicted to regulate the ethylene signaling pathway (Liang et al. 2013). On the other hand, C. papaya-derived miRNAs from fruit parts obtained from high-throughput sequencing were investigated for their role in the ripening and developmental stages of the fruit but not devoted to human disease association (Cai et al. 2021). To the best of our knowledge, the present study is the first such effort made to derive miRNAs from small RNA sequencing data and explore their role in human gene regulations using a cross-kingdom approach. The present study also augments the knowledge of novel C. papaya miRNAs that can be exploited further in biochemical and clinical studies. Although this cross-kingdom analysis utilizes sequence homology techniques and literature-driven data to sketch the possible regulatory mechanism of selected novel C. papaya miRNAs in certain human target genes, it is critical to validate these research hypotheses through wet-lab experiments to thoroughly comprehend the role of C. papaya miRNAs in controlling or alleviating disease conditions, with implications in dengue, cancer, and mental illness as our study lacks more in vivo and in vitro evidence.

Conclusion

Over 30 million sequence reads were generated from a small RNA library to explore the role of miRNA from C. papaya young leaf and effectively identify the known and novel miRNAs. A total of 1798 known and 49 novel C. papaya miRNAs were identified, which were further assessed for their cross-kingdom regulation in human gene targets. Novel miRNAs, such as Cpa_miR-m1403 and Cpa_miR-m0016, are predicted to target 70% of genes out of 383 predicted human targets. The identified target genes engaged in various signaling pathways viz. WNT, Nanog, MAPK-AKT, ERK signaling, platelet activation and aggregation. The network interaction study revealed top hub genes, namely GRIN2A, CTGF, EGF, NFATC1, TAOK1, PPP3R2, PIK3R1, CLIC5, SRC, and TPM1, which were involved in various signaling pathways and acted as oncogenes. This cross-kingdom study elucidates possible gene target-disease association in diabetes, numerous types of cancers, mental illnesses, and platelet disorders. Collectively, the human genes targeted by C. papaya-derived novel miRNAs may unravel the biological role of these miRNAs in regulating numerous downstream signaling cascades. However, further in vitro validation using techniques such as RNA interference is required to promote the application of the cross-kingdom approach in clinical interventions. Below is the link to the electronic supplementary material. Supplementary file1 (XLSX 64 KB) Supplementary file2 (PDF 1500 KB) Supplementary file3 (XLSX 22 KB) Supplementary file4 (XLSX 49 KB) Supplementary file5 (PDF 410 KB)
  65 in total

Review 1.  Role of Src expression and activation in human cancer.

Authors:  R B Irby; T J Yeatman
Journal:  Oncogene       Date:  2000-11-20       Impact factor: 9.867

Review 2.  Origins and Mechanisms of miRNAs and siRNAs.

Authors:  Richard W Carthew; Erik J Sontheimer
Journal:  Cell       Date:  2009-02-20       Impact factor: 41.582

3.  The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus).

Authors:  Ray Ming; Shaobin Hou; Yun Feng; Qingyi Yu; Alexandre Dionne-Laporte; Jimmy H Saw; Pavel Senin; Wei Wang; Benjamin V Ly; Kanako L T Lewis; Steven L Salzberg; Lu Feng; Meghan R Jones; Rachel L Skelton; Jan E Murray; Cuixia Chen; Wubin Qian; Junguo Shen; Peng Du; Moriah Eustice; Eric Tong; Haibao Tang; Eric Lyons; Robert E Paull; Todd P Michael; Kerr Wall; Danny W Rice; Henrik Albert; Ming-Li Wang; Yun J Zhu; Michael Schatz; Niranjan Nagarajan; Ricelle A Acob; Peizhu Guan; Andrea Blas; Ching Man Wai; Christine M Ackerman; Yan Ren; Chao Liu; Jianmei Wang; Jianping Wang; Jong-Kuk Na; Eugene V Shakirov; Brian Haas; Jyothi Thimmapuram; David Nelson; Xiyin Wang; John E Bowers; Andrea R Gschwend; Arthur L Delcher; Ratnesh Singh; Jon Y Suzuki; Savarni Tripathi; Kabi Neupane; Hairong Wei; Beth Irikura; Maya Paidi; Ning Jiang; Wenli Zhang; Gernot Presting; Aaron Windsor; Rafael Navajas-Pérez; Manuel J Torres; F Alex Feltus; Brad Porter; Yingjun Li; A Max Burroughs; Ming-Cheng Luo; Lei Liu; David A Christopher; Stephen M Mount; Paul H Moore; Tak Sugimura; Jiming Jiang; Mary A Schuler; Vikki Friedman; Thomas Mitchell-Olds; Dorothy E Shippen; Claude W dePamphilis; Jeffrey D Palmer; Michael Freeling; Andrew H Paterson; Dennis Gonsalves; Lei Wang; Maqsudul Alam
Journal:  Nature       Date:  2008-04-24       Impact factor: 49.962

4.  Ocimum basilicum miRNOME revisited: A cross kingdom approach.

Authors:  Maulikkumar Patel; Shanaya Patel; Naman Mangukia; Saumya Patel; Archana Mankad; Himanshu Pandya; Rakesh Rawal
Journal:  Genomics       Date:  2018-05-15       Impact factor: 5.736

5.  MicroRNAs in mental health: from biological underpinnings to potential therapies.

Authors:  Joshua G Hunsberger; Daniel R Austin; Guang Chen; Husseini K Manji
Journal:  Neuromolecular Med       Date:  2009-06-20       Impact factor: 3.843

6.  The Reactome Pathway Knowledgebase.

Authors:  Antonio Fabregat; Steven Jupe; Lisa Matthews; Konstantinos Sidiropoulos; Marc Gillespie; Phani Garapati; Robin Haw; Bijay Jassal; Florian Korninger; Bruce May; Marija Milacic; Corina Duenas Roca; Karen Rothfels; Cristoffer Sevilla; Veronica Shamovsky; Solomon Shorser; Thawfeek Varusai; Guilherme Viteri; Joel Weiser; Guanming Wu; Lincoln Stein; Henning Hermjakob; Peter D'Eustachio
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

Review 7.  Ferroptosis: molecular mechanisms and health implications.

Authors:  Daolin Tang; Xin Chen; Rui Kang; Guido Kroemer
Journal:  Cell Res       Date:  2020-12-02       Impact factor: 25.617

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Honeysuckle-encoded atypical microRNA2911 directly targets influenza A viruses.

Authors:  Zhen Zhou; Xihan Li; Jinxiong Liu; Lei Dong; Qun Chen; Jialing Liu; Huihui Kong; Qianyi Zhang; Xian Qi; Dongxia Hou; Lin Zhang; Guoquan Zhang; Yuchen Liu; Yujing Zhang; Jing Li; Jin Wang; Xi Chen; Hua Wang; Junfeng Zhang; Hualan Chen; Ke Zen; Chen-Yu Zhang
Journal:  Cell Res       Date:  2014-10-07       Impact factor: 25.617

10.  Plant microRNAs in larval food regulate honeybee caste development.

Authors:  Kegan Zhu; Minghui Liu; Zheng Fu; Zhen Zhou; Yan Kong; Hongwei Liang; Zheguang Lin; Jun Luo; Huoqing Zheng; Ping Wan; Junfeng Zhang; Ke Zen; Jiong Chen; Fuliang Hu; Chen-Yu Zhang; Jie Ren; Xi Chen
Journal:  PLoS Genet       Date:  2017-08-31       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.