Literature DB >> 26770029

Computational identification of putative miRNAs and their target genes in pathogenic amoeba Naegleria fowleri.

Dyavegowda Padmashree1, Narayanaswamy Ramachandra Swamy1.   

Abstract

Naegleria fowleri is a parasitic unicellular free living eukaryotic amoeba. The parasite spreads through contaminated water and causes primary amoebic meningoencephalitis (PAM). Therefore, it is of interest to understand its molecular pathogenesis. Hence, we analyzed the parasite genome for miRNAs (microRNAs) that are non-coding, single stranded RNA molecules. We identified 245 miRNAs using computational methods in N. fowleri, of which five miRNAs are conserved. The predicted miRNA targets were analyzed by using miRanda (software) and further studied the functions by subsequently annotating using AmiGo (a gene ontology web tool).

Entities:  

Keywords:  Computational approach; Naegleria fowleri; PAM; Prediction; miRNAs

Year:  2015        PMID: 26770029      PMCID: PMC4702033          DOI: 10.6026/97320630011550

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

The genus Naegleria is a free-living amoeba, most commonly found in warm fresh water and soil across the world [1, 2]. More than 40 species of Naegleria have been identified but only Naegleria fowleri (N. fowleri) acts as a pathogenic species in human [3, 4]. N. fowleri is a thermophilic organism primitively grown at 37°C and can survive with temperature upto 45°C [5, 6]. N. fowleri exists in three stages; trophozoite (amoeba), flagellate and cyst. It enters body through the nasal route along the olfactory nerve tracts and finally infects central nervous system causing the disease primary amoebic meningoencephalitis (PAM) [7, 8]. The mitochondrial genome and a 60-kb nuclear DNA segment are responsible for PAM [9]. Once the host is infected, the symptoms observed are severe headache, vomiting, stiff neck, nausea, fever, etc., and the host dies within one to two weeks [7, 4]. Previous reports have shown that, the presence of liver hydrolysate (LH) enhanced pathogenicity of N. fowleri [10]. Reports about RNA interference in protozoan parasites [11], motivated us to identify the putative miRNAs and their targets in this protozoan parasite. microRNAs (miRNAs) are endogenous, single stranded, small regulatory RNA molecules which are non-coding and are ~20- 25 nucleotides in length [12– 14]. MiRNAs form complementary base pairs to target mRNA which controls the expression of gene by inducing degradation or repressing the translation of mRNA or chromatin modification [14]. In 1993 Victor Ambros discovered first miRNA, lin-4 in Caenorhabditis elegans [15]. Today thousands of miRNAs have been identified in various multicellular organisms, which include drosophila, plants, viruses, vertebrates, nematodes etc., [16– 20]. Recent studies have reported the identification of miRNA in Entamoeba histolytica and Chlamydomonas reinhardtii [21, 22]. MiRNA is a key regulator in various cellular and biological processes, which have been experimentally demonstrated in cell cycle, apoptosis, developmental stages, differentiation, response to stress, and in various types of cancers [23– 25]. Following three procedures are involved in identifying miRNA (a) Classic cloning methods; (b) Deep sequencing method and (c) Computational approach. Computational approach can be further divided into three types: (i) ab initio prediction based on the sequence and structural features; (ii) comparative genomic strategy based on evolutionary conservation; and (iii) integrated approach. In this study, we used computational approach based on sequence and structural feature to identify miRNA in N. fowleri and their targets.

Methodology

Collection of data:

In this study, nucleotide sequences of DNA-contigs of N. fowleri were downloaded from NCBI ( http://www.ncbi.nlm. nih.gov/). Each contigs containing DNA nucleotides were downloaded and stored in FASTA format which were used for further analyses. The following steps were followed to identify miRNA genes from N. fowleri (Figure 1).
Figure 1

The flow chart followed for identification of miRNA in N. fowleri.

Prediction of miRNA genes by computational methods:

N. fowleri strain ATCC 30863, genome size 27.79Mb, consists of 574 DNA contigs, which were downloaded from NCBI ( http://www.ncbi.nlm.nih.gov/). Individual contigs were subjected to scan by using software called Einverted EMBOSS which is available in the website http://emboss. bioinformatics.nl/cgibin/emboss/einverted. EMBOSS (the European Molecular Biology Open Software Suite) is an open source software analysis package. This package contains Einverted, which is used to find DNA inverted repeats in nucleotide sequences to form secondary structure of DNA. It is also used to identify the sequences that can form hairpin like structure. In our study, we used parameter condition as described; threshold score 30, match score 3, mismatch score 4, gap penalty 6, maximum repeat length 200. The resulted outputs from Einverted EMBOSS were further filtered by collecting the sequences which are greater than and equal to (≥) 15 base pairs in complementary strand and the length was separated by 40 bases in second step, this avoided more bulges and mismatches in the secondary structures. Since, the precursor miRNA length varies [26], we collected the sequences of 60-120 nucleotides in length in third step. In the Fourth step, secondary structures of the miRNAs were predicted by using RNA-fold program with minimum folding free energy (MFE) –20kcal/mol. In the fifth step, the miRNA sequences were reduced to 30-60% by screening GC content. The results were subjected to screening by using BlastX program to remove the protein coding sequences and excluded them in sixth step. In next step, results for miRNA candidates were further narrowed by using miPred software (http://www.bioinf.seu. edu.cn/ miRNA/) to distinguish the real miRNA from that of a pre-miRNA like hairpin and pseudo miRNA precursor in ninth step [27]. Further, we removed the homologous sequences and repetitive sequences using the program RepeatMasker version 4.0.5 [28]. Finally, mature miRNA sequences were identified using mature naïve bayes [29].

Identification of conserved miRNA:

To identify the conserved miRNA candidates in N. fowleri, we retrieved all mature miRNA sequences from miRBase Release 21 (http://www.mirbase.org/) [30]. A BlastN search was performed for all mature miRNA sequences from miRBase against 245 precursor miRNA of N. fowleri with default parameters and e-value 0.001. The results of blastN were further analyzed by applying two conditions: (1) more than 90% identity between the sequences and (2) mismatches between the sequences not more than three bases Table 1 (see supplementary material).

Prediction of miRNA targets:

We used miRanda software version 3.3a released in 2010 to identify the target genes in N. fowleri. The following parameters were used in these studies; match with minimum threshold score of 120; target mRNA duplex with minimum folding free energy threshold -20kcal/mol; gap opening penalty -8; gap extension penalty -2; scaling parameter 4 for complementary nucleotide match score. The output of results were analyzed by using following conditions as (1) Counting nucleotide position started from 5' end of miRNA; (2) the sequence from two to eight nucleotides having complete complementary region with respective mRNA (miRNA: mRNA) not more than two mismatch is allowed; (3) no mismatch from position 2 to 4 (5' end miRNA); (4) not more than one gap is allowed in the alignment; (5) not more than one G-U pairing allowed in the seed region 6) at least 18 nucleotide alignment between complementary region with respective mRNA (miRNA: mRNA).

Gene ontology of target genes:

For better understanding of target gene functions and metabolic role in N. fowleri, the target genes were subjected to BLAST in AmiGo version 1.8 ( http://amigo1. geneontology.org/cgi-bin/amigo/go.cgi), the annotated results of target genes based on sequence similarity against NCBI and SwissProt database. For each target gene best hit were selected. Gene ontology was classified into biological processes, molecular functions and cellular components with the GO terms at AmiGo.

Results & Discussion

Computational prediction of miRNA in N. fowleri:

Different computational approaches are used to identify miRNA in both animal and plants [16, 17& 25]. In this study, we used existing methodology with slight modifications [21]. The collected DNA contigs of N. fowleri from NCBI were scanned in Einverted EMBOSS software. The resulted output hits of Einverted EMBOSS software is 54896, this contain multiloop structures and gap formed in hairpin like structures. Einverted program inverted the sequences that can form reverse complementary sequence to form hairpin loop like structure which contain mismatches and bulges in the stem loop part. To minimize the gaps and multi-loop structures, we applied the condition by writing the Java script to retrieve the duplexes greater than or equal to (≥) 15 base pair in length and separated by less than or equal to (≤) 40 nucleotides, this would narrow the result to 12821. Since the miRNA length varies [31], we have collected the sequences of 60-120 nucleotides length which come to 8603. The minimal folding free energy (MFE) plays an important role to determine the secondary structure of RNA. We identified the secondary structure of RNA by using the program called RNA fold, this software works based on algorithm called Vienna RNA package [32]. Individual miRNA secondary structures were calculated by using MFE which is available in RNA fold software. miRNA sequences have lower folding free energy than that of shuffled sequences, this characteristic of miRNA allows the formation of stable secondary structure [33]. However MFE values also depend on length of the RNA. The free energy range considered in this study would be less than or equal to -20 to -40 kcal/mol. The following requirements should satisfy to select precursor miRNA candidates: (1) RNA sequence fold with appropriate structure to form hair pin like secondary structure; (2) the mature miRNA sequence is present in any one of the arm; (3) not more than five mismatches between predicted mature miRNA and the opposite miRNA* sequence in the hairpin structure; (4) no breakage or loop in mature miRNA and miRNA* sequences; (5) at least 16 base pair should be present in miRNA stem loop structure. Manually, we analyzed each miRNA candidate to satisfy the above criteria and were retained. We further narrowed down the miRNA candidates by filtering the GC content. The overall GC content of N. fowleri genome is 37%, in our analyses we retained GC content of 30-60% which reduced the data to 2201. We ran BlastX 2.2.30 in order to remove the protein coding gene in N. fowleri which narrow down the data to 1468. Then, according to miPred software we characterized the pre-miRNA candidates to identify the real miRNA and to remove pseudo miRNA and not real precursor miRNA with the confidence greater than and equal to 70%. After performing miPred software out of 1468 only 288 miRNA were retained. However, few miRNA copies in N. fowleri genome sequences found to be repeated, two to four copies present in same contigs ID, are eliminated. Remaining miRNA data were further analyzed to remove repeated elements in the sequences by using the tool called RepeatMasker. The software searches for repetitive sequence by aligning input genomic sequence against repbase by performing alignment program with cross_match as search engine [28]. Dataset decreases to 246 miRNA candidates. Finally, MatureBayes tool (http://mirna.imbb.forth.gr/MatureBayes.html) is used to identify mature miRNA sequence in miRNA precursor [29]. Out of 246 miRNAs one miRNA was not able form mature sequence which reduced the data to 245 novel miRNAs.

Characterization of N. fowleri miRNA:

Previous reports have shown that some of the features of premiRNA help us to identify conserved and non-conserved miRNAs; miRNA are conserved during the evolution from worm to human based on this study. We collected all the published mature miRNAs from miRBase [34] by performing BlastN search using BioEdit software [35] with lower e-value (0.001). Out of 245 five miRNAs were conserved. Gap in the alignment is due to insertion or deletion of the nucleotides. Alignments were performed using both the strand of 3' stem and 5' stem mature miRNA/miRNA* of all the putative novel miRNA sequences. Hits were collected based on the alignment having fewer than three mismatches between the sequences. Separated alignments are studied for the mature sequences in 5' and 3' stem (Figure 2). In our study, we were able to identify some of the features of precursor miRNA, such as: (a) MFE, range from -20 to -40.90 kcal/mol, respectively Table 2 (Available with authors) (b) GC content (c) minimal free energy of the thermodynamics ensemble (d) adjusted minimal free energy (AMFE). AMFE is calculated by (MEF/Length of RNA sequence) ×100; minimal free energy index (MFEI) calculated by AMFE/ (G+C) % all these features are calculated for each miRNA candidates. MFEI is an important criterion to distinguish miRNA from other types of RNA (coding and noncoding). For majority of precursor miRNAs MFEI was more than 0.85 with an average of 0.97 for other types of RNA like tRNA (0.64), rRNA (0.59) and mRNA (0.65) [36]. Along with these, we also calculated the percentage of individual nucleotides A%, G%, C% and U% in the precursor miRNA candidates (Table 2 – available with authors). In miRNA the percentage of G and C is less than the A and U [36]. Previous studies, shows that U is the predominant nucleotide presented at first position of 5' in majority of mature miRNAs [26] which follows the same for N. fowleri. Out of 245 novel miRNA 44.25% have U nucleotide in first position, only 33.2%, 14.6% and 7.75% have first position of nucleotide at A, C and G, respectively. These suggest that position of nucleotide in miRNA play an important role in identifying the mature sequences and binding the site in target mRNA.
Figure 2

Sequences alignment of novel miRNA of N. fowleri and the mature miRNA from miRBase: A) sequence form 5׳ stem of miRNA; B) sequence form 3׳ stem of miRNA; prd-Panagrellus redivivus, oan-Ornithorhynchus anatinus, ath-Arabidopsis thaliana, ssc-Sus scrofa, dre- Danio rerio, ssa-Salmo salar, eca-Equus caballus.

Target prediction:

Prediction of miRNA in plants and animals follows similar features such as thermodynamics stability of pre-miRNA, hairpin loop structures, MFE, number of mismatches, bulge sizes, etc. Several computational approaches have been used to predict and to identify the miRNA and their targets in both plants and animals [16–18, 21&25]. miRNA regulates the mRNA by forming complementary binding site. From the 5' end of miRNA the nucleotides position 2-8 is called “seed region”. This region is the most important region to form complementary binding site to the target transcript mRNA [37]. However, in plants the binding between miRNA and its target mRNA happens at a perfect or very near perfect complementary binding site [38]. Whereas in most of the animals few nucleotides are unbound which makes complementary binding site of miRNA to its target less tight [15, 19]. This happen due to the presence of more mismatches and wobbles bases (i.e. G-U) between the complementary binding sites. In our study to identify the target in N. fowleri, we used miRanda software version 3.3a. A set of parameters was followed as described in the Methodology. From the resulted outputs, we achieved only 30 target genes after screening the condition which is described in method. The identified target genes play a vital role in various biological activities especially in mitochondria as a component of the respiratory chain, oxidation-reduction process, electron transport chain and apoptosis (Table 1 in supplementary material). All these activities play a key role in cellular growth and development.

Target Gene annotation:

To understand the miRNA target genes in N. fowleri, individual target genes sequences were analyzed using AmiGo Software version 1.8 (http://amigo1.geneontology.org/cgi-bin/amigo/go.cgi). The predicted result showed that most of the target genes are involved in oxidation-reduction process, dehydrogenase activity, electron transport chain in mitochondria etc., this suggests that most of the target genes involved in mitochondria biological processes Table 3 (see supplementary material).

Conclusion

N. fowleri is a free living amoeba that acts as a human pathogen causing PAM. Therefore, it is of interest to understand its molecular pathogenesis using miRNA. Hence, we report 245 predicted miRNAs from N. fowleri. Out of that five miRNAs show high homology with mature sequences in the miRBase. Predicted miRNA sequences shows that U is the predominant nucleotide present in the precursor and mature sequences at the 5׳ end; this is one of the features of miRNA. Gene annotation shows that target gene functions are mainly involved in mitochondrial regulation. This data provides insights to design experimental approach for understanding regulatory mechanism.
  38 in total

1.  Evidence that microRNA precursors, unlike other non-coding RNAs, have lower folding free energies than random sequences.

Authors:  Eric Bonnet; Jan Wuyts; Pierre Rouzé; Yves Van de Peer
Journal:  Bioinformatics       Date:  2004-06-24       Impact factor: 6.937

Review 2.  The functions of animal microRNAs.

Authors:  Victor Ambros
Journal:  Nature       Date:  2004-09-16       Impact factor: 49.962

Review 3.  RNA interference in protozoan parasites.

Authors:  Elisabetta Ullu; Christian Tschudi; Tirtha Chakraborty
Journal:  Cell Microbiol       Date:  2004-06       Impact factor: 3.715

Review 4.  A computational view of microRNAs and their targets.

Authors:  James R Brown; Philippe Sanseau
Journal:  Drug Discov Today       Date:  2005-04-15       Impact factor: 7.851

5.  The effects of some factors on the growth and morphology of Naegleria sp. and three strains of the genus Acanthamoeba.

Authors:  V Kadlec
Journal:  Folia Parasitol (Praha)       Date:  1975       Impact factor: 2.122

6.  Development of a high- versus low-pathogenicity model of the free-living amoeba Naegleria fowleri.

Authors:  Denise C Burri; Bruno Gottstein; Béatrice Zumkehr; Andrew Hemphill; Nadia Schürch; Matthias Wittwer; Norbert Müller
Journal:  Microbiology       Date:  2012-08-09       Impact factor: 2.777

7.  A complex system of small RNAs in the unicellular green alga Chlamydomonas reinhardtii.

Authors:  Tao Zhao; Guanglin Li; Shijun Mi; Shan Li; Gregory J Hannon; Xiu-Jie Wang; Yijun Qi
Journal:  Genes Dev       Date:  2007-04-30       Impact factor: 11.361

8.  Primary amebic meningoencephalitis deaths associated with sinus irrigation using contaminated tap water.

Authors:  Jonathan S Yoder; Susanne Straif-Bourgeois; Sharon L Roy; Thomas A Moore; Govinda S Visvesvara; Raoult C Ratard; Vincent R Hill; Jon D Wilson; Andrea J Linscott; Ron Crager; Natalia A Kozak; Rama Sriram; Jothikumar Narayanan; Bonnie Mull; Amy M Kahler; Chandra Schneeberger; Alexandre J da Silva; Mahendra Poudel; Katherine L Baumgarten; Lihua Xiao; Michael J Beach
Journal:  Clin Infect Dis       Date:  2012-08-22       Impact factor: 9.079

Review 9.  Pathogenic and opportunistic free-living amoebae: Acanthamoeba spp., Balamuthia mandrillaris, Naegleria fowleri, and Sappinia diploidea.

Authors:  Govinda S Visvesvara; Hercules Moura; Frederick L Schuster
Journal:  FEMS Immunol Med Microbiol       Date:  2007-04-11

10.  Principles of microRNA-target recognition.

Authors:  Julius Brennecke; Alexander Stark; Robert B Russell; Stephen M Cohen
Journal:  PLoS Biol       Date:  2005-03       Impact factor: 8.029

View more
  2 in total

1.  A novel regulatory network among LncRpa, CircRar1, MiR-671 and apoptotic genes promotes lead-induced neuronal cell apoptosis.

Authors:  Aruo Nan; Lijian Chen; Nan Zhang; Zhenzhong Liu; Ti Yang; Zhishan Wang; Chengfeng Yang; Yiguo Jiang
Journal:  Arch Toxicol       Date:  2016-09-07       Impact factor: 5.153

Review 2.  Parasite-derived microRNAs as a diagnostic biomarker: potential roles, characteristics, and limitations.

Authors:  Vahid Raissi; Mohammad Zibaei; Omid Raiesi; Zahra Samani; Mohammad Yarahmadi; Soudabeh Etemadi; Afrida Istiqomah; Zahra Alizadeh; Shahrzad Shadabi; Nasrin Sohrabi; Asmaa Ibrahim
Journal:  J Parasit Dis       Date:  2021-04-22
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.