| Literature DB >> 31602478 |
Zhonglong Guo1,2, Zheng Kuang1,2, Ying Wang1,2, Yongxin Zhao1, Yihan Tao2, Chen Cheng1, Jing Yang1,3, Xiayang Lu1,3, Chen Hao2, Tianxin Wang2, Xiaoyan Cao3, Jianhua Wei1, Lei Li2, Xiaozeng Yang1.
Abstract
MicroRNAs (miRNAs) are small non-coding RNA molecules that function as diverse endogenous gene regulators at the post-transcriptional level. In the past two decades, as research effort on miRNA identification, function and evolution has soared, so has the demand for miRNA databases. However, the current plant miRNA databases suffer from several typical drawbacks, including a lack of entries for many important species, uneven annotation standards across different species, abundant questionable entries, and limited annotation. To address these issues, we developed a knowledge-based database called Plant miRNA Encyclopedia (PmiREN, http://www.pmiren.com/), which was based on uniform processing of sequenced small RNA libraries using miRDeep-P2, followed by manual curation using newly updated plant miRNA identification criteria, and comprehensive annotation. PmiREN currently contains 16,422 high confidence novel miRNA loci in 88 plant species and 3,966 retrieved from miRBase. For every miRNA entry, information on precursor sequence, precursor secondary structure, expression pattern, clusters and synteny in the genome, potential targets supported by Parallel Analysis of RNA Ends (PARE) sequencing, and references is attached whenever possible. PmiREN is hierarchically accessible and has eight built-in search engines. We believe PmiREN is useful for plant miRNA cataloguing and data mining, therefore a resource for data-driven miRNA research in plants.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31602478 PMCID: PMC6943064 DOI: 10.1093/nar/gkz894
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.A data processing workflow of PmiREN and the comparison with miRBase. (A) The data processing workflow and outcomes of PmiREN. (B) The comparison of plant species between PmiREN and miRBase (21). I, species lack of both genome references and sRNA-Seq datasets. II, species with available genome references but without available sRNA-Seq datasets. (C) The comparison of miRNA entries between PmiREN and miRBase (21). I, entries retrieved with identical annotation. II, entries corrected by name or genomic coordinates. III, entries whose mature/star miRNAs re-annotated. IV, entries unable to be mapped to genome references and discarded in PmiREN. V, entries filtered by newly updated miRNA criteria and abandoned in PmiREN. (D) The comparison of miRNA annotation integrity between miRBase (left) (21) and PmiREN (right). 30 most conserved plant miRNA families were selected, and a box is highlighted by blue if at least one member of a given miRNA family was annotated.
Figure 2.Unique features of PmiREN. (A) An example of miRNA expression patterns. Expression values (RPM) of all miRNAs in Arabidopsis were achieved from 39 sRNA-Seq datasets from different tissues including flower, inflorescence, leaf, root, seeding and silique. The log10 values of RPM means of each miRNA in a specific tissue was employed to make heat map analysis. The hierarchical clustering was generated by Pearson correlation coefficients of these values. Gradient colors from red to blue indicate expression values from high to low. (B) A possible evolutionary model for the dynamic divergence of miR156/157/529 superfamily in land plants. Weblogo (45) on the left side shows the sequence similarity and difference of miR156, miR157 and miR529 while the cartoon on the right side presents their existence in different evolutionary branches. Red Stars on the bottom indicate the variations of superfamily core region. (C) An examples of miRNA syntenic analyses. Syntenic miRNA pairs of 3 species in grass family including Oryza sativa (rice), Brachypodium distachyon and Oryza nivara were displayed by Circos diagram (40). (D) Examples of predicted miRNA targets and their validations by PARE-Seq datasets. Two examples including one conserved miRNA isoform (miR157a/b/c/d) and one non-conserved one (miRN3078) from tomato (Solanum lycopersicum) were displayed. The complimentary between miRNAs and targets was predicted by both RNAhybrid (34) and psRNATarget (33). Validations from two tissues (fruit and leaf) by PARE-Seq data were achieved by CleaveLand4 (36).
Figure 3.A schematic view of PmiREN features.