| Literature DB >> 24135875 |
Zhidong Yuan1, Hongde Liu, Yumin Nie, Suping Ding, Mingli Yan, Shuhua Tan, Yuanchang Jin, Xiao Sun.
Abstract
Current technologies that are used for genome-wide microRNA (miRNA) prediction are mainly based on BLAST tool. They often produce a large number of false positives. Here, we describe an effective approach for identifying orthologous pre-miRNAs in several primates based on syntenic information. Some of them have been validated by small RNA high throughput sequencing data. This approach uses the synteny information and experimentally validated miRNAs of human, and incorporates currently available algorithms and tools to identify the pre-miRNAs in five other primates. First, we identified 929 potential pre-miRNAs in the marmoset in which miRNAs have not yet been reported. Then, we predicted the miRNAs in other primates, and we successfully re-identified most of the published miRNAs and found 721, 979, 650 and 639 new potential pre-miRNAs in chimpanzee, gorilla, orangutan and rhesus macaque, respectively. Furthermore, the miRNA transcriptome in the four primates have been re-analyzed and some novel predicted miRNAs have been supported by the small RNA sequencing data. Finally, we analyzed the potential functions of those validated miRNAs and explored the regulatory elements and transcription factors of some validated miRNA genes of interest. The results show that our approach can effectively identify novel miRNAs and some miRNAs that supported by small RNA sequencing data maybe play roles in the nervous system.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24135875 PMCID: PMC3821645 DOI: 10.3390/ijms141020820
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
The number of pre-miRNAs predicted by our method and new pre-miRNAs identified from small RNA sequencing data. The potential pre-miRNAs were identified by MiPred program. ND: No data; UA: The small RNA sequencing data is unavailable.
| Primates | miRBase 19 (Known miRNA Genes) | Candidate miRNA Genes | Supported by Small RNA Sequencing Data | New miRNA Genes Identified from Deep Sequencing Data | |
|---|---|---|---|---|---|
|
| |||||
| Re-identified miRNA Genes | Novel miRNA Genes | ||||
| Human | 1600 | ND | ND | ND | ND |
| Chimpanzee | 655 | 601 | 721 | 56 | 234 |
| Gorilla | 322 | 301 | 979 | 25 | 237 |
| Orangutan | 633 | 590 | 650 | 16 | 393 |
| Rhesus macaque | 535 | 480 | 639 | 29 | 230 |
| Marmoset | 0 | 0 | 929 | UA | UA |
Figure 1The Venn diagrams of miRNAs identified by using a comparative genomics approach and small RNA sequencing data. (A) Chimpanzee; (B) Gorilla; (C) Orangutan; (D) Rhesus macaque. Predicted: miRNAs which were predicted by using a comparative genomics approach. Sequencing: miRNAs which were identified from small RNA sequencing data.
Some validated miRNAs and their host genes in chimpanzee and orangutan. The validated miRNAs which targets are potentially involved in nervous system are located in protein-coding genes. Moreover, the number that follows “intron_” in this table means the miRNA location in which intron of the corresponding protein-coding gene. These coordinates of protein-coding genes and miRNAs of chimpanzee were derived from panTro4 assembly instead of panTro3. Moreover, all the sequence information of chimpanzee in this study was derived from panTro3 assembly if not explained specially.
| Chromosome | Start | End | Gene Symbol | Intron | Strand | Chromosome | Start | End | pre-miRNA | Strand |
|---|---|---|---|---|---|---|---|---|---|---|
| chr1 | 196871470 | 196873147 | PHC2 | intron_11 | + | chr1 | 196873044 | 196873102 | ppy-mir-3605 | + |
| chr7 | 130301326 | 130318076 | EXOC4 | intron_3 | + | chr7 | 130303121 | 130303184 | ppy-mir-4419a | + |
| chr19 | 51404019 | 51404127 | PTOV1 | intron_2 | + | chr19 | 51404069 | 51404128 | ppy-mir-4706 | + |
| chr20 | 13523168 | 13526970 | POLR3F | intron_2 | + | chr20 | 13524745 | 13524802 | ppy-mir-3192 | + |
| chr17_random | 2435896 | 2459049 | SKA2 | intron_1 | + | chr17_random | 2439995 | 2440051 | ppy-mir-301a | + |
| chr3 | 49052547 | 49062118 | SPINK8 | intron_2 | − | chr3 | 49059002 | 49059061 | ptr-mir-2115 | − |
| chr4 | 176472868 | 176516173 | GALNT7 | intron_2 | + | chr4 | 176492588 | 176492646 | ptr-mir-548t | + |
| chr11 | 65715927 | 65719260 | NDUFS8 | intron_6 | + | chr11 | 65716414 | 65716476 | ptr-mir-4691 | + |
| chr13 | 102152614 | 102687690 | FGF14 | intron_4 | − | chr13 | 102247593 | 102247653 | ptr-mir-2681 | − |
| chr15 | 63169799 | 63174675 | DENND4A | intron_21 | − | chr15 | 63171059 | 63171120 | ptr-mir-4511 | − |
| chr21 | 26267629 | 26354246 | DSCAM | intron_22 | − | chr21 | 26290766 | 26290830 | ptr-mir-4760 | − |
| chr22 | 44543820 | 44616196 | ATXN10 | intron_9 | + | chr22 | 44567304 | 44567367 | ptr-mir-4762 | + |
Figure 2The flow chart used to identify novel miRNAs by using a comparative genomics approach and small RNA sequencing data. WGA: whole genome alignment.