| Literature DB >> 23762306 |
Irena Godnic1, Minja Zorc, Dasa Jevsinek Skok, George Adrian Calin, Simon Horvat, Peter Dovc, Milena Kovac, Tanja Kunej.
Abstract
MicroRNAs (miRNAs) are non-coding RNAs (ncRNAs) involved in regulation of gene expression. Intragenic miRNAs, especially those exhibiting a high degree of evolutionary conservation, have been shown to be coordinately regulated and/or expressed with their host genes, either with synergistic or antagonistic correlation patterns. However, the degree of cross-species conservation of miRNA/host gene co-location is not known and co-expression information is incomplete and fragmented among several studies. Using the genomic resources (miRBase and Ensembl) we performed a genome-wide in silico screening (GWISS) for miRNA/host gene pairs in three well-annotated vertebrate species: human, mouse, and chicken. Approximately half of currently annotated miRNA genes resided within host genes: 53.0% (849/1,600) in human, 48.8% (418/855) in mouse, and 42.0% (210/499) in chicken, which we present in a central publicly available Catalog of intragenic miRNAs (http://www.integratomics-time.com/miR-host/catalog). The miRNA genes resided within either protein-coding or ncRNA genes, which include long intergenic ncRNAs (lincRNAs) and small nucleolar RNAs (snoRNAs). Twenty-seven miRNA genes were found to be located within the same host genes in all three species and the data integration from literature and databases showed that most (26/27) have been found to be co-expressed. Particularly interesting are miRNA genes located within genes encoding for miRNA silencing machinery (DGCR8, DICER1, and SND1 in human and Cnot3, Gdcr8, Eif4e, Tnrc6b, and Xpo5 in mouse). We furthermore discuss a potential for phenotype misattribution of miRNA host gene polymorphism or gene modification studies due to possible collateral effects on miRNAs hosted within them. In conclusion, the catalog of intragenic miRNAs and identified 27 miRNA/host gene pairs with cross-species conserved co-location, co-expression, and potential co-regulation, provide excellent candidates for further functional annotation of intragenic miRNAs in health and disease.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23762306 PMCID: PMC3675212 DOI: 10.1371/journal.pone.0065165
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Workflow of the study.
GEA – Gene Expression Atlas.
Figure 2Diagram of genomic distribution of miRNA genes in human, mouse, and chicken.
* - microRNA genes overlapping protein-coding and ncRNA genes; mixed - microRNA genes overlapping intron, exon or UTR, depending on overlapping host gene transcripts. For details see online table: http://www.integratomics-time.com/miR-host/catalog.
Figure 3Examples of co-location of miRNA genes with protein-coding and ncRNA genes.
A) Protein-coding gene HTR2C with four resident miRNA genes, two of which form a cluster. B) A miRNA gene cluster located within lincRNA gene FTX. C) MicroRNA gene hsa-mir-10a located within two overlapping protein-coding genes. D) Overlapping miRNA gene (hsa-mir-664b) comprising a miR-seed-SNP, and snoRNA gene (SNORA36A) residing within protein-coding DKC1. E) Gene DGCR8, associated with miRNA biogenesis, hosts two miRNA genes, one of which comprises a miR-seed-SNP.
Figure 4Cross talk of miRNA-related genomic elements.
Overlapping miRNA genes (hsa-mir-3618 and mir-1306, mir-3173, and mir-593), miRNA polymorphisms (miR-seed-SNPs (rs12159555 and rs73721294), host genes encoding for miRNA processing machinery components (DGCR8, DICER1, and SND1), miRNA target sites within host genes, and miRNAs targeting other host genes. Arrow with solid line: experimentally validated miRNA targets; arrow with dashed line: predicted miRNA targets.
Twenty-seven miRNA/host gene pairs with conserved co-location in human, mouse, and chicken.
| Human | Mouse | Chicken | ||||||
| miRNA gene | location within host gene | host gene (synonym) | miRNA gene | location within host gene | host gene (synonym) | miRNA gene | location within host gene | host gene (synonym) |
|
| intron 2, 3 |
|
| intron 2 |
|
| intron 2 |
|
|
| intron 4, 7, 8 |
|
| intron 8 |
|
| intron 8 |
|
|
| intron 2, 5 |
|
| intron 5 |
|
| intron 4, 5 |
|
|
| intron 4, 5 |
|
| intron 5 |
|
| intron 5 |
|
|
| intron 5–7 |
|
| intron 1, 6, 7, exon 4 |
|
| intron 7 |
|
|
| intron 7, 15, 18 |
|
| intron 19 |
|
| intron 18 |
|
|
| intron 3, 6, 17, 18 |
|
| intron 14–17, 19 |
|
| intron 16 |
|
|
| 5′-UTR |
|
| exon 1–3 |
|
| 5′-UTR |
|
|
| intron 3, 6, 7, 9, 14, 16 |
|
| intron 16 |
|
| intron 16, 17 |
|
|
| intron 1–5 |
|
| intron 5 |
|
| intron 4 |
|
|
| intron 1–5 |
|
| intron 5 |
|
| intron 4 |
|
|
| intron 22, 27, 51, 53 |
|
| intron 53 |
|
| intron 54 |
|
|
| intron 4–7 |
|
| intron 1, 2, 4–6 |
|
| intron 5 |
|
|
| intron 14–16 |
|
| intron 1, 8, 14–16 |
|
| intron 13, 15 |
|
|
| intron 4, 14 |
|
| intron 14 |
|
| intron 1, 9 |
|
|
| intron 4–6, 14, 15 |
|
| intron 3, 15 |
|
| intron 15 |
|
|
| intron 4–6, 14,15, exon 7 |
|
| intron 3, 15 |
|
| 3′-UTR |
|
|
| intron 2, 4, 5 |
|
| intron 4, 5 |
|
| intron 5 |
|
|
| intron 4–6, 14, 15 |
|
| intron 3, 15 |
|
| intron 15 |
|
|
| intron 1 |
|
| intron 1 |
|
| intron 1 |
|
|
| intron 1–6, 10 |
|
| intron 3–5 |
|
| intron 4 |
|
|
| intron 1–6, 10 |
|
| intron 3–5 |
|
| intron 4 |
|
|
| intron 8, 12, 14 |
|
| intron 8, 12, 14 |
|
| intron 23 |
|
|
| intron 1, 2, 9, 10, 16, 18, 19 |
|
| intron 16 |
|
| intron 13 |
|
|
| intron 5–7, 10 |
|
| intron 7, 10 |
|
| intron 18 |
|
|
| intron 20 |
|
| intron 19 |
|
| intron 18, 19 |
|
|
| intron 1, 3, 15, 16 |
|
| intron 2, 3, 5, 7, 11, 14, 15, 17 |
|
| intron 15 |
|
Host gene names: ARPP21: cAMP-regulated phosphoprotein, 21kDa; COL27A1: collagen, type XXVII, alpha 1; CTDSPL: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small phosphatase-like; C9orf3: chromosome 9 open reading frame 3; C9orf5: chromosome 9 open reading frame 5; EGFL7: EGF-like-domain, multiple 7; Fam33a: spindle and kinetochore associated complex subunit 2; DGCR8: DiGeorge syndrome critical region gene 8; HNRNPK: heterogeneous nuclear ribonucleoprotein K; MYH7B: myosin, heavy chain 7B, cardiac muscle, beta; NFYC: nuclear transcription factor Y, gamma; PANK1: pantothenate kinase 1; PANK3: pantothenate kinase 3; R3HDM1: R3H domain containing 1; RCL1: RNA terminal phosphate cyclase-like 1; SKA2: spindle and kinetochore associated complex subunit 2; SLIT2: slit homolog 2 (Drosophila); SLIT3: slit homolog 3 (Drosophila); SMC4: structural maintenance of chromosomes 4; SREBF2: sterol regulatory element binding transcription factor 2; TLN2: talin 2; TMEM245: transmembrane protein 245; TRPM1: transient receptor potential cation channel, subfamily M, member 1; WDR82: WD repeat domain 82; WWP2: WW domain containing E3 ubiquitin protein ligase 2.
Host genes for epigenetically silenced miRNA genes in cancer.
| miRNA gene | location within host gene | host gene | study describing epigenetic regulation of host gene |
|
| |||
|
| exon 5 |
| / |
|
| intron 4–6 |
| / |
|
| intron 1, 5′-UTR |
| / |
| 5′-UTR |
| Zhend et al., 2009 | |
|
| intron 1 |
| Kron et al., 2010 |
|
| intron 1, 2 |
| / |
|
| intron 5–7 |
| Saito et al., 2009 |
|
| intron 1–3 |
| / |
|
| intron 3, 6, 7, 9, 13, 14, 16 |
| / |
|
| intron 1, 2 |
| / |
|
| intron 1, 2 |
| Tsuruta et al., 2011 |
|
| intron 3 |
| / |
|
| intron 1 |
| He et al., 2011 |
|
| intron 1 |
| Bennett et al., 2009 |
|
| exon 1–3 |
| Bandyopadhyay et al., 2012 |
| intron 1 |
| / | |
|
| 3′-UTR |
| / |
|
| intron 3–7 |
| / |
|
| intron 4–6, 14, 15 |
| / |
|
| intron 4, 5, 14, 15, exon 7 |
| / |
|
| intron 2, 4, 8, 12, 13 |
| / |
|
| intron 4–6, 14, 15 |
| / |
|
| intron 2–4 |
| Grady et al., 2008 |
|
| intron 1 |
| He et al., 2011 |
|
| intron 4, 5 |
| Anderton et al., 2008 |
|
| intron 2, 3, 5 |
| Dejeux et al., 2009 |
| intron 5 |
| / | |
|
| intron 14–16 |
| / |
|
| intron 3 |
| / |
|
| intron 1–3 |
| / |
|
| intron 1 |
| / |
| intron 1 |
| Issa, 2009 | |
|
| intron 1–5 |
| / |
|
| intron 1, 2 |
| / |
|
| |||
|
| exon 1, 3, 4 |
| / |
|
| intron 1 |
| / |
|
| exon 3 |
| / |
|
| exon 3, intron 3 |
| / |
|
| intron 1 |
| / |
|
| exon 2, 4, intron 2, 3 |
| / |
|
| exon 3, intron 3 |
| / |
|
| intron 3 |
| / |
|
| intron 1 |
| / |
|
| intron 5 |
| / |
|
| exon 3, 4, intron 2, 3 |
| / |
|
| intron 1 |
| / |
|
| intron 1, 3, 5, 6 |
| / |
/− host gene not found to be regulated by DNA methylation in references.
- studies describing epigenetically regulated host gene and resident miRNA gene.
Host gene names: ATAD2: ATPase family, AAA domain containing 2; C1orf61: chromosome 1 open reading frame 61; C20orf166: chromosome 20 open reading frame 166; CLCN5: chloride channel, voltage-sensitive 5; COPZ1: coatomer protein complex, subunit zeta 1; COPZ2: coatomer protein complex, subunit zeta 2; DALRD3: DALR anticodon binding domain containing 3; EVL: Enah/Vasp-like; FSTL1: follistatin-like 1; HOXA9: homeobox A9; HOXB4: homeobox B4; HOXB7: homeobox B7; HOXC4: homeobox C4; HOXC5: homeobox C5; HOXD3: homeobox D3; HTR2C: 5-hydroxytryptamine (serotonin) receptor 2C, G protein-coupled; IGF2: insulin-like growth factor 2 (somatomedin A); INS-IGF2: INS-IGF2 readthrough; LINC00461: long intergenic non-protein coding RNA 461; LINC00478: long intergenic non-protein coding RNA 478; LINC00472: long intergenic non-protein coding RNA 472; MAP2K4: mitogen-activated protein kinase kinase 4; MCM7: minichromosome maintenance complex component 7; MIR137HG: mir-137 host gene (non-protein coding); MEG8: maternally expressed 8 (non-protein coding); MIR17HG: mir-17–92 cluster host gene (non-protein coding); MIR205HG: mir-205 host gene (non-protein coding); MIR31HG: mir-31 host gene (non-protein coding); MUC20: mucin 20, cell surface associated; PDE2A: phosphodiesterase 2A, cGMP-stimulated; PDE4D: phosphodiesterase 4D, cAMP-specific; TRPM3: transient receptor potential cation channel, subfamily M, member 3.