| Literature DB >> 26989613 |
Umberto Rosani1, Alberto Pallavicini2, Paola Venier1.
Abstract
Small non-coding RNAs include powerful regulators of gene expression, transposon mobility and virus activity. Among the various categories, mature microRNAs (miRNAs) guide the translational repression and decay of several targeted mRNAs. The biogenesis of miRNAs depends on few gene products, essentially conserved from basal to higher metazoans, whose protein domains allow specific interactions with dsRNA. Here, we report the identification of key genes responsible of the miRNA biogenesis in 32 bivalves, with particular attention to the aquaculture species Mytilus galloprovincialis and Crassostrea gigas. In detail, we have identified and phylogenetically compared eight evolutionary conserved proteins: DROSHA, DGCR8, EXP5, RAN, DICER TARBP2, AGO and PIWI. In mussels, we recognized several other proteins participating in the miRNA biogenesis or in the subsequent RNA silencing. According to digital expression analysis, these genes display low and not inducible expression levels in adult mussels and oysters whereas they are considerably expressed during development. As miRNAs play an important role also in the antiviral responses, knowledge on their production and regulative effects can shed light on essential molecular processes and provide new hints for disease prevention in bivalves.Entities:
Keywords: Bivalves; Crassostrea gigas; Mytilus galloprovincialis; RNAi; miRNA biogenesis
Year: 2016 PMID: 26989613 PMCID: PMC4793324 DOI: 10.7717/peerj.1763
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Organisms included in the present work.
Phylum, organism name, sequence origin and reference, ID used in phylogenetic trees and identified sequences are reported. Protostomia (green), Deuterostomia (orange) and novel protein sequences (numbers in bold) are well discernible.
| Phylum | Species | Sequence origin | Ref | Used ID | DROSHA | DGCR8 | XPO5 | RAN | DICER | TARBP2 | AGO or PIWI |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ctenophora | G | EM | Ple_bac | No | No | 1 | No | 1 | No | 4 | |
| G | EM | Mne_ lei | No | No | 1 | 1 | 1 | No | 4 | ||
| Porifera | G | EM | Aq | No | 1 | 1 | 1 | 2 | No | 2 | |
| Placozoa | G | EM | Tri_ adh | No | No | 1 | 1 | 2 | No | 1 | |
| Cnidaria | G | EM | Nvec | 1 | 1 | 1 | 1 | 2 | No | 4 | |
| T | TSA | Por_aus | 1 | 1 | 1 | 1 | 1 | No | 6 | ||
| T | TSA | Ant_ ele | 1 | 1 | 1 | 1 | 1 | No | 6 | ||
| Nematoda | G | M | Ce | 1 | 1 | 1 | 1 | 1 | 23 | ||
| Arthropoda | G | EM | Dap_ pul | 1 | 1 | 1 | 1 | 2 | 1 | 10 | |
| G | EM | Cq | 1 | 1 | 1 | 1 | 2 | 1 | 4 | ||
| G | EM | Dm | 1 | 1 | 1 | 1 | 2 | 1 | 4 | ||
| G | M | Nv | 1 | 1 | 1 | 1 | 2 | 1 | 4 | ||
| G | M | Tc | 1 | 1 | 1 | 1 | 2 | 1 | 4 | ||
| G | EM | Am | 1 | 1 | 1 | 1 | 2 | 1 | 4 | ||
| G | K | La | 1 | 1 | 1 | 1 | 2 | 1 | 4 | ||
| G | A | Ap | 1 | 1 | 1 | 1 | 2 | 1 | 15 | ||
| Platyhelmintes | G | GD | Sch_ man | 1 | 1 | 2 | 1 | 2 | 1 | 3 | |
| G | SG | Sch_med | 1 | 1 | 1 | 2 | 1 | 4 | |||
| Rotifera | G | V | Adi_vag | 1 | 1 | 1 | 1 | 1 | 1 | 4 | |
| Brachiopoda | G | L | Lin_ ana | 1 | 1 | 1 | 1 | 1 | 3 | ||
| Annelida | G | EM | Ct | 1 | 1 | 1 | 1 | 1 | 1 | 3 | |
| G | EM | Hel_ rob | 1 | 1 | 1 | 1 | 1 | 1 | 4 | ||
| Cephalopoda | G | M | Oct_ bim | 1 | 1 | 1 | 1 | 1 | 1 | 4 | |
| Mollusca | G | B | Ac | 1 | 1 | 1 | 3 | 1 | 1 | 4 | |
| G | M | Lg | 1 | 1 | 1 | 1 | 1 | 1 | 3 | ||
| T | Local | Mg | |||||||||
| T | Local | Me | |||||||||
| T | Local | Mc | |||||||||
| T | Local | Mt | |||||||||
| T | Local | At | |||||||||
| T | Local | Tg | |||||||||
| T | Local | Ba | |||||||||
| T | Local | Pv | |||||||||
| T | Local | Et | |||||||||
| T | Local | Cc | |||||||||
| G | EM | Cg | 1 | 1 | 1 | 1 | 1 | 1 | 4 | ||
| T | Local | Ch | |||||||||
| T | Local | Cv | |||||||||
| T | local | Ca | |||||||||
| T | Local | Oc | |||||||||
| T | Local | Oe | |||||||||
| T | local | Ol | |||||||||
| T | Local | Os | |||||||||
| T | Local | Sg | |||||||||
| T | Local | Ai | |||||||||
| T | Local | My | |||||||||
| T | Local | Pm | |||||||||
| G | F | Pf | 1 | 1 | 1 | 1 | |||||
| T | Local | Sv | |||||||||
| T | Local | Ec | |||||||||
| T | Local | Pg | |||||||||
| T | Local | Ut | |||||||||
| T | Local | Vl | |||||||||
| T | local | Cf | |||||||||
| T | local | Mm | |||||||||
| T | local | Rd | |||||||||
| T | local | Rp | |||||||||
| Echinodermata | G | M | Sp | 1 | 1 | 1 | 1 | 1 | 1 | 3 | |
| Hemichordata | G | M | Sk | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Chordata | G | M | Hs | 1 | 1 | 1 | 1 | 1 | 1 | 8 | |
| G | M | Ci | 1 | 1 | 1 | 1 | 1 | 1 | 3 | ||
| G | M | Bf | 1 | 1 | 1 | 1 | 1 | 1 | 7 | ||
| G | O | Om | 1 | 1 | 1 | 1 | 1 | 1 | 5 | ||
| G | M | Dr | 1 | 1 | 1 | 1 | 1 | 1 | 5 | ||
| Streptophyta | G | P | At | No | No | 1 | 1 | 4 | No | 10 | |
| G | P | Pt | No | No | 1 | 1 | 5 | No | 11 |
Notes.
Aphidbase
broadinstitute.org/ftp/pub/assemblies/invertebrates/aplysia/
Ensambl Metazoa v.29
Takeuchi et al. (2012) DNA Res. 19(2): 117–130;
Genome
GeneDB
Kocher et al. (2013) Genome Biology 14 (12): R142
Lou et al. (2015) Nat Commun. 8; 6:8301
Metazome v3.0
Berthelot et al. (2014) Nat Commun. 22; 5: 3657
Phytozome 11
SmedGD
Transcriptome
NCBI Transcriptome shotgun assembly
Genoscope
Key proteins of the miRNA biogenesis with their structural domains.
| Process step | Protein name | Domains |
|---|---|---|
| Microprocessor complex | DROSHA | 2× RIBO III + 1× DSRM |
| DGCR8 | 1× WW + 2× DSRM | |
| Moving to cytoplasm | XPO5 | 1× XPO1 |
| RAN | 1× RAN | |
| RISC loading complex | DICER | PDB + Helicase + DICER + PAZ + 2× RIBO + DSRM |
| TARBP2 | 3× DSRM | |
| Final miRNA maturation | AGO | DUF + PAZ + PIWI |
| PIWI-like | PAZ + PIWI |
miRNA biogenesis proteins of Mytilus galloprovincialis.
Protein name, GenBank ID, transcript (bp) and protein length (aa), identified domains and annotation (first hit, e-value and percentage of similarity) are reported.
| Protein name | GenBank ID | Transcript length (bp) | Protein length (aa) | Identified domain(s) | Annotation (first hit) | Similarity (%) | ||
|---|---|---|---|---|---|---|---|---|
| Key miRNA biogenesis proteins | MgDROSHA |
| 4,384 | 1,377 | 2× RIBO III + 1× DSRM | Ribonuclease 3-like ( | 0 | 67 |
| MgDGCR8 |
| 2,483 | 728 | 1× WW + 2× DSRM | Microprocessor complex subunit DGCR8-like ( | 0 | 50 | |
| MgXPO5 |
| 3,875 | 1,201 | XPO1 | Exportin-5-like ( | 0 | 55 | |
| MgRAN |
| 1,113 | 214 | RAN | GTP-binding nuclear protein Ran ( | −143 | 93 | |
| MgDICER |
| 6,013 | 1,850 | PDB + Helicase + DICER + PAZ + 2× RIBO + DSRM | Endoribonuclease Dicer-like ( | 0 | 58 | |
| MgTARBP2 |
| 7,583 | 321 | 3× DSRM | Probable RISC-loading complex subunit ( | −143 | 69 | |
| MgAGO |
| 3,337 | 892 | DUF + PAZ + PIWI | Protein argonaute-2-like ( | 0 | 84 | |
| MgPIWIa |
| 2,686 | 867 | PAZ + PIWI | Piwi-like protein 1 ( | 0 | 75 | |
| MgPIWIb |
| 3,603 | 948 | PAZ + PIWI | Piwi-like protein 2 (Hydra vulgaris) | 0 | 59 | |
| Other interacting proteins | MgGW182 |
| 3,825 | 1,274 | UBA + RRM | Trinucleotide repeat-containing gene 6C protein-like ( | 0 | 45 |
| MgCNOT1 |
| 5,373 | 1,791 | DUF3819 + NOT1 | CCR4-NOT transcription complex subunit 1-like ( | 0 | 69 | |
| MgCNOT2 |
| 864 | 288 | NOT2_ 3_ 5 | CCR4-NOT transcription complex subunit 2 (Pinctada fucata) | −156 | 82 | |
| MgCNOT3 |
| 2,142 | 714 | NOT3 + NOT2_ 3_ 5 | CCR4-NOT transcription complex subunit 3-like ( | 0 | 97 | |
| MgCNOT6 |
| 2,592 | 864 | Exo_ endo_ phos | Uncharacterized protein LOC105348954 isoform X1 ( | 0 | 71 | |
| MgCNOT7 |
| 897 | 299 | CAF1 | CCR4-NOT transcription complex subunit 7-like ( | 0 | 84 | |
| MgCNOT9 |
| 927 | 309 | RCD1 | Cell differentiation protein RCD1 homolog ( | 0 | 93 | |
| MgCNOT10 |
| 2,133 | 711 | TPR_ 1 | CCR4-NOT transcription complex subunit 10-like ( | 0 | 71 | |
| MgDDX5 |
| 1,740 | 538 | DEAD + Helic | ATP-dependent RNA helicase DDX5 ( | 0 | 75 | |
| MgDDX6 |
| 1,332 | 443 | DEAD + Helic | ATP-dependent RNA helicase me31b ( | 0 | 88 | |
| MgDDX20 |
| 1,836 | 612 | DEAD + Helic | ATP-dependent RNA helicase DDX20 ( | 0 | 77 | |
| MgDDX42 |
| 2,196 | 731 | DEAD + Helic | ATP-dependent RNA helicase DDX42 ( | 0 | 72 | |
| MgPABP |
| 1,881 | 627 | 4× RRM + PABP | polyadenylate-binding protein 4 (Hydra vulgaris) | 0 | 74 | |
| MgeIF4G |
| 5,019 | 1,672 | MIF4G + MA3 + W2 | eukaryotic translation initiation factor 4 gamma ( | 0 | 57 | |
| MgPAN2 |
| 3,606 | 1,202 | UCH_ 1 + RNase_T | PAB-dependent poly(A)-specific ribonuclease subunit PAN2 ( | 0 | 72 | |
| MgPAN3 |
| 2,334 | 778 | None | PAB-dependent poly(A)-specific ribonuclease subunit PAN3 ( | 0 | 67 | |
| MgDCP1 |
| 1,611 | 536 | DCP1 | mRNA-decapping enzyme 1A-like ( | −126 | 73 | |
| MgDCP2 |
| 1,313 | 385 | DCP2 + NUDIX | m7GpppN-mRNA hydrolase ( | −117 | 67 | |
| MgPRMT5 |
| 1,893 | 631 | PRMT5 | protein arginine N-methyltransferase 5-like ( | 0 | 72 | |
| MgTudor-11 |
| 2,682 | 894 | 4× SNc + TUDOR | Hypothetical protein mRNA ( | 0 | 73 | |
| MgMaelstrom |
| 1,321 | 404 | HMG + MAEL | Protein maelstrom ( | −155 | 62 |
Figure 1(A) Graphical reconstruction of mussel miRNA biogenesis process.
(Modified from Kapinas & Delany, 2011). (B) Conserved domains of the mussel miRNA complements.
Figure 2Mussel gene structures of DROSHA (A), DGCR8 (B), EXP5 (C), DICER (D) and TARBP2 (E).
Green boxes represent exons, length is reported as base pair scale.
Number of exons of five key miRNA biogenesis genes.
Metazome 3.0 and Ensembl Metazoa v.29 genome browsers were interrogated with the previously analyzed hits for each organism. La and Ap genomes were downloaded and analyzed locally. Mg gene structures were retrieved as described in Methods. In green are reported Protostomia; in orange Deuterostomia.
| Species | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ID | Hs | Ci | Bf | Dr | Sk | Sp | Nvec | Aq | Ce | Ct | Cq | Dm | Nv | Tc | Am | La | Ap | Lg | Cg | Mg |
| DROSHA | 27 | 24 | 29 | 17 | 24 | 20 | 13 | 14 | 6 | 28 | 3 | 3 | 11 | 9 | 13 | 23 | 1 | 23 | 30 | 23 |
| DGCR8 | 14 | 10 | 15 | 10 | 15 | 13 | 7 | No | 11 | 18 | 4 | 5 | 11 | 6 | 6 | 6 | 2 | 11 | 18 | 16 |
| XPO5 | 32 | 4 | 8 | 21 | 29 | 31 | 2 | 28 | No | 30 | 9 | 2 | 11 | 10 | 9 | 10 | 1 | 32 | 34 | 31 |
| DICER | 27 | 23 | 26 | 17 | 14 | 19 | 12 | 10 | 26 | 13 | 7 | 8 | 5 | 9 | 29 | 33 | 19 | 16 | 19 | 18 |
| TARBP2 | 9 | 1 | 6 | 9 | 6 | 2 | No | No | 11 | 8 | 4 | 5 | 7 | 7 | 7 | 6 | 7 | 6 | 7 | 6 |
Figure 3Phylogenetic relationships of four miRNA biogenesis proteins.
(A) DROSHA, (B) DGCR8, (C) DICER and (D) TARBP2. Inferred protein sequences were aligned using MUSCLE, conserved positions were extracted using Gblocks and subjected to MrBayes analysis.
Figure 4Phylogenetic relationships of Argonaute-like proteins.
Proteins were aligned using MUSCLE and tree was generated using Neighbor Joining algorithm with 1,000 bootstrap replicates. Plant proteins are highlighted in green, whereas C. elegans hits are reported in grey. Blue lines represent mollusk hits, red lines represent hits from basal metazoans.
Figure 5Digital expression analysis in oyster.
The expression of the 8 miRNA biogenesis genes were computed in tissue-specific RNA libraries and in RNA libraries from different developmental phases. (A) Expression values represented as percentage of El1α. (B) Cumulative TPM expression values of the 8 genes in the same samples.