| Literature DB >> 16480513 |
Jana Hertel1, Manuela Lindemeyer, Kristin Missal, Claudia Fried, Andrea Tanzer, Christoph Flamm, Ivo L Hofacker, Peter F Stadler.
Abstract
BACKGROUND: MicroRNAs have been identified as crucial regulators in both animals and plants. Here we report on a comprehensive comparative study of all known miRNA families in animals. We expand the MicroRNA Registry 6.0 by more than 1000 new homologs of miRNA precursors whose expression has been verified in at least one species. Using this uniform data basis we analyze their evolutionary history in terms of individual gene phylogenies and in terms of preservation of genomic nearness across species. This allows us to reliably identify microRNA clusters that are derived from a common transcript.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16480513 PMCID: PMC1388199 DOI: 10.1186/1471-2164-7-25
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary statistics of the dataset used in this study. MicroRNA genes detected by homology search relative to the contents of the MR 6.0.
| Genome | MR 6.0 | known | new | all |
| hsa | 227 | 215+12 | 23 | 238 |
| ptr | - | 0 | 183 | 183 |
| cfa | 6 | 6 | 195 | 201 |
| bta | - | 0 | 138 | 138 |
| mmu | 230 | 215+17 | 26 | 241 |
| rno | 191 | 180+6 | 39 | 219 |
| mdo | - | 0 | 139 | 139 |
| gga | 122 | 122 | 17 | 139 |
| xla/xtr | (7) | (7) | 126 | 133 |
| tru | - | 0 | 171 | 171 |
| tni | - | 0 | 179 | 179 |
| ola | - | 0 | 152 | 152 |
| dre | 33 | 60 | 205 | 265 |
| spu | - | 0 | 40 | 40 |
| cin | - | 0 | 6 | 6 |
| csa | - | 0 | 3 | 3 |
| odi | - | 0 | 5 | 5 |
| dme | 78 | 78 | 0 | 78 |
| dps | 73 | 72 | 0 | 72 |
| dya | - | 0 | 74 | 74 |
| dan | - | 0 | 64 | 64 |
| dvi | - | 0 | 67 | 67 |
| dmo | - | 0 | 69 | 69 |
| aga | 38 | 42 | 10 | 52 |
| tca | - | 0 | 24 | 24 |
| ame | 25 | 26 | 12 | 38 |
| bmo | - | 0 | 17 | 17 |
| cel | 116 | 117 | 2 | 119 |
| cbr | 79 | 82 | 3 | 85 |
| sma | - | 0 | 4 | 4 |
| Σ | 1222 | 1993 | 3215 | |
The set of "known" microRNAs differs in some cases from MR 6.0 because some database entries could not be mapped to the current genome assembly, or mapped to more than one genomic locus. The mir-134 cluster is excluded from this list (its known members are indicated separately for human, mouse and rat in the MR 6.0 column). The last column ("all") provides the statistics for the data set provided in the electronic supplement, the column "new" lists all those pre-miRNA sequences that were detected by homology search and are contained in MR 6.0. For Xenopus 7 microRNAs were reported for Xenopus laevis, a close relative of the sequenced Xenopus tropicalis.
Species abbreviations.
Mammals: hsa, Hs: Homo sapiens; ptr, Pt: Pan troglodytes; cfa, Cf: Canis familiaris; bta, Bt: Bos taurus; mmu, Mm: Mus musculus; rno, Rn: Rattus norvegicus; mdo, Md: Monodelphis domesticus; other tetrapods: gga, Gg: Gallus gallus; xla: Xenopus laevis; xtr, Xt: Xenopus tropicalis; teleost fishes: tru, Tr: Takifugu rubripes; tni, Tn: Tetraodon nigroviridis; dre, Dr: Danio rerio; basal deuterostomes: spu, Sp: Strongylocentrotus purpuratus; cin, Ci: Ciona intestinalis; csa, Cs: Ciona savignyii; odi, Od: Oikopleura dioica; insects: dme, Dm: Drosophila melanogaster, dps, Dp: Drosophila pseudoobscura, dya, Dy: Drosophila yakuba, dan, Da: Drosophila ananassae, dvi, Dv: Drosophila viridis, dmo, Do: Drosophila mohavensis, aga, Ag: Anopheles gambiae, tca, Tc: Tribolium castaneum, ame, Am: Apis mellifera, bmo, Bm: Bombyx mori, nematods: cel, Ce: Caenorhabditis elegans, cbr, Cb: Caenorhabditis briggsae, platyhelmint: sma, Sm: Schistosoma mansoni.
Figure 7(a) All microRNAs in the mir-134 cluster appear to have arisen from a common ancestral sequence. The individual paralog groups have diverged rapidly in the ancestor of extant eutherian. Surprisingly, there is very little sequence variation between human and rodents in each of the paralog groups. The six families of alignable microRNAs are indicated in color. (b) WPGMA dendrogram derived from pairwise z-scores of the members of the mir-35 cluster. The analysis of the mature sequences demonstrates that the members of the cluster probably have arisen by means of tandem duplications.
Figure 3Examples of microRNA gene duplication histories. (a) Gene tree and most plausible reconstructed history of the mir9 cluster. The fourth member of the cluster, mir-306, evolves rapidly in flies. Its homology with mir-9/mir-79 is likely but this hairpin might also have evolved de novo. (b) The two most plausible reconstructions for the history of the mir-23 cluster. Scenario (1) postulates four paralogs in the ancestral vertebrate, where, presumably after the first duplication, one lineage either lost or gained mir-27 in the middle position of the cluster. Subsequently, in this scenario one copy of the three-membered cluster was lost in actinopterygians, while the two-membered clusters were lost in tetrapoda. Scenario (2) postulates three paralogs in the ancestral vertebrate and the independent loss of the mir-27 in two distinct clusters in the teleosts. (c) Duplication history of the mir-130 cluster reconstructed from genomic position information and the gene tree.
Figure 4Clustalw multiple sequence alignment of mir-421 homologs on the mammalian X chromosome. Additional features (top down): mfe: minimum free energy structure calculated using RNAfold -d2 -noLP, part. func: partition function fold, L2/LINE: direction and position of L2 elements relative to mir-421, mat miRNA: position of mature miRNA, conservat.: conserved positions in sequence alignment.
Figure 5RNA secondary structures of human (a) and zebrafish (b) mir-220 sequences. Calculations were performed using RNAfold -p -d2 -noLP.
Figure 6Some microRNA families, such as the mir-10 and mir-100 (left), exhibit very similar mature miRNA sequences, while their precursor sequences show little sequence similarity. Right: A table of alignment z-score for both mature and precursor sequences summarizes the four most likely candidates for distance homologies. While the mir-8/mir-429 pair is most likely a true homolog, the other three pairs are unconvincing, see text.
Vertebrate microRNA clusters. The table lists the maximal number of microRNAs in a single copy of the cluster ("Members"), the maximal number of non-homologous microRNAs in a single copy ("Families"), and the maximal number of paralogous cluster copies in any of the investigated genomes.
| Cluster | Members | Families | Paralogs |
| 3 | 3 | 18 | |
| 2 | 2 | 4 | |
| 4 | 2 | 5 | |
| 9 | 6 | 3 | |
| 4 | 3 | 7 | |
| 2 | 2 | 1 | |
| 2 | 1 | 5 | |
| 6 | 3 | 9 | |
| 3 | 3 | 6 | |
| 3 | 2 | 8 | |
| 2 | 1 | 3 | |
| 2 | 2 | 3 | |
| 7 | 7 | 1 | |
| 3 | 3 | 2 | |
| 2 | 2 | 5 | |
| 4 | 4 | 1 | |
| 3 | 3 | 1 | |
| 2 | 2 | 1 | |
| 4 | 4 | 1 | |
| 2 | 2 | 1 | |
| 2 | 1 | 1 | |
| 3 | 3 | 2 | |
| 3 | 1 | 1 | |
| 2 | 1 | * 2 | |
| 2 | 2 | 5 | |
| 2 | 1 | 2 | |
| >50 | 6 | * 1 | |
| 2 | 1 | * 2 | |
| 2 | 2 | 1 | |
| 2 | 1 | 8 | |
| 2 | 2 | * 1 | |
| 2 | 2 | 2 | |
| 2 | 1 | 1 | |
| 2 | 1 | 3 | |
| 2 | 1 | 2 | |
| 2 | 1 | 4 | |
| 2 | 1 | 1 | |
| 2 | 1 | 1 | |
| 2 | 2 | 1 | |
| 2 | 1 | 1 | |
| 6 | 1 | 6 | |
| 2 | 1 | 2 | |
| 5 | 2 | 5 | |
| 4 | 4 | 1 | |
| 3 | 1 | 1 | |
| 2 | 2 | 2 | |
| 3 | 2 | 1 | |
| 3 | 1 | 1 |
* part of the human mir-134 cluster experimentally investigated in [33]. In the same study it is reported that mir-144 and mir-224 are also parts of clusters with additional microRNAs that do not have orthologs in the MR 6.0.