| Literature DB >> 25505842 |
Guilherme Loss-Morais1, Daniela C R Ferreira2, Rogério Margis3, Márcio Alves-Ferreira2, Régis L Corrêa4.
Abstract
As microRNAs (miRNAs) are important regulators of many biological processes, a series of small RNAomes from plants have been produced in the last decade. However, miRNA data from several groups of plants are still lacking, including some economically important crops. Here microRNAs from Coffea canephora leaves were profiled and 58 unique sequences belonging to 33 families were found, including two novel microRNAs that have never been described before in plants. Some of the microRNA sequences were also identified in Coffea arabica that, together with C. canephora, correspond to the two major sources of coffee production in the world. The targets of almost all miRNAs were also predicted on coffee expressed sequences. This is the first report of novel miRNAs in the genus Coffea, and also the first in the plant order Gentianales. The data obtained establishes the basis for the understanding of the complex miRNA-target network on those two important crops.Entities:
Keywords: coffee; illumina sequencing; microRNA profiling
Year: 2014 PMID: 25505842 PMCID: PMC4261967 DOI: 10.1590/S1415-47572014005000020
Source DB: PubMed Journal: Genet Mol Biol ISSN: 1415-4757 Impact factor: 1.771
Figure 1Abundance of the different classes of sequences found in the C. canephora sRNA library. Based on BLAST searchers, reads ranging from 16 to 26 nt were divided into six categories (indicated by colors). Numbers in the x-axis indicate their respective size in nucleotides.
Conserved microRNAs identified in C. canephora and C. arabica.
| Family | Acronym | miRNA sequence (5′-3′) | Size (nt) | Reads | Precursor | Precursor |
|---|---|---|---|---|---|---|
| 156 | miR157 | UUGACAGAAGAUAGAGAGCAC | 21 | 112 | nd | nd |
| 159 | miR159a | UUUGGAUUGAAGGGAGCUCUA | 21 | 44504 | nd | nd |
| miR159b | UUUGGAUUGAAGGGAGCUCUU | 21 | 148 | nd | nd | |
| miR159c | CUUGGAUUGAAGGGAGCUCUA | 21 | 28 | nd | nd | |
| miR159d | UUUGGACUGAAGGGAGCUCUA | 21 | 11 | nd | nd | |
| 160 | miR160 | UGCCUGGCUCCCUGUAUGCCA | 21 | 188 | nd | nd |
| 162 | miR162 | UCGAUAAACCUCUGCAUCCAG | 21 | 482 | nd | nd |
| 164 | miR164 | UGGAGAAGCAGGGCACGUGCA | 21 | 142 | nd | nd |
| 165 | miR165 | UCGGACCAGGCUUCAUCCCCC | 21 | 327 | nd | nd |
| 166 | miR166a | UCGGACCAGGCUUCAUUCCCC | 21 | 81200 | nd | nd |
| miR166b | UCUCGGACCAGGCUUCAUUCC | 21 | 41113 | nd | nd | |
| miR166c | CUCGGACCAGGCUUCAUUCCC | 21 | 168 | nd | nd | |
| miR166d | UCGGACCAGGCUUCAUUCCUC | 21 | 62 | nd | nd | |
| miR166e | UCGGACCAGGCUUCAUUCCCCC | 22 | 43 | nd | nd | |
| miR166f | UCGAACCAGGCUUCAUUCCCC | 21 | 22 | nd | nd | |
| miR166g | UCGGACCAGGCUUCAUUCCCU | 21 | 19 | nd | nd | |
| miR166h | UCGGACCACGCUUCAUUCCCC | 21 | 11 | nd | nd | |
| 167 | miR167a | UGAAGCUGCCAGCAUGAUCUGA | 22 | 17006 | nd | nd |
| miR167b | UGAAGCUGCCAGCAUGAUCUGG | 22 | 3553 | nd | GT007358.1 | |
| miR167c | UGAAGCUGCCAGCAUGAUCUA | 21 | 1829 | nd | nd | |
| miR167d | UGAAGCUGCCAGCAUGAUCUAA | 22 | 300 | nd | nd | |
| 168 | miR168a | UCGCUUGGUGCAGGUCGGGAA | 21 | 8427 | nd | nd |
| 169 | miR169a | CAGCCAAGGAUGACUUGCCGG | 21 | 5194 | nd | nd |
| miR169b | CAGCCAAGGAUGACUUGCCGA | 21 | 29 | nd | nd | |
| 171 | miR171a | UAUUGGCCUGGUUCACUCAGA | 21 | 2724 | nd | nd |
| miR171b | UGAUUGAGCCGUGCCAAUAUC | 21 | 232 | nd | nd | |
| miR171c | UUGAGCCGCGCCAAUAUCACU | 21 | 50 | nd | nd | |
| miR171d | UUGAGCCGUGCCAAUAUCACGA | 22 | 14 | nd | nd | |
| 172 | miR172a | GGAAUCUUGAUGAUGCUGCAU | 21 | 2431 | nd | nd |
| miR172b | AGAAUCUUGAUGAUGCUGCAU | 21 | 1027 | nd | CA00-XX-SH2-017-G01-EM_F | |
| 319 | miR319 | UUGGACUGAAGGGAGCUCCCU | 21 | 591 | nd | nd |
| 390 | miR390a | AAGCUCAGGAGGGAUAGCGCC | 21 | 198 | nd | CA00-XX-EA1-060-H07-EC_F |
| 394 | miR394 | UUGGCAUUCUGUCCACCUCC | 20 | 398 | nd | nd |
| 395 | miR395 | CUGAAGUGUUUGGGGGAACUC | 21 | 37 | nd | nd |
| 396 | miR396a | UUCCACAGCUUUCUUGAACUU | 21 | 21826 | nd | nd |
| miR396b | UUCCACAGCUUUCUUGAACUG | 21 | 7862 | Contig4530 | nd | |
| miR396c | UUCCAUAGCUUUCUUGAACUG | 21 | 161 | nd | nd | |
| 397 | miR397 | UCAUUGAGUGCAGCGUUGAUG | 21 | 20 | nd | nd |
| 398 | miR398a | UGUGUUCUCAGGUCACCCCUU | 21 | 3036 | nd | Contig15966 |
| miR398b | UGUGUUCUCAGGUCGCCCCUG | 21 | 172 | nd | nd | |
| 399 | miR399a | UGCCAAAGGAGAGUUGCCCUA | 21 | 443 | nd | nd |
| miR399b | UGCCAAAGGAGAAUUGCCCUG | 21 | 245 | nd | nd | |
| miR399c | UGCCAAAGGAGAUUUGCCCGG | 21 | 232 | nd | nd | |
| 403 | miR403 | UUAGAUUCACGCACAAACUCG | 21 | 430 | nd | nd |
| 408 | miR408 | UGCACUGCCUCUUCCCUGGCUG | 22 | 21 | nd | nd |
| 828 | miR828 | UCUUGCUCAAAUGAGUAUUCCA | 22 | 36 | nd | nd |
| 2111 | miR2111 | UAAUCUGCAUCCUGAGGUUUA | 21 | 539 | nd | nd |
Non-Conserved microRNAs identified in C. canephora and C. arabica.
| Family | Acronym | miRNA Sequence (5′- 3′) | Size | Reads | Precursor | Precursor |
|---|---|---|---|---|---|---|
| 159 | miR159e | AGCUCCUUGAAGUCCAAUAGA | 21 | 193 | nd | CA00-XX-CS1-080-F09-MC_F |
| 393 | miR393 | AUCAUGCUAUCCCUUUGGAUA | 21 | 13 | nd | CA00-XX-CS1-106-B03-EQ_F |
| 479 | miR479 | CGUGAUACUGGUUGCGGCUCAUA | 23 | 165 | CC00-XX-EC1-029-H11-EC.F | nd |
| 482 | miR482a | GGAAUGGGCUGCUAGGGAUGG | 21 | 4876 | Contig1415 | Contig9668 |
| miR482b | GGGGUGGGAGACUGGGGAAGA | 21 | 5430 | nd | Contig8555 | |
| miR482c | UUUCCCAGGCCUCCCAUGCCGG | 22 | 4630 | Contig3994 | CA00-XX-RX1-032-C04-EB_F | |
| 5225 | miR5225 | UCGCAGGAGAGAUGAAACCGAA | 22 | 348 | Contig170 | nd |
| 8558 | miR8558 | UUUCCAAUUCCGGGCAUGCCGA | 22 | 11365 | nd | Contig12082 |
| 8697 | miR8697 | AUGUAAUUGAUUCAAUUGAGG | 21 | 29 | nd | Contig12950 |
| Seq-1 | miR-Seq-1 | UUAUACACUAUUGGACUUGGAUGC | 24 | 17 | nd | CA00-XX-CS1-081-E04-MC_F |
| Seq-2 | miR-Seq-2 | AAUAUACUGAGAAAUGAGCCU | 21 | 20 | nd | 32274 |
nd – not determined.
Variants of known miRNAs.
Novel miRNAs.
Found in the RNA-seq data from Combes MC .
Isoforms of microRNAs found in C. canephora and C. arabica.
| Group | Acronym | Sequence (5′ - 3′) | Size (nt) | Reads |
|---|---|---|---|---|
| 2 | miR159e | AGCUCCUUGAAGUCCAAUAGA | 21 | 193 |
| miR159e_Iso1 | GAGCUCCUUGAAGUCCAAUAG | 21 | 28 | |
| miR159e_Iso2 | GAGCUCCUUGAAGUCCAAUA | 20 | 86 | |
| 1 | miR172b | GUAGCAUCAUCAAGAUUCACA | 21 | 2351 |
| miR172b_Iso1 | UAGCAUCAUCAAGAUUCACAU | 21 | 20 | |
| 1 | miR390b | CGCUAUCCAUCCUGAGUUUUA | 21 | 253 |
| miR390b_Iso1 | CGCUAUCCAUCCUGAGUUUU | 20 | 299 | |
| 2 | miR393 | AUCAUGCUAUCCCUUUGGAUA | 21 | 13 |
| miR393_Iso1 | GAUCAUGCUAUCCCUUUGGAU | 21 | 16 | |
| 1 | miR396b | UUCCACAGCUUUCUUGAACUG | 21 | 7862 |
| miR396b_Iso1 | UUCCACAGCUUUCUUGAACU | 20 | 2704 | |
| 2 | miR482a | GGAAUGGGCUGCUAGGGAUGG | 21 | 4876 |
| miR482a_Iso1 | GGAAUGGGCUGCUAGGGAUG | 20 | 1321 | |
| 2 | miR482b | GGGGUGGGAGACUGGGGAAGA | 21 | 5430 |
| miR482b_Iso1 | GGGGUGGGAGACUGGGGAAG | 20 | 2731 | |
| 2 | miR482c | UUUCCCAGGCCUCCCAUGCCGG | 22 | 4036 |
| miR482c_iso1 | UCCCAGGCCUCCCAUGCCGGUG | 22 | 148 | |
| miR482c_iso2 | CCCAGGCCUCCCAUGCCGGUG | 21 | 37 | |
| miR482c_iso3 | CCCAGGCCUCCCAUGCCGGUGA | 22 | 16 | |
| miR482c_iso4 | CCAGGCCUCCCAUGCCGGUGA | 21 | 15 | |
| miR482c_iso5 | CCAGGCCUCCCAUGCCGGUGAU | 22 | 15 | |
| 3 | miR5225 | UCGCAGGAGAGAUGAAACCGAA | 22 | 348 |
| miR5225_iso1 | UUCGCAGGAGAGAUGAAACCGA | 22 | 48 | |
| 2 | miR8558 | UUUCCAAUUCCGGGCAUGCCGA | 22 | 11365 |
| miR8558_Iso1 | UUUCCAAUUCCGGGCAUGCC | 20 | 19 | |
| 3 | miR-Seq-1 | UUAUACACUAUUGGACUUGGAUGC | 24 | 17 |
| miR-Seq-1_Iso1 | UAUGCUGAUAGUUUAUACACU | 21 | 35 | |
| miR-Seq-1_Iso2 | UACACUAUUGGACUUGGAUGCAUG | 24 | 21 |
Groups are defined by: (1) Conserved microRNAs found in miRBase database; (2) Novel microRNAs identified in the genus Coffea belonging to know families of microRNAs; (3) Novel families of microRNAs identified in the genus Coffea.
Figure 2Predicted precursor structures of miRNAs found in C. canephora expressed sequences. C. canephora sRNAs were aligned to available C. canephora contigs/ESTs or RNA-seq sequences and the structure predicted by M-Fold software. The guide and star sequences are highlighted in red and magenta, respectively.
Figure 3Predicted precursor structures of miRNAs found in C. arabica contigs/ESTs. C. canephora sRNAs were aligned to available C. arabica contigs/ESTs and the structure predicted by M-Fold software. The guide and star sequences are highlighted in red and magenta, respectively.
Targets of non-conserved microRNAs found in C. canephora and C. arabica.
| Acronym | Targeted contig | Score | Function of targeted contig | e-value | Species |
|---|---|---|---|---|---|
| miR159e | CC00-XX-PP1-022-H10-TL.F | 3 | medium chain reductase/dehydrogenases (MDR) family | 6.79e–16 | |
| 37514* | 3 | AGO6 | 5e–26 | ||
| miR393 | CA00-XX-SI3-095-E08-EM_F | 1.5 | Helicase MCM 2/3/5 protein | 8.77e-34 | |
| Contig5451 | 3 | Photosystem I psaA/psaB protein | 0e+00 | ||
| 23975* | 2.5 | transcription factor bZIP113 (BZIP113) | 6e–69 | ||
| miR479 | 31680* | 3 | serine/threonine-protein kinase-like protein | 2e–44 | |
| miR482a | Contig5317 | 2.5 | Calcium-dependent kinase-like | 1e–04 | |
| 35051* | 3 | DNA repair-recombination family protein | 4e–20 | ||
| miR482b | Contig1667 | 3 | nd | nd | |
| Contig2174 | 3 | Casein kinase II regulatory subunit | 1.38e–53 | ||
| Contig3591 | 3 | Malectin-like receptor kinase protein | 5.36e–95 | ||
| Contig4446 | 3 | Armadillo repeat-containing protein | 5e–58 | ||
| Contig4669 | 3 | nd | nd | ||
| Contig5426 | 3 | Casein kinase II regulatory subunit | 1.14e–119 | ||
| Contig8850 | 3 | nd | nd | ||
| 14996* | 3 | probable LRR receptor-like serine/threonine-protein kinase | 0.0 | ||
| miR482c | Contig2315 | 2.5 | alpha-crystallin-Hsps_p23-like superfamily | 3.31e–13 | |
| 13908* | 2.5 | Disease resistance protein RPM1 | 1e–84 | ||
| miR5225 | CC00-XX-EC1-024-H07-EC.F | 3 | F-Box protein | 2.96e–10 | |
| miR8558 | CC00-XX-EC1-026-A05-EC.F | 3 | AAA ATPase protein | 2.18e–03 | |
| Contig1997 | 3 | Aluminium induced protein, GATase super-family | 2.44e–142 | ||
| Contig7834 | 3 | EF-hand, calcium binding motif | 2.63e–05 | ||
| Contig13827 | 3 | Glutamine amidotransferases class-II (GATase) | 1.21e–137 | ||
| Contig1962 | 3 | EF-hand, calcium binding motif | 1.69e–04 | ||
| 3730* | 2.5 | E3 ubiquitin-protein ligase PUB23-like | 3e–161 | ||
| 8740* | 3 | stem-specific protein TSJT1-like | 0.0 | ||
| 15962* | 3 | Calcium-binding EF hand family protein | 6e–50 | ||
| miR8697 | CC00-XX-PP1-048-E12-TL.F | 2.5 | nd | nd | |
| Contig1147 | 3 | Nucleoside diphosphate kinase Group I (NDPk_I)-like | 5.64e–77 | ||
| CC00-XX-PP1-052-B07-TL.F | 3 | Drought induced 19 protein (Di19) | 3.55e–43 | ||
| Contig15912 | 2.5 | nd | nd | ||
| Contig3438 | 2.5 | Ubiquitin domain of GABA-receptor-associated protein | 5.41e–76 | ||
| Contig13716 | 2.5 | NADH dehydrogenase subunit | 1.95e–46 | ||
| Contig7657 | 3 | nd | nd | ||
| Contig10430 | 3 | Nucleoside diphosphate kinases (NDP kinases, NDPks) | 8.55e–35 | ||
| Contig13558 | 3 | Nucleoside diphosphate kinase Group I (NDPk_I)-like | 2.64e–79 | ||
| Contig15081 | 3 | Drought induced 19 protein (Di19) | 4.09e–67 | ||
| 21361* | 2 | Putative NBS domain resistance protein gene | 0.0 | ||
| 3895* | 2.5 | PhATG8b mRNA for autophagy 8b | 0.0 | ||
| 10009* | 3 | DEHYDRATION-INDUCED 19 protein | 4e–147 | ||
| 822* | 3 | NtNDPK mRNA for nucleoside diphosphate kinase | 6e–151 | ||
| miR-Seq-1 | CA00-XX-CA1-014-H09-EZ_F | 3 | Ubiquitin-like domain | 4.05e–31 | |
| miRSeq-2 | 3354* | 2.5 | Pyruvate dehydrogenase E1 component subunit beta-3 | 0.0 | |
| Contig4225 | 2.5 | Pyruvate dehydrogenase E1 component subunit beta-3 | 0.0 | ||
| CA00-XX-RM1-023-H03-UT_F | 2.5 | Catharanthus roseus secretory peroxidase | 6e–88 | ||
| Contig4993 | 2.5 | pyruvate dehydrogenase E1 component subunit beta-like | 1e–61 |
Given by the psRNA Target software;
Based on BLASTN/Pfam searches; nd – not determined;
Species from where the putative target was sequenced.