| Literature DB >> 22799740 |
Md Shamimuzzaman1, Lila Vodkin.
Abstract
BACKGROUND: MicroRNAs (miRNAs) regulate the expression of target genes by mediating gene silencing in both plants and animals. The miRNA targets have been extensively investigated in Arabidopsis and rice using computational prediction, experimental validation by overexpression in transgenic plants, and by degradome or PARE (parallel analysis of RNA ends) sequencing. However, miRNA targets mostly remain unknown in soybean (Glycine max). More specifically miRNA mediated gene regulation at different seed developmental stages in soybean is largely unexplored. In order to dissect miRNA guided gene regulation in soybean developing seeds, we performed a transcriptome-wide experimental method using degradome sequencing to directly detect cleaved miRNA targets.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22799740 PMCID: PMC3410764 DOI: 10.1186/1471-2164-13-310
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Analysis of degradome reads from five different libraries matched to the soybean genome
| Cot25 | 27802207 | 25926963 | 25300772 | 24754628 |
| SC25 | 41395866 | 40296710 | 39308067 | 37957708 |
| Cot100 | 10473739 | 9088079 | 8902528 | 8646558 |
| SC100 | 26326261 | 25931744 | 25289389 | 24436953 |
| Cot300 | 44451740 | 41517379 | 40494429 | 40335132 |
aCot25 and SC25 are libraries from cotyledons and seed coats dissected from early-maturation green seed of 25–50 mg fresh wt.; Cot100 and SC100 are cotyledons and seed coats dissected from mid-maturation green seed of 100–200 mg fresh weight range; Cot300 are cotyledons dissected from late maturation yellow seeds of 300–400 mg weight range.
bGenome matched reads indicate the number of reads of 20 or 21 nt that matched to Glycine max genome sequence available in Phytozome. Transcriptome matched reads indicate the number of reads that matched to Glycine max transcriptome (Glyma models) available in Phytozome [23].
Figure 1Target plots (t-plots) of identified miRNA targets using degradome sequencing. Representative t-plots (a-d) are shown, one from each of four different libraries of the cotyledon (25–50 mg fresh weight range), seed coat (25–50 mg), cotyledon (100–200 mg) and seed coat (100–200 mg). Red arrows indicate signatures consistent with miRNA-directed cleavage. mRNA:miRNA alignments along with the detected cleavage frequencies (absolute numbers) are shown above the black arrow. The red colored italicized nucleotide on the target transcript from the 3’ end indicates the cleavage site detected in the degradome library.
Identified miRNA targets found in both soybean seed coat and cotyledon
| gma-miR156 | C100 | Glyma02g13370.1 | SBP domain containing protein | 1219 | CDS | 0 | 92.52237 | 0.0477 |
| | C300 | Glyma02g13370.1 | SBP domain containing protein | 1219 | CDS | 0 | 27.27151 | 0.0496 |
| | SC100 | Glyma02g30670.1 | SBP domain containing protein | 768 | CDS | 0 | 335.5574 | 0.0445 |
| | C100 | Glyma05g00200.1 | SBP domain containing protein | 1202 | CDS | 1 | 57.82648 | 0.015 |
| gma-miR159* | ||||||||
| | C300 | Glyma04g15150.1 | MYB family transcription factor | 853 | CDS | 0 | 178.5044 | 0.0091 |
| | C25 | Glyma06g47000.1 | Myb family transcription factor | 852 | CDS | 0 | 787.7315 | 0.0111 |
| | SC25 | Glyma06g47000.1 | Myb family transcription factor | 852 | CDS | 0 | 18.44158 | 0.0123 |
| | C300 | Glyma06g47000.1 | MYB family transcription factor | 852 | CDS | 0 | 178.5044 | 0.0091 |
| | SC100 | Glyma13g34710.1 | No functional annotation | 1161 | CDS | 2 | 106.3962 | 0.0391 |
| | C300 | Glyma13g34710.1 | No Functional Annotation | 1161 | CDS | 1 | 9.916913 | 0.0133 |
| | C25 | Glyma19g40720.1 | Anion Exchange protein | 2162 | CDS | 1 | 28.27754 | 0.0157 |
| gma-miR160# | C100 | Glyma10g35480.1 | Auxin response factor | 740 | CDS | 0 | 12166.69 | 0.0168 |
| | ||||||||
| | ||||||||
| | ||||||||
| | ||||||||
| | SC25 | Glyma04g43350.1 | Auxin response factor | 1337 | CDS | 0 | 1391.022 | 0.0379 |
| | C300 | Glyma04g43350.1 | Auxin response factor | 1337 | CDS | 0 | 401.635 | 0.0281 |
| | SC25 | Glyma10g06080.1 | Auxin response factor | 1355 | CDS | 0 | 821.9674 | 0.0379 |
| gma-miR164* | ||||||||
| | C100 | Glyma04g33270.1 | No apical meristem protein | 634 | CDS | 0 | 150.3488 | 0.0026 |
| | SC100 | Glyma04g33270.1 | No apical meristem protein | 634 | CDS | 0 | 425.585 | 0.0024 |
| | SC100 | Glyma05g00930.1 | No apical meristem protein | 751 | CDS | 2 | 425.585 | 0.0433 |
| gma-miR166 | SC100 | Glyma04g09000.1 | START domain containing protein | 93 | CDS | 3 | 16.36865 | 0.0478 |
| | SC100 | Glyma05g30000.1 | HD-ZIP transcription factor | 1041 | CDS | 3 | 8.184326 | 0.0478 |
| | C25 | Glyma05g30000.1 | HD-ZIP transcription factor | 1041 | CDS | 0 | 3389.265 | 0.0331 |
| | C300 | Glyma05g30000.1 | HD-ZIP transcription factor | 1041 | CDS | 0 | 4321.295 | 0.0271 |
| gma-miR167* | ||||||||
| | SC100 | Glyma02g18250.3 | Elongation Factor S-II | 90 | CDS | 1 | 3592.919 | 0.0168 |
| | C100 | Glyma02g40650.1 | Auxin response factor | 2924 | CDS | 0 | 855.8319 | 0.027 |
| | SC100 | Glyma02g40650.1 | Auxin response factor | 2924 | CDS | 0 | 1239.925 | 0.0252 |
| C25 | Glyma02g40650.1 | Auxin response factor | 2924 | CDS | 0 | 2536.899 | 0.0343 |
Complete tables can be found in Additional file 1.
CDS: Coding Sequence; UTR: Untranslated Region; TP100M: Transcripts per 100 million; C.site (Cleavage site): Nucleotide number from 5’ end of cDNA; P-value ≤0.05 using Cleaveland pipeline.
Cot25 and SC25 are libraries from cotyledons and seed coats dissected from early-maturation green seed of 25–50 mg fresh wt.; Cot100 and SC100 are cotyledons and seed coats dissected from mid-maturation green seed of 100–200 mg fresh weight range; Cot300 are cotyledons dissected from late maturation yellow seeds of 300–400 mg weight range.
* and bold are miRNAs whose targets shown in the target plots (t-plots) (Figure 1).
# and bold are miRNAs whose RLM-5’RACE validated targets are shown in the target plots (t-plots) (Figure 2).
Potential miRNA targets found only in cotyledons at different seed developmental stages
| gma-miR171 | C25 | Glyma15g01820.1 | Protein tyrosine kinase | 1359 | CDS | 1 | 24.2379 | 0.0013 |
| | C300 | Glyma12g08490.1 | Putative methyltransferase | 434 | CDS | 1 | 12.3961 | 0.0462 |
| gma-miR394 | C100 | Glyma06g13230.3 | Ferredoxin related protein | 1070 | CDS | 1 | 104.088 | 0.027 |
| | C100 | Glyma06g13230.2 | Ferredoxin related protein | 685 | CDS | 1 | 104.088 | 0.027 |
| | C100 | Glyma14g25810.1 | GTPase-activating protein | 1013 | CDS | 1 | 57.8265 | 0.0035 |
| | C100 | Glyma06g13230.1 | Ferredoxin related protein | 952 | CDS | 1 | 104.088 | 0.027 |
| gma-miR398 | C100 | Glyma03g40280.2 | Copper/Zinc superoxide dismutase | 156 | 5' UTR | 2 | 57.8265 | 0.0379 |
| | C100 | Glyma03g40280.3 | Copper/Zinc superoxide dismutase | 156 | 5' UTR | 2 | 57.8265 | 0.0379 |
| gma-miR1509 | C300 | Glyma18g03980.2 | No Functional Annotation | 2155 | 3' UTR | 0 | 7.43768 | 0.0041 |
| gma-miR1513 | C300 | Glyma08g27950.1 | F-box domain containing protein | 190 | CDS | 0 | 29.7507 | 0.0402 |
| gma-miR1514 | C25 | Glyma07g05360.2 | No apical meristem protein | 773 | CDS | 0 | 16.1586 | 0.0371 |
| gma-miR1515 | C300 | Glyma09g02920.1 | PAZ domain containing protein | 2750 | CDS | 0 | 57.0223 | 0.003 |
| gma-miR1531 | C100 | Glyma16g33400.1 | Serine protease inhibitor family | 536 | 3' UTR | 3 | 46.2612 | 0.0414 |
| gma-miR1532 | C25 | Glyma10g28900.1 | Universal stress protein family | 626 | 3' UTR | 4 | 4.03965 | 0.015 |
| gma-miR1535 | C25 | Glyma08g22920.1 | Ribose 5-phosphate isomerase | 168 | CDS | 1 | 60.5947 | 0.0242 |
| gma-miR2109 | C25 | Glyma03g14900.1 | LRR containing protein | 46 | CDS | 0 | 88.8723 | 0.0165 |
| gma-miR4357 | C300 | Glyma13g01500.1 | Alg9-like mannosyltransferase | 779 | CDS | 1 | 22.3131 | 0.0089 |
| | C300 | Glyma01g35530.1 | Transferase family protein | 522 | CDS | 1 | 9.91691 | 0.0114 |
| gma-miR4369 | C25 | Glyma19g43800.1 | Calcineurin-like phosphoesterase | 1257 | 3' UTR | 4 | 4.03965 | 0.0468 |
| gma-miR4371 | C300 | Glyma15g08400.4 | No Functional Annotation | 452 | 3' UTR | 0 | 17.3546 | 0.0017 |
| gma-miR4380 | C300 | Glyma06g10840.1 | MYB family transcription factor | 630 | CDS | 1 | 9.91691 | 0.0347 |
| gma-miR4387 | C300 | Glyma08g43670.1 | Uncharacterized conserved protein | 1501 | CDS | 0 | 4.95846 | 0.0068 |
| gma-miR4390 | C100 | Glyma15g06380.1 | Dynamin family protein | 287 | CDS | 3 | 104.088 | 0.0251 |
| gma-miR4398 | C25 | Glyma02g25150.1 | Integrase domain containing protein | 199 | 5' UTR | 4 | 4.03965 | 0.0444 |
| gma-miR4402 | C300 | Glyma02g18090.1 | Lectin domain containing protein | 783 | CDS | 4 | 2.47923 | 0.0293 |
| gma-miR4403 | C300 | Glyma19g36500.1 | No Functional Annotation | 1860 | 3' UTR | 4 | 2.47923 | 0.0484 |
| gma-miR4408 | C25 | Glyma16g34800.1 | No functional annotation | 434 | CDS | 1 | 68.674 | 0.0057 |
| gma-miR4409 | C25 | Glyma16g04060.3 | BTB/POZ domain containing protein | 1304 | 3' UTR | 0 | 12.1189 | 0.0144 |
| gma-miR4415 | C100 | Glyma02g20490.2 | Transferase family protein | 502 | CDS | 1 | 57.8265 | 0.0463 |
| | C100 | Glyma02g20490.1 | Transferase family protein | 502 | CDS | 1 | 57.8265 | 0.0463 |
| | C100 | Glyma15g12430.1 | Transferase family protein | 464 | CDS | 1 | 57.8265 | 0.0463 |
| gma-miR4416 | C100 | Glyma01g44970.1 | BTB/POZ domain containing protein | 434 | CDS | 0 | 161.914 | 0.0318 |
CDS: Coding Sequence; UTR: Untranslated Region; TP100M: Transcripts per 100 million; C.site (Cleavage site): Nucleotide number from 5’ end of cDNA; P-value ≤0.05 using Cleaveland pipeline.
Potential miRNAs and their targets found only in seed coats at different developmental stages
| gma-miR319 | SC100 | Glyma12g33640.1 | TCP family transcription factor | 740 | CDS | 0 | 106.396 | 0.0231 |
| | SC100 | Glyma15g09910.1 | TCP family transcription factor | 959 | CDS | 0 | 69.5668 | 0.0121 |
| gma-miR393 | SC100 | Glyma19g27280.1 | LRR containing protein | 2207 | CDS | 2 | 20.4608 | 0.032 |
| | SC100 | Glyma03g36770.1 | LRR containing protein | 1750 | CDS | 3 | 167.779 | 0.031 |
| | SC100 | Glyma16g05500.1 | LRR containing protein | 2279 | CDS | 3 | 20.4608 | 0.0132 |
| | SC100 | Glyma19g39420.1 | LRR containing protein | 1751 | CDS | 3 | 167.779 | 0.031 |
| gma-miR1508 | SC100 | Glyma13g35890.1 | EF-hand containing protein | 517 | 3' UTR | 1 | 8.18433 | 0.0385 |
| gma-miR1518 | SC100 | Glyma15g12180.1 | Ubiquitin-protein ligase | 273 | CDS | 1 | 16.3687 | 0.02 |
| gma-miR1523 | SC25 | Glyma20g02470.1 | LRR containg protein | 407 | CDS | 1 | 5.26902 | 0.0419 |
| gma-miR1526 | SC100 | Glyma08g09640.1 | No Functional Annotation | 424 | CDS | 0 | 163.687 | 0.048 |
| | SC100 | Glyma08g09640.2 | No Functional Annotation | 424 | CDS | 0 | 163.687 | 0.048 |
| gma-miR2119 | SC100 | Glyma14g24860.1 | Alcohol dehydrogenase | 99 | CDS | 2 | 2488.04 | 0.0462 |
| gma-miR4374 | SC100 | Glyma01g36660.1 | Ankyrin repeat containing protein | 2124 | 3' UTR | 1 | 8.18433 | 0.009 |
| gma-miR4377 | SC25 | Glyma06g04910.1 | Cyclin family protein | 910 | CDS | 4 | 2.63451 | 0.0297 |
| gma-miR4399 | SC100 | Glyma17g34370.1 | No functional annotation | 637 | 3' UTR | 4 | 4.09216 | 0.0491 |
| gma-miR4406 | SC25 | Glyma03g17300.1 | Glyoxal oxidase related protein | 1489 | CDS | 0 | 73.7663 | 0.0082 |
| gma-miR4407 | SC100 | Glyma08g15670.1 | Peptide transporter family protein | 1078 | CDS | 1 | 1076.24 | 0.0159 |
| SC100 | Glyma05g04810.1 | Peptide transporter family protein | 918 | CDS | 1 | 1076.24 | 0.0159 |
CDS: Coding Sequence; UTR: Untranslated Region; TP100M: Transcripts per 100 million; C.site (Cleavage site): Nucleotide number from 5’ end of cDNA; P-value ≤0.05 using Cleaveland pipeline.
* and bold is miRNA whose target is shown in the target plot (t-plot) (Figure 1).
Figure 2Validation of Auxin Response Factors (ARFs) regulated by gma-miR160 in cotyledon. Confirmed targets (a-d) for gma-miR160 are presented in the form of target plots (t-plot) and alignments. Absolute numbers of signature sequences are indicated in the t-plot. Red arrows indicate signatures consistent with miRNA-directed cleavage. The black arrows indicate a site verified by RLM 5’-RACE and detected cleavage frequencies (absolute numbers) are shown above the arrow. The cleavage site is shown as a red letter. Cleavage frequency as determined by gene-specific 5’-RACE at the indicated position is shown in parenthesis.
Figure 3GO analysis of miRNA target genes identified in cotyledons in different soybean seed developmental stages. Green bars indicate the enrichment of miRNA targets in GO terms. Blue bars indicate the percentage of total annotated soybean genes mapping to GO terms. Only the predicted target genes for miRNAs identified by degradome sequencing were considered.
Comparison of miRNA target annotations between the Williams and Heinong44 data sets
| Alcohol/Zinc-binding dehydrogenase | Polyubiquitin protein | AGO protein |
| Alg9-like mannosyltransferase | Plasmamembrane protein | Auxin Response Factor |
| Aluminium induced protein | Auxin signaling F-BOX protein | Copper/zinc superoxide dismutase |
| Anion Exchange protein | NADP+ | Growth Regulating Factor |
| Ankyrin repeat containing protein | MtN19-like protein | HD-ZIP Transcription Factor |
| Ataxin-2 domain containing protein | Serine-type endopeptidase | MYB family Transcription Factor |
| bHLH family protein | elongation factor | No Apical Meristem protein (NAC Family) |
| BRE Expressed protein | NSF attachment protein | SBP domain containing protein |
| BTB/POZ domain containing protein | Autophagy protein | TCP family transcription factor |
| Calcineurin-like phosphoesterase | embryo-related protein | Nuclear Factor-YA |
| Cellulose synthase | AP2 transcription factor | Zinc Finger Family protein |
| Cyclin family protein | heat shock cognate protein | |
| Dynamin family protein | expressed protein | |
| EF-hand containing protein | 60 S ribosomal protein | |
| Elongation Factor S-II | Disulfide isomerase | |
| F-box domain containing protein | FAD linked oxidase family protein | |
| Ferredoxin related protein | Auxin inducible transcription factor | |
| Glyoxal oxidase related protein | ribulose-1,5-bisphosphate carboxylase | |
| GTPase-activating protein | | |
| Integrase domain containing protein | | |
| lectin domain containing protein | | |
| LRR containing protein | | |
| mRNA capping enzyme | | |
| NADP/FAD oxidoreductase | | |
| NB-ARC domain containing protein | | |
| No Functional Annotation | | |
| PAZ domain containing protein | | |
| Peptide transporter family protein | | |
| Permease family protein | | |
| PPR repeat containing protein | | |
| Protein tyrosine kinase | | |
| Putative methyltransferase | | |
| Ras family protein | | |
| Ribose 5-phosphate isomerase | | |
| Serine protease inhibitor family | | |
| Serine-threonine protein kinase | | |
| START Domain containing protein | | |
| TIR domain containing protein | | |
| Transcription factor TFIID | | |
| Transferase family protein | | |
| Transmembrane protein 14 C | | |
| Transporter family protein | | |
| UBA domain containing protein | | |
| Ubiquitin-protein ligase | | |
| Uncharacterized conserved protein | | |
| Universal stress protein family |
Williams data sets from this study were generated using five degradome libraries of early, mid and late soybean seed developmental stage. The Heinong44 data set represented an earlier seed stage [21].