| Literature DB >> 24119094 |
Sebastian Baumgarten1, Till Bayer, Manuel Aranda, Yi Jin Liew, Adrian Carr, Gos Micklem, Christian R Voolstra.
Abstract
BACKGROUND: Animal and plant genomes produce numerous small RNAs (smRNAs) that regulate gene expression post-transcriptionally affecting metabolism, development, and epigenetic inheritance. In order to characterize the repertoire of endogenous smRNAs and potential gene targets in dinoflagellates, we conducted smRNA and mRNA expression profiling over 9 experimental treatments of cultures from Symbiodinium microadriaticum, a photosynthetic symbiont of scleractinian corals.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24119094 PMCID: PMC3853145 DOI: 10.1186/1471-2164-14-704
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Overview over smRNA and mRNA sequencing and assembly statistics
| | | | | | | | | | |
| Total base pairs | 476,999,446 | 327,143,288 | 185,735,770 | 254,824,059 | 427,785,834 | 220,539,256 | 258,016,695 | 133,416,325 | 361,512,933 |
| No. of reads (after trimming) | 17,193,220 | 12,593,192 | 8,433,048 | 10,658,877 | 15,378,342 | 8,725,667 | 10,662,995 | 6,131,326 | 13,591,577 |
| Mean read length | 28 | 26 | 22 | 24 | 28 | 25 | 24 | 22 | 27 |
| No. of reads (after smRNA filtering) | 16,404,546 | 12,102,929 | 8,234,625 | 10,180,915 | 14,747,100 | 8,399,643 | 10,259,502 | 6,025,546 | 13,017,083 |
| No. of unique reads | 2,288,514 | 2,198,132 | 1,633,046 | 2,338,445 | 2,352,632 | 1,705,659 | 1,964,913 | 1,405,173 | 1,881,585 |
| No. of miRNAs (miRDeep2) | 118 | 96 | 131 | 114 | 136 | 104 | 129 | 115 | 138 |
| Mean readcount of miRNAs | 149 | 418 | 300 | 199 | 257 | 170 | 253 | 171 | 446 |
| No. of miRNAs w/ miRNA* (miRDeep2) | 55 | 83 | 84 | 66 | 61 | 70 | 77 | 75 | 81 |
| No. of miRNAs (after manual inspection) | 21 | 21 | 21 | 21 | 21 | 21 | 21 | 21 | 21 |
| | | | | | | | | | |
| No. of read pairs (2 x 100 bp) | 26,119,370 | 34,506,164 | 41,918,850 | 30,712,175 | 39,211,121 | 34,918,126 | 28,787,997 | 36,782,164 | 29,985,278 |
| No. read pairs mapped | 16,777,195 | 23,084,001 | 27,761,160 | 19,905,837 | 24,920,330 | 23,277,719 | 19,357,409 | 24,141,222 | 20,023,519 |
| No. of 58,649 genes expressed (FPKM > 0) | 55,881 | 56,353 | 57,389 | 55,518 | 55,287 | 56,526 | 56,292 | 56,041 | 56,778 |
| No. of 58,649 genes expressed (FPKM > 5) | 40,078 | 39,125 | 39,246 | 37,256 | 37,435 | 39,198 | 38,041 | 38,422 | 40,206 |
| Median FPKM | 12.58 | 12.29 | 11.87 | 11.42 | 11.13 | 12.38 | 11.41 | 11.66 | 12.49 |
| BLASTX annotation [%] | 44.81 | 44.51 | 43.80 | 45.10 | 45.39 | 44.44 | 44.59 | 44.81 | 44.18 |
| Pfam [%] | 34.50 | 34.23 | 33.63 | 34.71 | 34.87 | 34.13 | 34.27 | 34.43 | 33.98 |
| GO annotation [%] | 34.90 | 34.67 | 34.08 | 35.14 | 35.32 | 34.59 | 34.71 | 34.89 | 34.39 |
| DEGs all (vs. noon) | 119 | 37 | 351 | 2,465 | 138 | 48 | 67 | 60 | - |
| DEGs up (vs. noon) | 47 | 12 | 105 | 293 | 21 | 14 | 22 | 28 | - |
| DEGs down (vs. noon) | 72 | 25 | 246 | 2,172 | 117 | 34 | 45 | 32 | - |
| log2 difference (minimum/maximum) | −8.30/+6.61 | −7.27/+5.27 | −6.84/+7.34 | −8.43/+8.17 | −10.43/+5.46 | −7.02/+6.24 | −7.32/+5.19 | −7.06/+7.64 | - |
| log2 difference (mean) | 2.93 | 3.15 | 2.29 | 2.79 | 2.73 | 3.27 | 2.99 | 3.01 | - |
DEG = Differentially Expressed Gene.
Figure 1Overview of smRNA and mRNA analysis workflow. Cultures of Symbiodinium microadriaticum were subjected to 9 experimental treatments (noon: 12 h/12 h day/night cycle, sampled at noon; 4°C: 4°C for 4 hours; 16°C: 16°C for 4 hours; 34°C: 34°C for 12 hours; 36°C: 36°C for 4 hours; 20 g: 20 g/L NaCl salt content for 4 hours; 60 g: 60 g/L NaCl salt content for 4 hours; DS (dark stress): 18 hour dark period; DC (dark cycle): 12 h/12 h day/night cycle, sampled at midnight). Noon was selected as the reference condition for differential expression analyses. A total of 137 million small RNA reads resulted in the prediction of 219 miRNAs in 9 experimental treatments with the software miRDeep2, yielding a set of 21 smRNAs after further quality filtering. miRNA target gene prediction yielded 1,720 animal- and 19 plant-like miRNA binding sites via bowtie software in the set of 12,858 3'UTRs and 51,917 genes, respectively. Annotated miRNA targets were subsequently tested for GO category enrichment. A total of 302 million paired-end (PE) reads were assembled to a final gene set of 58,649 genes ≥ 250 bp with the Oases software. smRNA and mRNA expression over 9 experimental treatments was quantified with the DESeq software. Expression estimates of 21 smRNAs and 19,893 GO-annotated genes were assessed for correlation over 9 experimental treatments, and smRNAs-mRNA expression pairs displaying a correlation > 0.8 or < −0.8 (Spearman Rank) were tested for GO category enrichment.
Figure 2smRNA identification in over 9 experimental treatments. (A) Lengths, read count distribution, and 5' identity of 5,125,940 distinct genome-matching small RNAs from the set of 137 million sequenced reads. All small RNA reads were mapped to a draft genome assembly via bowtie software. Genomic location is indicated in the pie chart. (B) miRDeep2 output for a miRNA precursor indicating the guide (red) and passenger (blue) strand as well as the hairpin loop (yellow). (C) miRDeep2 output for a siRNA precursor. Note the perfect complementarity between guide (red) and passenger (blue) strand as well as the hairpin loop (yellow).
Set of 21 smRNAs (8 miRNAs and 13 siRNAs) that were independently identified over 9 experimental treatments and matched all criteria for smRNA identification
| smb107.2 | CAAGGAUGGGAUGCUCAGAGAA | 22 | 88 | −69.3 | 13,315 | 398 (9) |
| smb107.3 | CAAGGAUGGGAUGGUCAGAGAA | 22 | 88 | −64.1 | 3,322 | 398 (9) |
| smb123 | CAGUCGGCCAAAGUGCUGGACC | 22 | 89 | −64.1 | 451 | 154 (9) |
| smb203 | CUUUGUAUCCCGGAUCCUGAUA | 22 | 87 | −46.6 | 1,087 | 389 (9) |
| smb215 | GAGGAUGCUGAUCAUUCACUGG | 22 | 87 | −80.6 | 85 | 34 (8) |
| smb295 | UCAGAGACCAGACGCAGAGGCU | 22 | 90 | −40.6 | 12,543 | 160 (9) |
| smb297 | UCAGUGGCAGAAGCUGGGAACU | 22 | 87 | −63.5 | 965 | 60 (8) |
| smb313 | UCGAACUUUCAGGAAUAGUAUC | 22 | 87 | −54.8 | 2,707 | 1475 (9) |
| smb21 | AAUUUGAACGUUGCCAUCUAUC | 22 | 87 | −72.5 | 123 | 9 (7) |
| smb41 | ACCUGCAGCAUUUGGCGCCUGA | 22 | 84 | −77.9 | 299 | 18 (7) |
| smb51 | ACUUAGAACUCUCCUACGAGGG | 22 | 88 | −83.1 | 510 | 275 (8) |
| smb79 | AGUUGGACCAGACCAGUUGGUC | 22 | 87 | −71.7 | 489 | 319 (9) |
| smb83 | AUCACUCCACAAAGGGAUUUG | 21 | 87 | −65.5 | 217 | 7 (5) |
| smb101 | CAACGAGAUUGGCCUUCUGUGC | 22 | 87 | −82.9 | 5,234 | 412 (9) |
| smb163 | CGGGACUCGAUUCGGAGGGUGC | 22 | 88 | −63.6 | 2,015 | 660 (9) |
| smb271 | UAGAAUGUAGUCGUCAUCUUGC | 22 | 88 | −68.9 | 1,044 | 29 (9) |
| smb303 | UCCGCCGUGCAACUGUCGCAAC | 22 | 88 | −80.2 | 207 | 107 (9) |
| smb359 | UGAUGUACAUCGAUUGAUCGAC | 22 | 86 | −63.8 | 646 | 23 (9) |
| smb365 | UGCCAACGUGAUUUGCAACUCC | 22 | 84 | −68.1 | 333 | 75 (7) |
| smb379 | UGGACUUGGAAAGCUUCUCUGC | 22 | 86 | −76.7 | 2,505 | 2 (2) |
| smb427 | UUUGUCCAGUGUACCUGCGCU | 21 | 85 | −73 | 728 | 50 (8) |
Read counts for guide and passenger strands were derived by pooling counts over conditions. The number of experimental treatments that identified the passenger strand is indicated in brackets. MFE = Minimum Free Energy of precursor.
Figure 3smRNA target prediction for 21 smRNAs within the gene set of . (A) 3 distinct animal-like target sites in the 3'UTR of genes exist that are characterized by seeds of lengths 6-8nt that display perfect complementary base pairing between the miRNA and mRNA sequence. Vertical dashes indicate Watson-Crick base pairing. The pie chart displays the relative frequency of these target sites in the 3'UTRs of 1,720 genes (from a set of 12,858 genes with available 3'UTRs). (B) 19 plant-like mRNA target sites were identified in the coding sequence and 3'UTRs of 51,917 genomic genes of Symbiodinium microadriaticum. Plant-like mRNA target sites are characterized by full-length complementary base pairing between a miRNA and its mRNA with only few mismatches (i.e. near-perfect). (C) Number of identified plant-like mRNA target sites (blue bars) in relation to number of mismatches allowed. Number of false positives in 1,000 randomly generated cohorts of small RNA sequences of length 22 nt (red bars) are displayed for comparison. A cutoff of 3 mismatches (mm) over the aligned smRNA and mRNA provides a False Positive Rate of about 1 in 5. (D) Enriched GO terms within the set of matching and annotated animal-like targets (n = 519, P < 0.05, 4,047 3' UTRs). (E) Enriched GO terms within the set of matching and annotated plant-like targets (n = 7, P < 0.05, 18,290 genes).
Summary of the transcriptome assembly
| No. of genes (≥ 250 bp) | 58,649 | |
| Mean transcript length bp | 1,324 | |
| BLASTX annotation | 25,288 | 43.12% |
| Pfam annotation | 16,446 | 28.04% |
| KEGG annotation | 9,914 | 16.90% |
Correlation between smRNAs and mRNAs (Spearman’s Rho > +0.8 or < −0.8) over 9 experimental treatments (21 smRNAs, 19,893 annotated mRNAs, 417,753 comparisons)
| | Negative correlation | Positive correlation | Total |
| smb123 | 235 | 363 | 598 |
| smb83 | 237 | 172 | 409 |
| smb359 | 134 | 188 | 322 |
| smb21 | 116 | 196 | 312 |
| smb107.3 | 208 | 91 | 299 |
| smb303 | 149 | 138 | 287 |
| smb101 | 189 | 76 | 265 |
| smb107.2 | 157 | 106 | 263 |
| smb215 | 104 | 108 | 212 |
| smb297 | 86 | 113 | 199 |
| smb427 | 114 | 74 | 188 |
| smb79 | 92 | 69 | 161 |
| smb51 | 70 | 85 | 155 |
| smb295 | 101 | 39 | 140 |
| smb163 | 47 | 71 | 118 |
| smb365 | 43 | 51 | 94 |
| smb271 | 31 | 60 | 91 |
| smb203 | 41 | 35 | 76 |
| smb313 | 17 | 50 | 67 |
| smb379 | 30 | 36 | 66 |
| smb41 | 34 | 32 | 66 |
| Total no. genes | 2,235 | 2,153 | 4,388 |
| Distinct no. genes | 1,673 | 1,602 | 3,502 |
Enrichment of GO terms of negatively and positively correlated smRNA-mRNA expression pairs to manually assorted higher order categories
| | ||
| | ||
| GO:0006457 | Protein folding | 0.000 |
| GO:0051082 | Unfolded protein binding | 0.002 |
| GO:0004252 | Serine-type endopeptidase activity | 0.013 |
| GO:0018106 | Peptidyl-histidine phosphorylation | 0.009 |
| GO:0016925 | Protein sumoylation | 0.005 |
| GO:0003755 | Peptidyl-prolyl cis-trans isomerase activity | 0.025 |
| GO:0036065 | Fucosylation | 0.034 |
| GO:0000413 | Protein peptidyl-prolyl isomerization | 0.021 |
| | ||
| GO:0019787 | Small conjugating protein ligase activity | 0.021 |
| GO:0006487 | Protein N-linked glycosylation | 0.045 |
| GO:0047485 | Protein N-terminus binding | 0.013 |
| GO:0070534 | Protein K63-linked ubiquitination | 0.011 |
| GO:0042787 | Protein ubiquitination in ubiquitin-dependent protein catabolic process | 0.021 |
| GO:0019773 | Proteasome core complex, alpha-subunit complex | 0.034 |
| GO:0004180 | Carboxypeptidase activity | 0.000 |
| | | |
| | ||
| GO:0016446 | Somatic hypermutation of immunoglobulin genes | 0.009 |
| | ||
| GO:0002666 | Positive regulation of T cell tolerance induction | 0.015 |
| GO:0042130 | Negative regulation of T cell proliferation | 0.024 |
| GO:0002698 | Negative regulation of immune effector process | 0.036 |
| | | |
| | ||
| GO:0023014 | Signal transduction by phosphorylation | 0.046 |
| GO:0009909 | Regulation of flower development | 0.034 |
| GO:0010019 | Chloroplast-nucleus signaling pathway | 0.011 |
| GO:0031930 | Mitochondria-nucleus signaling pathway | 0.011 |
| | ||
| GO:0031930 | Mitochondria-nucleus signaling pathway | 0.000 |
| | | |
| | ||
| GO:0032404 | Mismatch repair complex binding | 0.001 |
| GO:0006307 | DNA dealkylation involved in DNA repair | 0.034 |
| GO:0032300 | Mismatch repair complex | 0.005 |
| GO:0031072 | Heat shock protein binding | 0.005 |
| GO:0008630 | Intrinsic apoptotic signaling pathway in response to DNA damage | 0.038 |
| | ||
| GO:0008630 | Intrinsic apoptotic signaling pathway in response to DNA damage | 0.036 |
| GO:0006289 | Nucleotide-excision repair | 0.015 |
| | ||
| | ||
| GO:0035552 | Oxidative single-stranded DNA demethylation | 0.028 |
| GO:0030261 | Chromosome condensation | 0.004 |
| | ||
| GO:0031048 | Chromatin silencing by small RNA | 0.008 |
| | | |
| | ||
| GO:0006412 | Translation | 0.030 |
| GO:0003735 | Structural constituent of ribosome | 0.006 |
| GO:0022627 | Cytosolic small ribosomal subunit | 0.009 |
| GO:0042255 | Ribosome assembly | 0.013 |
| GO:0016071 | mRNA metabolic process | 0.050 |
| GO:0006364 | rRNA processing | 0.000 |
| | ||
| GO:0008353 | RNA polymerase II carboxy-terminal domain kinase activity | 0.032 |
| GO:0005689 | U12-type spliceosomal complex | 0.008 |
| GO:0015935 | Small ribosomal subunit | 0.000 |
| | | |
| | ||
| GO:0043734 | DNA-N1-methyladenine dioxygenase activity | 0.030 |
| GO:0051747 | Cytosine C-5 DNA demethylase activity | 0.030 |
| GO:0043462 | Regulation of ATPase activity | 0.017 |
| GO:0019213 | Deacetylase activity | 0.017 |
| GO:0016811 | Hydrolase activity | 0.045 |
| GO:0019478 | D-amino acid catabolic process | 0.001 |
| GO:0016308 | 1-phosphatidylinositol-4-phosphate 5-kinase activity | 0.050 |
| GO:0042264 | Peptidyl-aspartic acid hydroxylation | 0.035 |
| | ||
| GO:0006662 | Glycerol ether metabolic process | 0.008 |
| GO:0042775 | mitochondrial ATP synthesis coupled electron transport | 0.017 |
| GO:0001522 | Pseudouridine synthesis | 0.006 |
| GO:0003879 | ATP phosphoribosyltransferase activity | 0.047 |
| GO:0009982 | Pseudouridine synthase activity | 0.005 |
| GO:0005388 | Calcium-transporting ATPase activity | 0.024 |
| GO:0016847 | 1-aminocyclopropane-1-carboxylate synthase activity | 0.018 |
| GO:0042218 | 1-aminocyclopropane-1-carboxylate biosynthetic process | 0.017 |
| GO:0015035 | Protein disulfide oxidoreductase activity | 0.012 |
| GO:0009062 | Fatty acid catabolic process | 0.003 |
| GO:0015020 | Glucuronosyltransferase activity | 0.008 |
| GO:0047661 | Amino-acid racemase activity | 0.025 |
| GO:0009252 | Peptidoglycan biosynthetic process | 0.008 |
| GO:0016303 | 1-phosphatidylinositol-3-kinase activity | 0.037 |
| GO:0036092 | Phosphatidylinositol-3-phosphate biosynthetic process | 0.050 |
| GO:0042823 | Pyridoxal phosphate biosynthetic process | 0.001 |
| GO:0019213 | Deacetylase activity | 0.015 |
Only categories that are negatively and positively correlated are shown.