| Literature DB >> 25955857 |
Tim A Dahlmann1, Ulrich Kück1.
Abstract
MicroRNAs (miRNAs) are non-coding small RNAs (sRNAs) that regulate gene expression in a wide range of eukaryotes. In this study, we analyzed regulatory sRNAs in Penicillium chrysogenum, the industrial producer of the β-lactam antibiotic penicillin. To identify sRNAs and microRNA-like RNAs (milRNAs) on a global approach, two sRNA sequencing libraries were constructed. One library was created with pooled total RNA, obtained from twelve differently grown cultures (RNA Mix), and the other with total RNA from a single submerged cultivation (∆ku70FRT2). Illumina sequencing of both RNA libraries produced 84,322,825 mapped reads. To distinguish between Dicer-dependent and independent sRNA formation, we further constructed two single dicer gene mutants (∆dcl2 and ∆dcl1) and a dicer double mutant (∆dcl2∆dcl1) and analyzed an sRNA library from the Dicer-deficient double-mutant. We identified 661 Dicer-dependent loci and in silico prediction revealed 34 milRNAs. Northern blot hybridization of two milRNAs provided evidence for mature milRNAs that are processed either in a complete or partial Dicer-dependent manner from an RNA precursor. Identified milRNAs share typical characteristics of previously discovered fungal milRNAs, like a strong preference for a 5' uracil and the typical length distribution. The detection of potential milRNA target sites in the genome suggests that milRNAs might play a role in posttranscriptional gene regulation. Our data will further increase our knowledge of sRNA dependent gene regulation processes, which is an important prerequisite to develop more effective strategies for improving industrial fermentations with P. chrysogenum.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25955857 PMCID: PMC4425646 DOI: 10.1371/journal.pone.0125989
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistical summary of small RNA sequencing data and distribution of small RNAs with perfect match to the genome sequence of P. chrysogenum.
| Total reads | Unique reads (≥ 10 reads) | ||||||
|---|---|---|---|---|---|---|---|
| RNAs (15–36 nt) | RNA-Mix | ∆ | ∆ | RNA-Mix | ∆ | ∆ | |
|
| 46,240,082 | 52,144,415 | 65,660,252 | 122,018 | 113,603 | 104,922 | |
|
| 39,188,594 | 45,134,231 | 57,721,023 | 89,096 | 77,035 | 67,428 | |
|
| 84.80% | 86.60% | 87.90% | 73.01% | 67.81% | 64.26% | |
|
| 16,171,567 | 16,249,031 | 20,055,379 | 38,337 | 35,173 | 26,558 | |
| sense | 11,612,935 | 10,975,391 | 13,497,944 | 29,659 | 27,835 | 24,798 | |
| antisense | 4,942,727 | 5,554,209 | 6,874,230 | 10,389 | 8,631 | 2,679 | |
|
| 5,619,395 | 6,472,049 | 7,539,001 | 16,527 | 17,680 | 8,431 | |
| sense | 1,633,831 | 1,687,351 | 1,557,211 | 9,788 | 11,875 | 7,596 | |
| antisense | 4,009,410 | 4,799,719 | 5,984,778 | 7,043 | 6,020 | 865 | |
|
| 10,569,963 | 9,777,092 | 12,504,834 | 21,702 | 17,439 | 18,022 | |
| sense | 9,996,408 | 9,276,048 | 11,918,300 | 19,744 | 15,839 | 17,085 | |
| antisense | 905,029 | 741,304 | 873,060 | 3,238 | 2,557 | 1,729 | |
|
| 6,128,717 | 6,746,039 | 8,779,724 | 5,062 | 4,133 | 4,363 | |
| sense | 6,126,353 | 6,745,245 | 8,779,649 | 5,025 | 4,123 | 4,363 | |
| antisense | 2,577 | 971 | 277 | 38 | 11 | 1 | |
|
| 22,659,643 | 29,792,848 | 39,201,820 | 43,143 | 34,808 | 39,639 | |
| sense | 22,652,991 | 29,780,783 | 39,201,243 | 43,051 | 34,639 | 39,626 | |
| antisense | 6,652 | 12,065 | 577 | 92 | 169 | 3 | |
|
| 21,367,882 | 27,127,757 | 35,564,101 | 50,226 | 41,195 | 35,646 | |
[1] Perfectly mapped reads with more than one locus may occur in several categories as mapped reads.
Fig 1Chromosomal distribution of small RNAs and sRNA-producing loci.
Pie graphs for total reads (A) and unique reads (B) are showing the relative abundance of sRNAs located in tRNAs, rRNAs, intergenic, exonic and intronic regions in ∆ku70FRT2. Alignments of sRNA-producing loci of ∆ku70FRT2 (C) and ∆dcl2∆dcl1 (D) show that the number of sRNAs that map to both DNA strands of one feature have strongly increased and that the fraction of sRNA loci that align to exonic regions in sense orientation has decreased substantially in ∆dcl2∆dcl1 compared to ∆ku70FRT2.
Characterization of small RNA producing loci.
| RNA-Mix | ∆ | ∆ | ||
|---|---|---|---|---|
|
| 1513 | 1319 | 783 | |
|
| 438 | 374 | 6 | |
|
| ||||
|
| on both strands | 292 | 233 | 0 |
| only sense | 319 | 335 | 337 | |
| only antisense | 137 | 114 | 20 | |
|
| on both strand | 2 | 1 | 0 |
| strand-specific | 21 | 17 | 16 | |
|
| on both strands | 4 | 0 | 0 |
| strand-specific | 177 | 181 | 176 | |
|
| on both strands | 3 | 4 | 3 |
| strand specific | 37 | 37 | 43 | |
|
| on both strands | 137 | 136 | 3 |
| strand-specific | 384 | 261 | 185 | |
Fig 2Length distribution of P. chrysogenum small RNA population.
Length distribution of mapping sRNA reads for the datasets of total (A) and unique (B) reads obtained from three different samples. Frequency of the 5'-nucleotide of the unique reads of ∆ku70FRT2 (C) and ∆dcl2∆dcl1 (D) in dependency of their read length.
Fig 3Characterization of Dicer-dependent small RNAs.
(A) Strain-specific and overrepresented unique reads in ∆ku70FRT2 compared to ∆dcl2∆dcl1 and vice versa. (B) Nucleotide preference and size distribution of Dicer-dependent small RNAs. (C) Pie graphs of the relative abundance of Dicer-dependent sRNAs and (D) Dicer-dependent sRNA-producing loci in accordance to their strand bias.
Fig 4Accumulation of sRNAs along representative coding-sequences.
(A) Normalized read count (TPTM: transcripts per ten million) of ∆ku70FRT2 (grey graph) and ∆dcl2∆dcl1 (black graph) of two representative Dicer-dependent coding regions, the Copia13-like transposable element Pc17g00440 and the putative DNA-binding protein Pc12g14660. (B) Dicer-independent sRNA accumulation for the coding region of the putative cell-wall protein Pc20g06530 and for a histidine tRNA-gene cluster. To ensure a faultless representation of Dicer-independent reads the graphs for ∆dcl2∆dcl1 were slightly moved to the right.
Predicted milRNAs and the appearance of their reads within the three datasets.
| milRNA | Sequence (5'-3') | Normalized Expression Level (TPTM[
| Score[
| MFE[
| Precursor location | ||
|---|---|---|---|---|---|---|---|
| RNA-Mix | ∆ | ∆ | |||||
|
| uugguacgaucgauuggaga | 122 | 305 | 1 | 882.2 | -41.0 | ChrI: 11548223–11548283, exon sense (Pc16g07970) |
|
| ugagaacgcugauauauauau | 152 | 3 | 0 | 373.4 | -17.1 | ChrIII: 2410668–2410742, intergenic |
|
| uagaagaucaugcagcaugua | 144 | 148 | 0 | 356.5 | -29.2 | ChrI: 13214806–13214889, exon antisene (Pc15g01560) |
|
| ugcgacacaagaacaaucggacgau | 6 | 40 | 0 | 208.3 | -22.4 | ChrIII: 3988218–3988278, exon sense (Pc22g06050) |
|
| ugacauaggaacgacgaga | 17 | 27 | 0 | 231.2 | -24.2 | ChrIV: rev(37091–37159), exon antisense (Pc16g00090) |
|
| ucgggcccguagcugugaugc | 69 | 123 | 0 | 167.2 | -32.7 | ChrI: 11697045–11697111; intergenic |
|
| uagguucaggaaguucucucc | 40 | 12 | 0 | 166.2 | -25.2 | ChrI: rev(11697681–11697758), intergenic; ChrI: rev(12068486–12068563), intergenic; ChrII: rev(4407094–4407171). intergeneic |
|
| uggcuacggauaugacucuc | 114 | 35 | 0 | 161.5 | -42.6 | ChrII: 4498587–4498671, intergenic |
|
| ggccccagagaucguuggucuaa | 43 | 31 | 0 | 154.3 | -25.9 | ChrII: rev(6202677–6202728), intergenic |
|
| uagccaucucgucgagguaga | 123 | 46 | 4 | 143.4 | -29.9 | ChrIII: rev(877512–877577), intergenic |
|
| uaugucugcucugguccaugac | 4 | 12 | 0 | 136.9 | -49.6 | ChrI: 12126475–12126539, exon sense (Pc16g15350) |
|
| uccaggcugacgguggcgca | 23 | 2 | 0 | 135.4 | -35.1 | ChrIV: 2408756–2408823, exon antisense (Pc13g13430) |
|
| uaggacuugaucgcgugagac | 12 | 5 | 0 | 128.5 | -28.7 | ChrI: 8239273–8239341, exon antisense (Pc13g04730) |
|
| uagaaugcaagcgcguugagc | 52 | 27 | 1 | 104.6 | -24.4 | ChrII: rev(3729478–3729558), intergenic |
|
| uucgauugggaucugggccu | 28 | 19 | 0 | 91.4 | -33.6 | ChrI: rev(7068824–7068895), intergenic |
|
| ugggcgggcgagauugaac | 66 | 14 | 0 | 90.4 | -30.0 | ChrI: rev(9237687–9237761), exon sense (Pc13g08790) |
|
| uccaucguggcugugcacuu | 19 | 20 | 1 | 75.6 | -26.9 | ChrII: rev(5605185–5605257), exon antisense (Pc21g03660) |
|
| uggacgaggaacaucugcac | 12 | 1 | 0 | 73.0 | -25.7 | ChrI: rev(8625875–8625949), exon antisense (Pc13g06180) |
|
| cggacgagaaugccgaggcucuguu | 20 | 5 | 0 | 67.3 | -52.8 | ChrIV: 979955–980021, intergenic |
|
| uccuacguuacuccaaaggau | 10 | 1 | 0 | 61.9 | -29.1 | ChrI: 8009736–8009780, intergenic |
|
| ucccccucgagaagaucuagucugccucu | 8 | 1 | 0 | 59.2 | -34.0 | ChrI: 7174825–7174898, intergenic |
|
| uuuggaugaaauucgaaugaa | 15 | 2 | 0 | 37.9 | -32.8 | ChrIII: 1228283–1228364, intergenic |
|
| uagacuguuccaaggaugcu | 11 | 14 | 1 | 36.6 | -25.0 | ChrII: 8010371–8010446, exon antisense (Pc21g13900) |
|
| ugcacaaguggacucucccu | 10 | 4 | 0 | 36.6 | -21.7 | ChrI: 12845073–12845153, exon sense (Pc06g00870) |
|
| ucaucggcaaaacuuggagaa | 5 | 8 | 0 | 32.2 | -28.5 | ChrII: rev(3651114–3651202), intergenic |
|
| ucacacguagaaauccagau | 8 | 6 | 0 | 31.6 | -19.8 | ChrI: 10506362–10506430, intergenic |
|
| ucuaccgagacugucuuuga | 13 | 6 | 0 | 31,2 | -31.3 | ChrI: 4059417–4059459, intergenic |
|
| uuagcaugcaugguauugua | 2 | 13 | 0 | 30.8 | -34.5 | ChrII: 8593139–8593200, intergenic |
|
| ugcuuggucgucacucggga | 11 | 6 | 0 | 28.2 | -28.4 | ChrII: 2351660–2351714, intergenic |
|
| uaccaagucgucgaaaugcu | 13 | 3 | 0 | 25.1 | -27.1 | ChrIV: 190879–190926, exon antisense (Pc16g00780) |
|
| ugagaccgcggagcaaacg | 29 | 12 | 0 | 24.0 | -24.5 | ChrI: 3765153–3765197, intergenic |
|
| uggagaaugucacuuguggaa | 2 | 4 | 0 | 22.6 | -36.1 | ChrIII: rev(719002–719055), exon antisense (Pc12g03150) |
|
| augcccuucggcguuagucuacc | 5 | 1 | 1 | 22.4 | -31.6 | ChrIV: rev(1820276–1820326), intergenic |
|
| uuucccaucucgaucaccgga | 15 | 1 | 0 | 20.8 | -19.0 | ChrIV: rev(1325461–1325507), intergenic |
[1] TPTM: transcripts per ten million
[2] Score: miRDeep2 prediction score
[3] MFE: minimal free energy
milRNA target gene prediction.
| milR ID | target ID | function | miRanda score | TAPIR score | psRNATarget UPE | Comment[
|
|---|---|---|---|---|---|---|
|
| Pc22g17450 | hypothetical protein | 164 | - | - | - |
| Pc12g13700 | Dicer-like protein ( | 161 | - | - | - | |
| Pc12g02820 | hypothetical protein | - | 4 | - | - | |
|
| Pc12g15260 | hypothetical protein | 174 | - | 17.2 | - |
|
| Pc22g02960 | SNARE associated Golgi protein | 179 | - | 17.0 | - |
|
| Pc04g00060 | hypothetical protein | 190 | 0 | 21.7 | star |
| Pc19g00700 | hypothetical protein | 190 | 0 | 23.6 | star | |
| Pc24g01200 | hypothetical protein | 190 | 0 | 23.8 | star | |
| Pc24g02780 | hypothetical protein | 190 | 0 | 24.5 | star | |
|
| Pc16g15330 | hypothetical protein | 200 | 0 | 20.1 | star |
| Pc12g06730 | hypothetical protein | 179 | 2.5 | - | - | |
|
| Pc24g01250 | hypothetical protein | 195 | 0 | 13.3 | star |
| Pc17g00230 | hypothetical protein | - | 1 | 14.4 | - | |
| Pc21g00150 | putative proline-rich call wall protein | - | 2 | 15.2 | - | |
| Pc24g02850 | hypothetical protein | - | 2 | 17.1 | - | |
|
| Pc21g17740 | SNF2 family helicase | 290 | - | - | - |
| Pc12g12880 | ubiquitin carboxyl-terminal hydrolase | 286 | - | - | - | |
| Pc13g12300 | BRCA1-like DNA repair protein | 283 | - | - | - | |
| Pc13g05360 | putative Ku70-binding protein | - | - | 20.8 | - | |
|
| Pc20g05760 | putative cytochrome P450 protein | 190 | 0 | 18.5 | star |
| Pc18g04580 | hypothetical protein | 170 | 3 | 19.9 | - | |
|
| Pc22g16980 | hypothetical protein | 169 | 3.5 | 21.2 | - |
|
| Pc22g13440 | hypothetical protein | 185 | 0 | - | star |
| Pc21g05840 | putative Zn(2)-C6 DNA-binding protein | 176 | 2.5 | - | - | |
| Pc21g02000 | PLC-like phosphodiesterase | 171 | 4 | - | - | |
| Pc12g10520 | hypothetical protein | 172 | 3.5 | - | - | |
|
| Pc16g04680 | hypothetical protein | 195 | 0 | 19.9 | star |
| Pc18g05650 | hypothetical protein | 171 | - | 21.7 | - |
[1] Predicted target sites were compared to the milRNA-star sequence.
Fig 5Validation and expression analysis of (A) milR-1 and (B) milR-21.
Total RNA from strain P2niaD18, the recipient ∆ku70FRT2 as well as Dicer single and double mutant strains were used for polyacrylamide gel electrophoresis and northern blot analysis. Mature milRNAs (milRs) and milRNA precursors (pre-milRs) were detected with revers complement 32P-labled DNA probes. Below, loading controls of the total RNA, stained with ethidium bromide (EtBr), and the predicted secondary structures of milRNA precursors are given. On the secondary structures, milRNA sequences are highlighted in red and arrows indicate the expected Dicer cleavage sites.