| Literature DB >> 32973216 |
Ayush Pathak1, Reuben W Nowell1,2, Christopher G Wilson1,2, Matthew J Ryan3, Timothy G Barraclough4,5.
Abstract
Antibiotics were derived originally from wild organisms and therefore understanding how these compounds evolve among different lineages might help with the design of new antimicrobial drugs. We report the draft genome sequence of Alexander Fleming's original fungal isolate behind the discovery of penicillin, now classified as Penicillium rubens Biourge (1923) (IMI 15378). We compare the structure of the genome and genes involved in penicillin synthesis with those in two 'high producing' industrial strains of P. rubens and the closely related species P. nalgiovense. The main effector genes for producing penicillin G (pcbAB, pcbC and penDE) show amino acid divergence between the Fleming strain and both industrial strains, whereas a suite of regulatory genes are conserved. Homologs of penicillin N effector genes cefD1 and cefD2 were also found and the latter displayed amino acid divergence between the Fleming strain and industrial strains. The draft assemblies contain several partial duplications of penicillin-pathway genes in all three P. rubens strains, to differing degrees, which we hypothesise might be involved in regulation of the pathway. The two industrial strains are identical in sequence across all effector and regulatory genes but differ in duplication of the pcbAB-pcbC-penDE complex and partial duplication of fragments of regulatory genes. We conclude that evolution in the wild encompassed both sequence changes of the effector genes and gene duplication, whereas human-mediated changes through mutagenesis and artificial selection led to duplication of the penicillin pathway genes.Entities:
Mesh:
Year: 2020 PMID: 32973216 PMCID: PMC7515868 DOI: 10.1038/s41598-020-72584-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(A) Synthesis pathway for penicillin G and penicillin N. Bold italics indicate genes producing enzymes that catalyse each step (next to arrows). (B) Regulatory genes considered in this study that either upregulate (+) or downregulate (−) expression of the effector enzymes above.
Voucher and repository information for the strains used in this study.
| Species | Strain | Origin |
|---|---|---|
| IMI 15378 | Alexander Fleming’s isolate from contaminated agar plate, London, UK[ | |
| P2niaD18 | Derived by mutagenesis and artificial selection from the IMI 015378 isolate, from mouldy cantaloupe, Illinois, USA[ | |
| Wisconsin 54-1255 | Derived by mutagenesis and artificial selection from the IMI 015378 isolate, from mouldy cantaloupe, Illinois, USA[ | |
| IBT 13039 | Goat cheese, Crete, Greece[ |
aOriginally misnamed as P. chrysogenum but confirmed by molecular data to belong to the P. rubens clade[3,4].
Genome assembly metrics for P. rubens (IMI 15378) and the published genomes.
| Strain | Fleming IMI 15378 | P2niaD18[ | Wisconsin 54-1255[ | |
|---|---|---|---|---|
| GenBank assembly accession (name) | GCA_902636305.1 (nPRUBv1) | GCA_000710275.1 (ASM71027v1) | GCA_000226395.1 (PenChr_Nov2007) | GCA_002072425.1 (ASM207242v1) |
| Coverage (method) | 93X (Illumina HiSeq) | 110X (Illumina HiSeq) | 9.8X (clone-based Sanger) | 115X (Illumina HiSeq) |
| Span (Mb) | 30.5 | 32.5 | 32.2 | 33.0 |
| No. scaffolds | 101 | 5 | 49 | 1956 |
| Scaffold N50 (kb) | 1619 | 10,455 | 3889 | 358 |
| Longest scaf fold (kb) | 3393 | 13,598 | 5624 | 2018 |
| % GC | 49.0% | 48.9% | 49.0% | 48.5% |
| BUSCOEUK ( | C: 99.0% [S: 98.7%, D: 0.3%], F: 0.0%, M: 1.0% | C: 99.0% [S: 98.7%, D: 0.3%], F: 0.0%, M: 1.0% | C: 99.0% [S: 98.7%, D: 0.3%], F: 0.0%, M: 1.0% | C: 99.0% [S: 98.7%, D: 0.3%], F: 0.0%, M: 1.0% |
| BUSCOFUN ( | C: 99.3% [S: 98.3%, D: 1.0%], F: 0.0%, M: 0.7% | C: 99.7% [S: 99.0%, D: 0.7%], F: 0.0%, M: 0.3% | C: 99.6% [S: 98.6%, D: 1.0%], F: 0.0%, M: 0.4% | C: 99.7% [S: 99.0%, D: 0.7%], F: 0.0%, M: 0.3% |
BUSCO notation: C, complete BUSCOs; S, complete and single-copy BUSCOs, D, complete and duplicated BUSCOs; F, fragmented BUSCOs; M, missing BUSCOs.
Figure 2Dotplots showing regions of forward alignment (in blue), reversals (green) and repetitive alignments (orange). The 4 nuclear chromosomes of the ‘P. chrysogenum’ P2niaD18 assembly were used as the reference in each case: top panel, Fleming’s strain P. rubens (IMI 15378); middle panel, Wisconsin 54-1255; bottom panel, P. nalgiovense (IBT 13039).
Figure 3Structure of the penicillin gene cluster in the four strains. Distance between tandem duplicates in P2niaD18 not shown to scale (indicated by dashed line). The asterisk and vertical line in pcbAB indicate a fragment matching to a 36 bp fragment of the cefD1 gene. More detailed view of the region is in figure S2.
Figure 4The average ratio of non-synonymous to synonymous substitutions (dN/dS) for alignments of penicillin pathway genes across the sampled genomes: standard error bars on the estimate from the PAML analysis are shown. The first four genes were sampled for three strains: Wisconsin 54-1255, Fleming (IMI 15378), and P. nalgiovense (IBT 13039). The remaining genes were compared just between the Fleming (IMI 15378) genome and P. nalgiovense (IBT 13039) because of the lack of any variation across the three P. rubens strains.