| Literature DB >> 24564352 |
Fu-Qiang Wang, Jun Zhong, Ying Zhao, Jingfa Xiao, Jing Liu, Meng Dai, Guizhen Zheng, Li Zhang, Jun Yu, Jiayan Wu, Baoling Duan.
Abstract
BACKGROUND: Due to the importance of Penicillium chrysogenum holding in medicine, the genome of low-penicillin producing laboratorial strain Wisconsin54-1255 had been sequenced and fully annotated. Through classical mutagenesis of Wisconsin54-1255, product titers and productivities of penicillin have dramatically increased, but what underlying genome structural variations is still little known. Therefore, genome sequencing of a high-penicillin producing industrial strain is very meaningful.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24564352 PMCID: PMC4046689 DOI: 10.1186/1471-2164-15-S1-S11
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
P. chrysogenum NCPC10086 genome sequencing data
| Instruments | Insert fragment size (Kb) | Reads length | Sequencing throughput (Mb) | Coverage |
|---|---|---|---|---|
| Roche 454 GS | single end | 410 | 614 | 18× |
| Illumina HiSeq 2000 | 1-2 | 50 | 6,120 | 180× |
| ABI 3730 | 3-4 | 659 | 170 | 5× |
| MegaBACE 1000 | 6-8 | 739 | 34 | 1× |
| Total | - | - | 6,938 | 204× |
The estimated genome size of P. chrysogenum NCPC10086 is about 34 Mb.
Global statistics of the genome assembly and annotation of P. chrysogenum NCPC10086
| Assembly | Number | N50 (Kb) | Longest (Kb) | Size (Mb) | Percentage of the assembly |
|---|---|---|---|---|---|
|
| 327 | 661 | 1,655 | 32.2 | 99.7 |
|
| 175 | 2,847 | 4,063 | 32.3 | 100 |
|
|
|
|
|
| |
|
| 13,290 | 1,499 | 48.9 | 2,430 | |
|
| 10,966 | 1,559 | 51.6 | 2,945 |
Figure 1Gene annotation and gene ontology of NCPC10086. (A) Venn diagram showing unique and shared proteins could be annotated by databases of Non-redundant, InterProscan, Swiss-Prot/UniProtKB and Gene Ontology. (B) There are 6,831 proteins could be assigned to cellular component, biological process and molecular function by Gene Ontology classification system.
Figure 2The single nuclear variations (SNVs) statistics between NCPC10086 and Wisconsin54-1255. (A) We discovered 11,573 genes are identical and 759 SNVs between two strains. (B) Among them, 135 SNVs take place in intron regions, 177 SNVs are synonymous mutations and 447 SNV are non-synonymous mutations including 34 termination codon mutations.
Metabolism or progress involved by several "new" genes
| Gene name | Length | Location | The metabolism or progress |
|---|---|---|---|
| Pch125g10680 | 4980 | scaf125 | Amino sugar and nucleotide sugar metabolism |
| Pch106g00010 | 769 | scaf106 | Nitrogen metabolism, oxidative phosphorylation |
| Pch114g00050 | 153 | scaf114 | Oxidative phosphorylation |
| Pch041g00010 | 713 | scaf041 | Riboflavin metabolism |
| Pch056g00010 | 787 | scaf056 | N-Glycan biosynthesis |
| Pch018g00010 | 694 | scaf018 | Glutathione, arachidonic acid, taurine and hypotaurine metabolism, |
| Pch180g00010 | 580 | scaf180 | Fluorobenzoate, chlorocyclohexane and chlorobenzene, toluene degradation |
Figure 3Comparative organizations of penicillin biosynthetic genes cluster (PBC) in different strains. (A) PBC region of Wisconsin54-1255 is about 56.9-kb, consisting of 53.7-kb fragment and 3.2-kb shift fragment bounded by a conserved TGTAAA/T hexanucleotide. (B) PBC fragment arrangement schematic. We discovered a new shift fragment in NCPC10086, marked with orange bar and blue letters.
Figure 4Identification of 266 Kb translocation. (A) A 266 Kb fragment translocation (orange bar) between Wisconsin54-1255 and NCPC10086. Genes are marked with green bar; special one is red boxed. (B) Reads alignment of the region around the breakpoint of translocation shows that there are 11 reads to support our conclusion. (C) PCR identification of the translocation. W stands for Wisconsin54-1255 and N stands for NCPC10086.
Figure 5Identification of 1,202 Kb translocation. (A) A 1,202 Kb fragment translocation (orange bar) between Wisconsin54-1255 and NCPC10086. Genes are marked with green bar; special one is red boxed. (B) Reads alignment of the region around the breakpoint of translocation shows that there are 11 reads to support our conclusion. (C) PCR identification of the translocation. W stands for Wisconsin54-1255 and N stands for NCPC10086.
Single nuclear variations (SNVs) involved in homogentisate pathway and the regulators of penicillin biosynthesis
| Gene | Length | Description | Discrepancies |
|---|---|---|---|
|
| 1,727 | A phenylacetate 2-hydroxylase which catalyzes the first step of the homogentisate pathway for PAA catabolism | -- |
|
| 1,797 | Strongly similar to | -- |
|
| 1,785 | Strongly similar to | non-synonymous |
|
| 2,423 | Strongly similar to | synonymous |
|
| 1,340 | A global regulator of secondary metabolism | -- |
|
| 1,745 | An activator and repressor of secondary metabolism | non-synonymous (C1002T) |