| Literature DB >> 28725336 |
Sujay Paul1, Yvette Ludeña1, Gretty K Villena1, Fengan Yu2, David H Sherman2, Marcel Gutiérrez-Correa1.
Abstract
Filamentous fungus Aspergillus niger has high industrial value due to their lignocellulolytic enzyme activities and ATCC 10864 is one of the few type strains of A. niger which has a unique biofilm forming capability. Here we report the first draft genome sequence of A. niger ATCC 10864 strain. The genome of A. niger ATCC 10864 is 36,172,237 bp long and comprise of 310 scaffolds with 49.5% average GC content. A total of 10,804 protein-coding genes were predicted among which 10,761 genes were with putative functions. A. niger ATCC 10864 genome coded for 709 putative carbohydrate active enzyme families distributed in six functional categories and among them glycoside hydrolases (GHs) represent the most number of families (279). Genes that include pepA, brlA, exgA, LaeA, rodA, GCN have also been identified in this study, which may play a role in biofilm formation. This high-quality draft genome sequence will facilitate our understanding of the mechanisms behind fungal biofilm formation and higher lignocellulolytic enzyme production.Entities:
Keywords: Aspergillus niger; Biofilm; Genomic feature; Lignocellulolytic enzyme
Year: 2017 PMID: 28725336 PMCID: PMC5514509 DOI: 10.1186/s40793-017-0254-2
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Electron micrograph of Aspergillus niger strain ATCC 10864. Inset: ornamented spores
Classification and general features of Aspergillus niger strain ATCC 10864
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain ATCC 10864 | |||
| Gram stain | N/A | ||
| Cell shape | Septate and hyaline hyphae | TAS [ | |
| Motility | Non-motile | NAS | |
| Sporulation | Carbon black ornamented spores from biseriate phialides | TAS [ | |
| Temperature range | 6 °C–47 °C | TAS [ | |
| Optimum temperature | 25-35 °C | TAS [ | |
| pH range | 1.4–9.8 | TAS [ | |
| Carbon source | Organic carbon source | TAS [ | |
| MIGS-6 | Habitat | In soil, litter, compost and on decaying plant material | TAS [ |
| MIGS-6.3 | Salinity | 0-5% | TAS [ |
| MIGS-22 | Oxygen | Aerobic | TAS [ |
| MIGS-15 | Biotic relationship | Plant and animal | TAS [ |
| MIGS-14 | Pathogenicity | Mild/Opportunistic | TAS [ |
| MIGS-4 | Geographic location | Budapest, Hungary | TAS [ |
| MIGS-5 | Sample collection time | Unknown | |
| MIGS-4.1 | Latitude | Unknown | |
| MIGS-4.2 | Longitude | Unknown | |
| MIGS-4.3 | Depth | Unknown | |
| MIGS-4.4 | Altitude | Unknown |
aEvidence codes - TAS traceable author statement (i.e., a direct report exists in the literature), NAS non-traceable author statement
Fig. 2Phylogenetic tree showing the close relationship of A. niger strain ATCC 10864 (grey block) with other A. niger strains based on aligned sequences of the 18S rRNA gene. Multiple sequence alignment was performed using ClustalX program and the phylogeny was calculated using maximum likelihood method based on the Tamura-Nei model. The bootstrap value was inferred from 1000 replicates. Penicillium solitum is considered as the outgroup for this analysis. The whole analysis was carried out using MEGA5 package [43]
Project Information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft (Full genome representation) |
| MIGS-28 | Libraries used | libraries of 400–450 bp |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 (100*2) |
| MIGS31.2 | Fold coverage | 88.19 X |
| MIGS-30 | Assemblers | SPAdes v 3.1 |
| MIGS-32 | Gene calling method (Gene prediction tool) | Augustus 3.0.3 |
| Locus tag | Not indicated | |
| Genbank ID | MCQH00000000 | |
| GenBank Date of Release | 29-AUG-2016 | |
| Bioproject | PRJNA300350 | |
| MIGS-13 | Source material identifier | ATCC 10864 |
| Project relevance | Industrial |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 36,172,237 | 100.0 |
| DNA coding (bp) | 17,676,693 | 48.87 |
| DNA G + C (bp) | 17,905,257 | 49.50 |
| DNA scaffolds | 310 | |
| Total genes | 10,973 | 100.0 |
| Protein coding genes | 10,804 | 98.45 |
| RNA genes | 169 | 1.56 |
| Pseudo genes | Unknown | |
| Genes in internal clusters | Unknown | |
| Genes with function prediction (GO annotated proteins) | 10,761 | 98.06 |
| Genes assigned to COGs | 7509 | 69.50 |
| Genes with Pfam domains | 8430 | 76.82 |
| Genes with signal peptides | 994 | 9.05 |
| Genes with transmembrane helices | 2362 | 21.86 |
| CRISPR repeats | N/A |
Number of genes associated with general COG functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 324 | 2.99 | Translation, ribosomal structure and biogenesis |
| A | 29 | 0.20 | RNA processing and modification |
| K | 337 | 3.11 | Transcription |
| L | 362 | 3.35 | Replication, recombination and repair |
| B | 42 | 0.30 | Chromatin structure and dynamics |
| D | 80 | 0.70 | Cell cycle control, cell division, chromosome partitioning |
| Y | 2 | 0.01 | Nuclear structure |
| V | 70 | 0.60 | Defense mechanisms |
| T | 236 | 2.18 | Signal transduction mechanisms |
| M | 179 | 1.65 | Cell wall/membrane biogenesis |
| N | 3 | 0.02 | Cell motility |
| Z | 51 | 0.40 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 90 | 0.80 | Intracellular trafficking and secretion |
| O | 311 | 2.87 | Posttranslational modification, protein turnover, chaperones |
| C | 437 | 4.04 | Energy production and conversion |
| G | 813 | 7.52 | Carbohydrate transport and metabolism |
| E | 910 | 8.42 | Amino acid transport and metabolism |
| F | 129 | 1.19 | Nucleotide transport and metabolism |
| H | 250 | 2.31 | Coenzyme transport and metabolism |
| I | 409 | 3.78 | Lipid transport and metabolism |
| P | 616 | 5.70 | Inorganic ion transport and metabolism |
| Q | 600 | 5.55 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 1772 | 16.40 | General function prediction only |
| S | 195 | 1.80 | Function unknown |
| - | 3295 | 30.49 | Not in COGs |
Fig. 3Whole genome comparison analysis of A. niger ATCC 10864 with other reported A. niger strains. a Comparison of A. niger ATCC 10864, A. niger An76, and A. niger SH-2 strains against reference A. niger ATCC 1015 strain (using BRIG tool). The outermost dark green circle represents the reference genome of A. niger ATCC 1015; the next purple, blue and light green circle represent A. niger SH-2, A. niger An76, and A. niger ATCC 10864 strains respectively. The black line lying in between the A. niger ATCC 10864 and genome-scale symbolize the GC content. b Comparison of A. niger ATCC 10864, A. niger An76, and A. niger SH-2 strains against reference A. niger ATCC 1015 strain (using Mauve tool). Colored block outlines are known as Locally Collinear Blocks and are connected by corresponding coloured lines. LCBs represent the regions of similarity among the genomes that are homologous and have not undergone any rearrangement. Blocks lying above the center black horizontal line are in forward orientation while blocks below the center line indicate regions that align in the reverse complement (inverse) orientation. Regions outside blocks (white regions) show no homology among the input genomes