| Literature DB >> 34849826 |
Jeffrey M Skerker1, Kaila M Pianalto1, Stephen J Mondo2,3, Kunlong Yang4, Adam P Arkin1,5, Nancy P Keller4, Igor V Grigoriev2,6, N Louise Louise Glass1,6.
Abstract
Aspergillus flavus is an opportunistic pathogen of crops, including peanuts and maize, and is the second leading cause of aspergillosis in immunocompromised patients. A. flavus is also a major producer of the mycotoxin, aflatoxin, a potent carcinogen, which results in significant crop losses annually. The A. flavus isolate NRRL 3357 was originally isolated from peanut and has been used as a model organism for understanding the regulation and production of secondary metabolites, such as aflatoxin. A draft genome of NRRL 3357 was previously constructed, enabling the development of molecular tools and for understanding population biology of this particular species. Here, we describe an updated, near complete, telomere-to-telomere assembly and re-annotation of the eight chromosomes of A. flavus NRRL 3357 genome, accomplished via long-read PacBio and Oxford Nanopore technologies combined with Illumina short-read sequencing. A total of 13,715 protein-coding genes were predicted. Using RNA-seq data, a significant improvement was achieved in predicted 5' and 3' untranslated regions, which were incorporated into the new gene models.Entities:
Keywords: zzm321990 Aspergillus flavuszzm321990 ; NRRL 3357; Nanopore; PacBio; genome sequence
Mesh:
Substances:
Year: 2021 PMID: 34849826 PMCID: PMC8496237 DOI: 10.1093/g3journal/jkab213
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Summary of assembly and annotation statistics of the A. flavus NRRL 3357 genome
| Assembly statistics |
|
| Mixed sequencing assembly (this work) |
|---|---|---|---|
| Genome size (Mbp) | 36.89 | 36.89 | 37.75 |
| Coverage | 5X | 5X | 650X |
| Number of scaffolds/contigs | 958/331 | 958/331 | 8 |
| L50 | 6 | 6 | 4 |
| N50, Mbp (Scaffold) | 2.39 | 2.39 | 4.81 |
| Complete chromosomes | — | — | 8 |
| % genes by BUSCO assessment | |||
| Single-copy | 92.70 | 96.30 | 98.70 |
| Duplicated | 0.30 | 1.70 | 0.50 |
| Fragmented | 3.00 | 0.90 | 0.00 |
| Annotation statistics | |||
| Number of predicted protein coding genes | 13,485 | 14,313 | 13,715 |
| Percent gene models with annotated UTRs | 9.25% | — | 44.43% |
| Predicted secondary metabolite clusters | 56 | 83 | |
| Predicted CAZymes | 627 | 644 | |
| Total repeat length (bp) | 404,315 (1.1%) | 1,311,342 (3.47%) |
Nierman et al. (2015).
Hatmaker et al. (2020).
Seppey et al. (2019).
Drott et al. (2021).
Figure 1Graphic representation of the 8 NRRL 3357 Aspergillus flavus assembled chromosomes. Green traces represent AT content and dark blue traces represent GC content. Teal-colored arrows indicate predicted coding regions1.
1Figure generated using Geneious Prime v.2021.0.3; www.geneiousprime.com.