Literature DB >> 31896653

Draft Genome Sequence of Aspergillus oryzae BP2-1, Isolated from Traditional Malted Rice in South Korea.

Jongbum Jeon1, Jung A Kim2, Sook-Young Park3, Gye-Won Kim4, Cheon-Seok Park5,6, Changmu Kim7, Hye Yoon Park2, Joo-Hong Yeo2, Yong-Hwan Lee8, Soonok Kim9.   

Abstract

The fungus Aspergillus oryzae strain BP2-1 was isolated from the traditional malted starter culture nuruk. We report here the draft whole-genome sequence of A. oryzae BP2-1, which is comprised of 14 scaffolds with a total length of 39,455,382 bp and a GC content of 47.13%.
Copyright © 2020 Jeon et al.

Entities:  

Year:  2020        PMID: 31896653      PMCID: PMC6940305          DOI: 10.1128/MRA.01405-19

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Aspergillus oryzae is extensively used in industry, especially in the production of fermented foods and alcoholic beverages. Discovery of novel A. oryzae strains may assist in the development of novel enzymes and chemicals. An A. oryzae strain, BP2-1, which confers a unique flavor to makgeolli, was isolated during a survey of fungal strains from traditional nuruk in 2014. Moreover, extracts of malted rice with this strain exhibited lipid metabolism improvement activity by increasing peroxisome proliferator-activated receptor a (PPARa) activity in monkey kidney cells (CV-1) and a skin-lightening effect by inhibiting tyrosinase activity. Here, we report the draft genome sequence of A. oryzae BP2-1. This strain was isolated from a nuruk sample collected from Donghae (Gangwon, South Korea) according to the protocols described by Yang et al. (1) and was deposited in the National Institute of Biological Resources (NIBR) culture collection under the accession no. NIBRFGC000134774. High-quality genomic DNA was extracted from A. oryzae BP2-1 germlings grown overnight at 23°C in potato dextrose broth with shaking at 120 rpm using a DNeasy minikit (Qiagen, Valencia, CA, USA) according to the manufacturer’s instructions. Genome sequencing of A. oryzae BP2-1 was conducted at Theragen Etex Bio Institute (Suwon, South Korea). One short-insert paired-end (PE) library (fragment size, 550 bp) and two long-insert mate pair (MP) libraries (insert sizes, 5 kb and 10 kb) were generated using a TruSeq DNA sample prep kit and a Nextera MP library prep kit (Illumina, CA, USA), respectively. The single-molecule real-time (SMRT) sequencing library was prepared according to the PacBio standard library preparation protocol using a PacBio DNA template prep kit (Pacific Biosciences, CA, USA), with a fragment size of 20 kb. A total of 996,216 reads with an average read size of 7,484 bp were generated on 8 cells of a PacBio RS II sequencer. Reads were assembled using an overlap-layout-consensus (OLC) algorithm and FALCON v. 0.2JASM (2), generating 41 contigs with a total length of 38,361,741 bp and an N50 value of 2,463,276 bp. After filtering with NextClip v. 1.3 (3), we obtained 46,143,916 reads with a Q30 of 88.70% from the PE library, 31,740,314 high-quality reads from the 5-kb inserted MP library, and 22,346,808 high-quality reads from the 10-kb inserted MP library from 101 cycles on an Illumina HiSeq 2000 platform. Using SOAPdenovo v. 2.04 (4), these short-read data sets were assembled into 78 contigs with a total length of 39,948,698 bp and an N50 value of 2,254,781 bp. These two assemblies were merged into 23 contigs using HaploMerger2, followed by cleaning twice using the faDnaPolishing.pl script provided by HaploMerger2 (removeShortSeq=500) (5). Scaffolding and gap filling were performed using SSPACE-standard v3.0 (6), SSPACE-LongRead v1.1 (7), and GapFiller v1.10 (8), using default parameters. The final assembly consisted of 14 scaffolds with a total length of 39,455,382 bp and an N50 value of 4,366,015 bp. The GC content of the assembled genome was 47.13%. Genome assembly was validated using BUSCO v. 3.0.2b with the fungal ortholog data set (fungi_odb9) (9). In the assembly, 98.6% of the orthologs, including one that was duplicated, were found to be complete. This draft genome sequence will provide valuable information to identify genes for various bioactivities and will also facilitate comparative genomics with other publicly available Aspergillus fungal sequences for evolutionary studies.

Data availability.

The genome sequence of Aspergillus oryzae BP2-1 obtained in this study has been deposited at GenBank under the accession no. NGZN00000000. The version described in this article is the first version, NGZN01000000. SRA data of PacBio and Illumina sequences were also deposited at GenBank under accession no. SRR9306672 to SRR9306682 (BioProject no. PRJNA368788).
  8 in total

1.  Scaffolding pre-assembled contigs using SSPACE.

Authors:  Marten Boetzer; Christiaan V Henkel; Hans J Jansen; Derek Butler; Walter Pirovano
Journal:  Bioinformatics       Date:  2010-12-12       Impact factor: 6.937

2.  BAUM: improving genome assembly by adaptive unique mapping and local overlap-layout-consensus approach.

Authors:  Anqi Wang; Zhanyu Wang; Zheng Li; Lei M Li
Journal:  Bioinformatics       Date:  2018-06-15       Impact factor: 6.937

3.  HaploMerger: reconstructing allelic relationships for polymorphic diploid genome assemblies.

Authors:  Shengfeng Huang; Zelin Chen; Guangrui Huang; Ting Yu; Ping Yang; Jie Li; Yonggui Fu; Shaochun Yuan; Shangwu Chen; Anlong Xu
Journal:  Genome Res       Date:  2012-05-03       Impact factor: 9.043

4.  SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information.

Authors:  Marten Boetzer; Walter Pirovano
Journal:  BMC Bioinformatics       Date:  2014-06-20       Impact factor: 3.169

5.  BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics.

Authors:  Robert M Waterhouse; Mathieu Seppey; Felipe A Simão; Mosè Manni; Panagiotis Ioannidis; Guennadi Klioutchnikov; Evgenia V Kriventseva; Evgeny M Zdobnov
Journal:  Mol Biol Evol       Date:  2018-03-01       Impact factor: 16.240

6.  GapFiller: a de novo assembly approach to fill the gap within paired reads.

Authors:  Francesca Nadalin; Francesco Vezzi; Alberto Policriti
Journal:  BMC Bioinformatics       Date:  2012-09-07       Impact factor: 3.169

7.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

8.  NextClip: an analysis and read preparation tool for Nextera Long Mate Pair libraries.

Authors:  Richard M Leggett; Bernardo J Clavijo; Leah Clissold; Matthew D Clark; Mario Caccamo
Journal:  Bioinformatics       Date:  2013-12-02       Impact factor: 6.937

  8 in total
  1 in total

1.  Comparison of Two Aspergillus oryzae Genomes From Different Clades Reveals Independent Evolution of Alpha-Amylase Duplication, Variation in Secondary Metabolism Genes, and Differences in Primary Metabolism.

Authors:  Katherine Chacón-Vargas; Colin O McCarthy; Dasol Choi; Long Wang; Jae-Hyuk Yu; John G Gibbons
Journal:  Front Microbiol       Date:  2021-07-13       Impact factor: 5.640

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.