Literature DB >> 34989601

Long-Read Genome Assembly of Saccharomyces uvarum Strain CBS 7001.

Jingxuan Chen1, David J Garfinkel2,3, Casey M Bergman1,2.   

Abstract

Here, we report a long-read genome assembly for Saccharomyces uvarum strain CBS 7001 based on PacBio whole-genome shotgun sequence data. Our assembly provides an improved reference genome for an important yeast in the Saccharomyces sensu stricto clade.

Entities:  

Year:  2022        PMID: 34989601      PMCID: PMC8759399          DOI: 10.1128/mra.00972-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Saccharomyces uvarum is a globally distributed yeast species that can be commonly extracted from low-temperature fermentations and other natural substrates (1–6). Low-coverage Sanger sequence assemblies were initially reported for an S. uvarum reference strain called CBS 7001 (also called MCYC 623) (7, 8). Later, Scannell et al. (9) generated a chromosome-level scaffolded assembly for CBS 7001 that combined Sanger and Illumina short-read sequences. All previous CBS 7001 assemblies contain a large number of gaps (Table 1), limiting the comprehensive analysis of transposable elements (TEs) in this species (9, 10). Here, we generated a long-read genome assembly for strain CBS 7001 to improve the reference genome and support analysis of TEs in S. uvarum.
TABLE 1

Statistics for de novo assemblies of CBS 7001

AssemblyNCBI accession no.No. of contigsNo. of scaffoldsTotal length (bp)Contig N50 (bp)Complete BUSCOs (%)No. of full-length Tsu4 copies
Kellis et al. (7) GCA_000166995.1 1,0981,09811,475,8904,79588.30
Cliften et al. (8) GCA_000167035.1 58858611,862,88919,03294.23
Scannell et al. (9)NAb3631611,502,91318,17297.10
This studyCM034478 to CM034493171611,968,998836,61897.39

The contigs within a scaffold were defined by 10 consecutive Ns.

NA, not applicable. The CBS 7001 genome assembly from Scannell et al. (9) is available at http://www.saccharomycessensustricto.org/.

Statistics for de novo assemblies of CBS 7001 The contigs within a scaffold were defined by 10 consecutive Ns. NA, not applicable. The CBS 7001 genome assembly from Scannell et al. (9) is available at http://www.saccharomycessensustricto.org/. To prepare DNA for PacBio sequencing, a single colony of strain CBS 7001 (NCYC 2909) was inoculated in 7 mL yeast extract-peptone-dextrose (YPD) liquid broth and cultured for ∼24 h at 30°C. DNA was isolated using the Wizard genomic DNA purification kit (Promega), and a PacBio library was prepared using the SMRTbell Express template prep kit (Pacific Biosciences), following the >15-kb size-selection protocol that includes Covaris g-TUBE shearing. PacBio sequencing was performed using the Sequel II instrument (sequencing kit v2.1), yielding 655,409 reads (average length, 8,490 bp). HGAP4 (smrtlink-release_5.1.0.26412) (11) was used for de novo assembly, and the resulting PacBio contigs were polished five times using Pilon v1.24 (12) with 33,377,822 51-bp Illumina Genome Analyzer (GA) II reads (SRA accession number SRR173084). The polished contigs were scaffolded using RagTag v2.0.1 (13) with CBS 7001 ultrascaffolds from reference 9 that corrected the name swap for ChrX and ChrXII noted in reference 5. The assembly statistics were calculated using stats.sh from BBMap v38.90 (14) and BUSCO v5.0.0 (saccharomycetes_odb10) (15). TEs were annotated using a RepeatMasker-based pipeline described in reference 16. Genome alignment was performed using MUMmer v3.23 (17). The PacBio reads were aligned to the genome assemblies using minimap2 v2.17 with the parameters “-ax map-pb” (18). Default software parameters were used except where otherwise noted. Our CBS 7001 assembly is 11,968,998 bp long (contig N50, 836,618 bp) with an overall GC content of 40.18% (Table 1). Of the chromosomes, 15/16 are assembled in scaffolds with one contig each; ChrXII is assembled in one scaffold with one gap at the rDNA locus. In total, 97.3% of Saccharomycetes benchmarking universal single-copy orthologs (BUSCOs) are present and single copy. Our assembly was colinear with all scaffolds in the Scannell et al. (9) assembly except for one discrepancy at a sequencing gap in the previous assembly (ChrIII: 219401 to 219500). PacBio reads aligned to the previous assembly did not span this gap but did align continuously through the corresponding region in our assembly. Our assembly corrects this misassembly, the ChrX ↔ ChrXII name swap (5), and closes over 340 sequencing gaps in the previous assembly. Our assembly provides confirmation that Tsu4 is the only TE with full-length copies in the CBS 7001 genome (19–22): we found nine full-length Tsu4 copies in our assembly, tripling previous estimates (8, 23) (Table 1).

Data availability.

The assembly produced here was deposited at NCBI under accession numbers CM034478 to CM034493. The PacBio data used to generate the assembly are available under SRA accession number PRJNA753102.
  22 in total

1.  Minimap2: pairwise alignment for nucleotide sequences.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2018-09-15       Impact factor: 6.937

2.  Genomic exploration of the hemiascomycetous yeasts: 5. Saccharomyces bayanus var. uvarum.

Authors:  E Bon; C Neuvéglise; S Casaregola; F Artiguenave; P Wincker; M Aigle; P Durrens
Journal:  FEBS Lett       Date:  2000-12-22       Impact factor: 4.124

3.  Saccharomyces uvarum, a distinct group within Saccharomyces sensu stricto.

Authors:  S Rainieri; C Zambonelli; J E Hallsworth; A Pulvirenti; P Giudici
Journal:  FEMS Microbiol Lett       Date:  1999-08-01       Impact factor: 2.742

4.  Saccharomyces eubayanus and Saccharomyces uvarum associated with the fermentation of Araucaria araucana seeds in Patagonia.

Authors:  M Eugenia Rodríguez; Laura Pérez-Través; Marcela P Sangorrín; Eladio Barrio; Christian A Lopes
Journal:  FEMS Yeast Res       Date:  2014-08-04       Impact factor: 2.796

5.  Microbe domestication and the identification of the wild genetic stock of lager-brewing yeast.

Authors:  Diego Libkind; Chris Todd Hittinger; Elisabete Valério; Carla Gonçalves; Jim Dover; Mark Johnston; Paula Gonçalves; José Paulo Sampaio
Journal:  Proc Natl Acad Sci U S A       Date:  2011-08-22       Impact factor: 11.205

6.  Saccharomyces uvarum, a proper species within Saccharomyces sensu stricto.

Authors:  A Pulvirenti; H Nguyen; C Caggia; P Giudici; S Rainieri; C Zambonelli
Journal:  FEMS Microbiol Lett       Date:  2000-11-15       Impact factor: 2.742

7.  A Gondwanan imprint on global diversity and domestication of wine and cider yeast Saccharomyces uvarum.

Authors:  Pedro Almeida; Carla Gonçalves; Sara Teixeira; Diego Libkind; Martin Bontrager; Isabelle Masneuf-Pomarède; Warren Albertin; Pascal Durrens; David James Sherman; Philippe Marullo; Chris Todd Hittinger; Paula Gonçalves; José Paulo Sampaio
Journal:  Nat Commun       Date:  2014-06-02       Impact factor: 14.919

8.  Horizontal transfer and proliferation of Tsu4 in Saccharomyces paradoxus.

Authors:  Casey M Bergman
Journal:  Mob DNA       Date:  2018-06-12

9.  Transposable Element Mobilization in Interspecific Yeast Hybrids.

Authors:  Caiti Smukowski Heil; Kira Patterson; Angela Shang-Mei Hickey; Erica Alcantara; Maitreya J Dunham
Journal:  Genome Biol Evol       Date:  2021-03-01       Impact factor: 3.416

10.  Evolution of Ty1 copy number control in yeast by horizontal transfer and recombination.

Authors:  Wioletta Czaja; Douda Bensasson; Hyo Won Ahn; David J Garfinkel; Casey M Bergman
Journal:  PLoS Genet       Date:  2020-02-21       Impact factor: 5.917

View more
  1 in total

1.  Chromosome-level assemblies from diverse clades reveal limited structural and gene content variation in the genome of Candida glabrata.

Authors:  Marina Marcet-Houben; María Alvarado; Ewa Ksiezopolska; Ester Saus; Piet W J de Groot; Toni Gabaldón
Journal:  BMC Biol       Date:  2022-10-08       Impact factor: 7.364

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.