Literature DB >> 26021914

Draft Genome Sequence of the Archiascomycetous Yeast Saitoella complicata.

Kenta Yamauchi1, Shinji Kondo2, Makiko Hamamoto3, Yurika Takahashi1, Yoshitoshi Ogura4, Tetsuya Hayashi4, Hiromi Nishida5.   

Abstract

The draft genome sequence of the archiasomycetous yeast Saitoella complicata was determined. The assembly of newly and previously sequenced data sets resulted in 104 contigs (total of 14.1 Mbp; N 50, 239 kbp). On the newly assembled genome, a total of 6,933 protein-coding sequences (7,119 transcripts, including alternative splicing forms) were identified.
Copyright © 2015 Yamauchi et al.

Entities:  

Year:  2015        PMID: 26021914      PMCID: PMC4447899          DOI: 10.1128/genomeA.00220-15

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

The subphylum Taphrinomycotina (Archiascomycetes) is the earliest ascomycetous lineage that diverged before the separation of the subphyla Pezizomycotina (Euascomycetes, filamentous ascomycetes) and Saccharomycotina (Hemiascomycetes, budding ascomycetous yeasts) (1, 2). The anamorphic and saprobic budding yeast Saitoella complicata is a member of the Taphrinomycotina, which was isolated from Himalayan soil (3). Interestingly, S. complicata shares some characteristics with both ascomycetous and basidiomycetous yeasts (3, 4). We previously attempted to assemble the genome sequence of S. complicata using 454 (Roche) sequences (5) and Illumina paired-end read pairs (6). Although these previous assemblies were of a large number of small contigs, at 7,981 contigs (13.0 Mbp) (5) and 1,800 contigs (14.2 Mbp) (6), respectively, we found that the amino acid sequences of protein-coding genes identified on the contigs showed the highest similarity to proteins of Pezizomycotina (5, 6). To elucidate the detailed characteristics of the S. complicata genomic DNA sequences, we have refined the genome assembly with additional sequencing of mate-paired DNA libraries of this species. We generated a total of 11.4 million paired-end read pairs (700-bp insert and 100 bp in length) and a total of 23.7 million mate-paired read pairs (6.2 million 3-kb-, 6.2 million 5-kb-, 5.3 million 10-kb-, and 6.0 million 15-kb-long-insert read pairs), respectively, using Illumina HiSeq and MiSeq sequencers. The read pairs were dereplicated by Fulcrum (7) and assembled using the SPAdes assembler (8). The assembly of the dereplicated read pairs by using 21 to 89 bp for the k-mer size option yielded a set of 104 contigs of ≥1 kb, whose total size and N50 are 14.1 Mb and 239 kb, respectively. Using Augustus (9), a gene prediction software based on the alignment of expressed sequences to the genome, we have determined coding sequences (CDSs) of the genes expressed on the assembled genome of Saitoella according to the gene model of Aspergillus nidulans, which is thought to have some taxonomic proximity to Saitoella. Based on the exon coordinates mapped by a total of 89.3 million RNA sequencing (RNA-seq) paired-end read pairs (100 bp in length) uniquely mapped to the genome by BLAT (10), Augustus identified 6,933 protein-coding genes (7,119 transcripts, including alternative splicing forms) on the Saitoella genome. All this computational work was done on the NIG Supercomputer system (11).

Nucleotide sequence accession numbers.

The DNA sequences have been deposited in DDBJ under the accession numbers BACD03000001 to BACD03000104.
  8 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

3.  Fulcrum: condensing redundant reads from high-throughput sequencing studies.

Authors:  Matthew S Burriesci; Erik M Lehnert; John R Pringle
Journal:  Bioinformatics       Date:  2012-03-13       Impact factor: 6.937

4.  Using native and syntenically mapped cDNA alignments to improve de novo gene finding.

Authors:  Mario Stanke; Mark Diekhans; Robert Baertsch; David Haussler
Journal:  Bioinformatics       Date:  2008-01-24       Impact factor: 6.937

5.  Draft genome sequencing of the enigmatic yeast Saitoella complicata.

Authors:  Hiromi Nishida; Makiko Hamamoto; Junta Sugiyama
Journal:  J Gen Appl Microbiol       Date:  2011       Impact factor: 1.452

6.  The early diverging ascomycetous budding yeast Saitoella complicata has three histone deacetylases belonging to the Clr6, Hos2, and Rpd3 lineages.

Authors:  Hiromi Nishida; Takashi Matsumoto; Shinji Kondo; Makiko Hamamoto; Hirofumi Yoshikawa
Journal:  J Gen Appl Microbiol       Date:  2014       Impact factor: 1.452

7.  Phylogenomic analyses support the monophyly of Taphrinomycotina, including Schizosaccharomyces fission yeasts.

Authors:  Yu Liu; Jessica W Leigh; Henner Brinkmann; Melanie T Cushion; Naiara Rodriguez-Ezpeleta; Hervé Philippe; B Franz Lang
Journal:  Mol Biol Evol       Date:  2008-10-14       Impact factor: 16.240

8.  DDBJ new system and service refactoring.

Authors:  Osamu Ogasawara; Jun Mashima; Yuichi Kodama; Eli Kaminuma; Yasukazu Nakamura; Kousaku Okubo; Toshihisa Takagi
Journal:  Nucleic Acids Res       Date:  2012-11-24       Impact factor: 16.971

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.