| Literature DB >> 27084056 |
Lukas Sturmberger1, Thomas Chappell2, Martina Geier1, Florian Krainer3, Kasey J Day4, Ursa Vide3, Sara Trstenjak3, Anja Schiefer1, Toby Richardson5, Leah Soriaga5, Barbara Darnhofer6, Ruth Birner-Gruenberger6, Benjamin S Glick4, Ilya Tolstorukov7, James Cregg7, Knut Madden2, Anton Glieder8.
Abstract
Strains of the species Komagataella phaffii are the most frequently used "Pichia pastoris" strains employed for recombinant protein production as well as studies on peroxisome biogenesis, autophagy and secretory pathway analyses. Genome sequencing of several different P. pastoris strains has provided the foundation for understanding these cellular functions in recent genomics, transcriptomics and proteomics experiments. This experimentation has identified mistakes, gaps and incorrectly annotated open reading frames in the previously published draft genome sequences. Here, a refined reference genome is presented, generated with genome and transcriptome sequencing data from multiple P. pastoris strains. Twelve major sequence gaps from 20 to 6000 base pairs were closed and 5111 out of 5256 putative open reading frames were manually curated and confirmed by RNA-seq and published LC-MS/MS data, including the addition of new open reading frames (ORFs) and a reduction in the number of spliced genes from 797 to 571. One chromosomal fragment of 76kbp between two previous gaps on chromosome 1 and another 134kbp fragment at the end of chromosome 4, as well as several shorter fragments needed re-orientation. In total more than 500 positions in the genome have been corrected. This reference genome is presented with new chromosomal numbering, positioning ribosomal repeats at the distal ends of the four chromosomes, and includes predicted chromosomal centromeres as well as the sequence of two linear cytoplasmic plasmids of 13.1 and 9.5kbp found in some strains of P. pastoris.Entities:
Keywords: Centromere; Genome; Killer plasmid; P. pastoris; RNA-seq; Splicing
Mesh:
Substances:
Year: 2016 PMID: 27084056 PMCID: PMC5089815 DOI: 10.1016/j.jbiotec.2016.04.023
Source DB: PubMed Journal: J Biotechnol ISSN: 0168-1656 Impact factor: 3.307