Literature DB >> 30701245

Complete Genome Sequence of Mycolicibacterium hassiacum DSM 44199.

Mercedes Sánchez1, Alba Blesa2, Eva Sacristán-Horcajada1, José Berenguer1.   

Abstract

Mycolicibacterium hassiacum is the most thermophilic of all the mycobacteria. A partial sequence based on Illumina technology of around 5 Mbp was published in 2012. Here, we report the 5,269,097-bp complete genome sequence assembled into a single circular chromosome.

Entities:  

Year:  2019        PMID: 30701245      PMCID: PMC6346194          DOI: 10.1128/MRA.01522-18

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Mycolicibacterium hassiacum (1), described in 1997 as Mycobacterium hassiacum (2), is characterized by its ability to grow at temperatures up to 65°C (2). Its fast growth and cell yield and the conservation and thermostability of most genus-specific enzymes (3, 4) make this strain an attractive laboratory mycobacterial model. Also, its isolation from clinical samples supports the conservation of virulence traits of its pathogenic relatives (5). Mycolicibacterium hassiacum DSM 44199 was donated by N. Empadinhas, whose group carried out a partial sequencing of the strain (6). In that work, the sequence was divided into 169 contigs, with an estimated genome size of 5 Mbp. Here, we present the whole-genome sequence obtained by PacBio technology, with further refinement using original Illumina reads. The strain was grown at 50°C under stirring (200 rpm) in Middlebrook 7H9 broth (BD Difco) with 0.15% (vol/vol) Tween 80 (Sigma-Aldrich) and 0.2% (vol/vol) glycerol as a carbon source. The cells grown up to an optical density at 600 nm (OD600) of 0.8 were harvested and washed by centrifugation, and the total DNA was isolated by a combination of mechanical breaking with glass beads (Sigma-Aldrich) and phenol-chloroform extraction (7). Twenty-kilobase DNA fragments were selected using BluePippin equipment (Sage Science), and a library was prepared according to Pacific Biosciences protocols. The Pacific Biosciences RS II equipment with P6-C4 chemistry (polymerase attachment by binding P6 kit) was used, with a recording time of 360 min. A total of 69,152 reads, with an average length of 17,024 bp and a mean coverage of 223-fold, were obtained. The HGAP version 3 software (SMRT Analysis software version 2.3.0; Pacific Biosciences) was used with its default settings for the assembly. This produced a single linear contig of 5,282,087 bp. Its circularization was carried out using the Minimus2 software and further corrected with the RS_Resequencing.1 software (SMRT Analysis version 2.3.0). A single circular contig of 5,268,611 bp was obtained after this procedure. The PROKKA software version 1.12 (8) was employed on this circular chromosome for automatic coding sequence (CDS) annotation, with parameters set for gene code 11 and using Mycobacterium genus-specific databases for assignment of putative function. A manual inspection of the annotation obtained revealed that many conserved proteins within the Mycobacterium genus appeared to be truncated due to the inclusion of an additional base in some homopolymers of the assembled genome, including proteins essential for viability, such as the β′-subunit of the RNA polymerase, the chromosome partition protein Smc, the FtsK ATPase involved in DNA segregation, and the RecO protein involved in DNA repair. Due to this, the PacBio-based sequence was further refined by using the Illumina reads available online (6) using the PacBio-utilities and Pilon (version 1.22) programs (9), configured to fix only the indels detected. This allowed us to correct 483 positions in the sequence assembled, corresponding to homopolymers for which PacBio reads included an additional base. Once corrected, the initially truncated essential genes were translated to proteins of the size and sequence homology expected for their homologues. The final genome of 5,269,097 bp encodes 5,333 putative proteins, 53 tRNAs, and 2 rRNA clusters. The mean GC content was 69.29% (34.55% Cs and 34.74% Gs), with a coding density of 91.5%. The KEGG Automatic Annotation Server and its internal BlastKOALA annotation tool (10) led to the identification of 236 pathways, of which 61 contained more than 10 genes, including 20 genes putatively involved in the metabolism of steroids (KEGG steroid degradation pathway 00984). As this bacterium metabolizes different kinds of sterols from the environment, the identification of enzymes of this pathway constitutes a first step toward the isolation of specific mutants for relevant biotechnological bioconversions involving these molecules.

Data availability.

The raw reads and the annotated assembly of the whole genome of Mycolicibacterium hassiacum have been deposited in NCBI under the accession number GCA_900603025.
  8 in total

Review 1.  Genetic systems for mycobacteria.

Authors:  W R Jacobs; G V Kalpana; J D Cirillo; L Pascopella; S B Snapper; R A Udani; W Jones; R G Barletta; B R Bloom
Journal:  Methods Enzymol       Date:  1991       Impact factor: 1.600

2.  Prokka: rapid prokaryotic genome annotation.

Authors:  Torsten Seemann
Journal:  Bioinformatics       Date:  2014-03-18       Impact factor: 6.937

3.  Characterization of an isolate belonging to the newly described species Mycobacterium hassiacum.

Authors:  E Tortoli; U Reischl; G Besozzi; S Emler
Journal:  Diagn Microbiol Infect Dis       Date:  1998-03       Impact factor: 2.803

4.  Phylogenomics and Comparative Genomic Studies Robustly Support Division of the Genus Mycobacterium into an Emended Genus Mycobacterium and Four Novel Genera.

Authors:  Radhey S Gupta; Brian Lo; Jeen Son
Journal:  Front Microbiol       Date:  2018-02-13       Impact factor: 5.640

5.  Mycobacterium hassiacum sp. nov., a new rapidly growing thermophilic mycobacterium.

Authors:  K H Schröder; L Naumann; R M Kroppenstedt; U Reischl
Journal:  Int J Syst Bacteriol       Date:  1997-01

6.  Genome sequence of Mycobacterium hassiacum DSM 44199, a rare source of heat-stable mycobacterial proteins.

Authors:  Igor Tiago; Ana Maranha; Vitor Mendes; Susana Alarico; Patrick J Moynihan; Anthony J Clarke; Sandra Macedo-Ribeiro; Pedro J B Pereira; Nuno Empadinhas
Journal:  J Bacteriol       Date:  2012-12       Impact factor: 3.490

Review 7.  BlastKOALA and GhostKOALA: KEGG Tools for Functional Characterization of Genome and Metagenome Sequences.

Authors:  Minoru Kanehisa; Yoko Sato; Kanae Morishima
Journal:  J Mol Biol       Date:  2015-11-14       Impact factor: 5.469

8.  Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.

Authors:  Bruce J Walker; Thomas Abeel; Terrance Shea; Margaret Priest; Amr Abouelliel; Sharadha Sakthikumar; Christina A Cuomo; Qiandong Zeng; Jennifer Wortman; Sarah K Young; Ashlee M Earl
Journal:  PLoS One       Date:  2014-11-19       Impact factor: 3.240

  8 in total
  2 in total

Review 1.  Nitrate Respiration in Thermus thermophilus NAR1: from Horizontal Gene Transfer to Internal Evolution.

Authors:  Mercedes Sánchez-Costa; Alba Blesa; José Berenguer
Journal:  Genes (Basel)       Date:  2020-11-04       Impact factor: 4.096

2.  ARAMIS: From systematic errors of NGS long reads to accurate assemblies.

Authors:  E Sacristán-Horcajada; S González-de la Fuente; R Peiró-Pastor; F Carrasco-Ramiro; R Amils; J M Requena; J Berenguer; B Aguado
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.