Literature DB >> 28074122

High quality draft genome of Nakamurella lactea type strain, a rock actinobacterium, and emended description of Nakamurella lactea.

Imen Nouioui1, Markus Göker2, Lorena Carro1, Maria Del Carmen Montero-Calasanz1, Manfred Rohde3, Tanja Woyke4, Nikos C Kyrpides5, Hans-Peter Klenk1.   

Abstract

Nakamurella lactea DLS-10T, isolated from rock in Korea, is one of the four type strains of the genus Nakamurella. In this study, we describe the high quality draft genome of N. lactea DLS-10T and its annotation. A summary of phenotypic data collected from previously published studies was also included. The genome of strain DLS-10T presents a size of 5.82 Mpb, 5100 protein coding genes, and a C + G content of 68.9%. Based on the genome analysis, emended description of N. lactea in terms of G + C content was also proposed.

Entities:  

Keywords:  Bioactive natural product; Frankineae; Nakamurellaceae; Next generation sequencing; Rare actinobacteria

Year:  2017        PMID: 28074122      PMCID: PMC5217420          DOI: 10.1186/s40793-016-0216-0

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The genus , belong to the order Nakamurellales [1] and is one of the rare genera in the class [2]. The genus is the sole and type genus of the family , which replaced the family [2] in 2004 [3]. The genus and family names were assigned in honour of the microbiologist Kazonuri Nakamura [4]. Only four species with validly published names, [3, 5], [6, 7], [6-8], and [6, 7, 9], have been described, and only the genome of has been published [10]. was originally described as [9], which was the type species of one of the three genera comprising in the family . Then, in the light of the 16S rRNA gene and rpoB gene sequences similarities and chemotaxonomic features [6], the species was reclassified into the genus . is represented by the type strain DLS-10T (= DSM 19367 T = JCM 16024 = KCTC 19285). The availability of the genome of one more species in the genus will provide vital baseline information for better understanding of the ecology of these rare actinobacteria and their potential as source of bioactive natural products. In the present study, we summarise the phenotypic, physiological and chemotaxonomic, features of DLS-10T together with the genomic data.

Organism information

Classification and features

DLS-10T was isolated from a rock collected on the parasitic volcano Darangshi Oreum at 300 m above sea level in Jeju island, Republic of Korea (latitude 33.51, longitude 126.52) [9]. It has been shown by Lee et al. [9] and Kim et al. [4, 6] that its cells are aerobic, non-motile, non-spore and non-mycelium forming short rods with 0.4–0.7 μm and 0.9–1.0 μm of cell diameter and length, respectively (Fig. 1), producing cream-coloured colonies on TSA medium. A summary of the classification and general features of strain DLS-10T is presented in the Table 1. Additional phenotypic features can be found in Lee et al. and Kim et al. [6, 9].
Fig. 1

Scanning electron micrograph of N. lactea DLS-10T. The bacterium was grown on DSM medium 65 for 3 days at 28∘C

Table 1

Classification and general features of Nakamurella lactea strain DLS-10T, according to the MIGS recommendations [36] as developed by [22]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [39]
Phylum Actinobacteria TAS [40]
Class Actinobacteria TAS [2]
Order Nakamurellales TAS [1]
Family Nakamurellaceae TAS [41]
Genus Nakamurella TAS [3, 41]
Species Nakamurella lactea Type strain DLS-10TAS [6, 9]
Gram stainPositiveTAS [6, 9]
Cell shapeRodTAS [6, 9]
Motilitynon-motileTAS [6, 9]
SporulationNon-sporulatingNAS [6, 9]
Temperature range4–37 °CTAS [6, 9]
Optimum temperature25 °CTAS [6, 9]
pH range5.1–9.1TAS [6, 9]
pH Optimum6.0–7.0
Carbon sourceL-Arabinose, myo-inositol and methyl α-D-mannoside, D-cellobiose, D-fructose, D-glucose, D-galactose, lactose, D-maltose, D-mannitol, D-mannose, L-rhamnose, salicin, sucrose and D-trehalose, D- turanose TAS [6, 9]
MIGS-6HabitatRockTAS [9]
MIGS-6.3SalinityUp to 3% NaClTAS [6, 9]
MIGS-22Oxygen requirementAerobicTAS [9]
MIGS-15Biotic relationshipfree-livingTAS [9]
MIGS-14Pathogenicitynon-pathogenNAS
MIGS-4Geographic locationKoreaTAS [9]
MIGS-5Sample collectionNot reportedTAS []
MIGS-4.1Latitude33.51TAS [9]
MIGS-4.2Longitude126.52TAS [9]
MIGS-4.4Altitude300 mTAS [9]

aEvidence codes are from of the Gene Ontology project [42]. TAS traceable author statement (i.e., a direct report exists in the literature)

Scanning electron micrograph of N. lactea DLS-10T. The bacterium was grown on DSM medium 65 for 3 days at 28∘C Classification and general features of Nakamurella lactea strain DLS-10T, according to the MIGS recommendations [36] as developed by [22] aEvidence codes are from of the Gene Ontology project [42]. TAS traceable author statement (i.e., a direct report exists in the literature) Only four species isolated from soil ( and ), rock () and sludge (N. mutipartita), respectively, are currently classified in the genus. Due to this limited number of the characterised species, the ecological diversity as well as the biotechnological potential of the members of the genus remain to be studied in depth. Phylogenies based on 16S rRNA gene sequences included in this manuscript were performed using the GGDC web server [11] implementation of the DSMZ phylogenomics pipeline [12]. The multiple alignment was created with MUSCLE [13] and maximum likelihood (ML) and maximum parsimony (MP) trees were inferred from it with RAxML [14] and TNT [15], respectively. For ML, rapid bootstrapping in conjunction with the autoMRE bootstopping criterion [16] and subsequent search for the best tree was used; for MP, 1000 bootstrapping replicates were used in conjunction with tree-bisection-and-reconnection branch swapping and ten random sequence addition replicates. This analysis shows the family [4] as the sister group of the families , , and . The monophyly of the genus was supported by (close to) maximum bootstrap values under ML and MP (Fig. 2).
Fig. 2

Maximum likelihood phylogenetic tree of N. lactea DLS-10T and related type strains within the related families constructed under the GTR + GAMMA model and rooted using Actinomyces bovis NCTC 11535T as outgroup. The branches are scaled in terms of the expected number of substitutions per site (see size bar). Support values from maximum-likelihood (left) and maximum-parsimony (right) bootstrapping are shown above the branches if equal to or larger than 60%

Maximum likelihood phylogenetic tree of N. lactea DLS-10T and related type strains within the related families constructed under the GTR + GAMMA model and rooted using Actinomyces bovis NCTC 11535T as outgroup. The branches are scaled in terms of the expected number of substitutions per site (see size bar). Support values from maximum-likelihood (left) and maximum-parsimony (right) bootstrapping are shown above the branches if equal to or larger than 60%

Chemotaxonomic data (optional, Heading 3)

Glucose, mannose, ribose and rhamnose were detected as the whole-cell sugars [5]. The pattern of polar lipid contains diphosphatidylglycerol, phosphatidylethanolamine, phosphatidylinositol, aminophospholipid, five unidentified phosphoglycolipids, and one unidentified glycolipid [6]. The diagnostic peptidoglican is the meso-diaminopimelic acid. The major fatty acids are anteiso-C15:0, C16:0, iso-C16:0, and anteiso-C17:0 [9]. MK-8(H4) and MK-9(H4) are the predominant menaquinones but MK-7(H4) was also revealed in a low amount [6].

Genome sequencing information

Genome project history

DLS-10T (DSM 19367 T) was selected for sequencing on the basis of its phylogenetic position [17, 18], and is part of Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project [19], a follow-up of the Genomic Encyclopedia of Bacteria and Archaea pilot project [20], which aims at increasing the sequencing coverage of key reference microbial genomes and to generate a large genomic basis for the discovery of genes encoding novel enzymes [21]. KMG-I is the first of the production phases of the “Genomic Encyclopedia of Bacteria and Archaea: sequencing a myriad of type strains” initiative [22] and a Genomic Standards Consortium project [23]. The project and the genome sequence are deposited in the Genome OnLine Database [24] and Genbank under the accession number AUFT00000000.1. In Table 2, we summarize genome sequence project.
Table 2

Project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityLevel 1: Standard Draft
MIGS-28Libraries usedNOHX
MIGS 29Sequencing platformsIllumina, Illumina HiSeq 2000
MIGS 31.2Fold coverageNA
MIGS 30AssemblersAllpaths/Velvet
MIGS 32Gene calling methodProdigal 2.5
Locus TagK340
Genbank IDAUFT00000000.1
GenBank Date of Release2013-06-03
GOLD IDGi11889
BIOPROJECTPRJNA195807
MIGS 13Source Material IdentifierDSM 19367T
Project relevanceGEBA-KMG, Tree of Life
Project information

Growth conditions and genomic DNA preparation

A DLS-10T culture was prepared in DSM medium 65 [25] at 28 °C. Genomic DNA was extracted using MasterPure™ Gram Positive DNA Purification Kit (Epicentre MGP04100) following the standard protocol provided by the manufacturer but modified by the incubation on ice overnight on a shaker, the use of additional 1 μl proteinase K, and the addition of 7.5 units achromopeptidase, 7.5 μg/μl lysostaphine, 1050.0 units lysozyme, and 7.5 units mutanolysine. DNA is available from DSMZ through the DNA Bank Network [26].

Genome sequencing and assembly

The draft genome of DLS-10T was generated at the DOE Joint genome Institute (JGI) using the Illumina technology [27]. An Illumina standard shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform, which generated 13,910,936 reads totalling 2,086.6 Mb. All general aspects of library construction and sequencing performed at the JGI can be found at http://www.jgi.doe.gov. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artefacts (unpublished results). Following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet (version 1.1.04) [28], (2) 1–3 kb simulated paired end reads were created from Velvet contigs using wgsim (https://github.com/lh3/wgsim), (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r42328) [29]. Parameters for assembly steps were: 1) Velvet (velveth:63 –shortPaired and velvetg: −very clean yes –exportFiltered yes –min contig lgth 500 –scaffolding no–cov cutoff 10) 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs:PHRED 64 = 1 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpathsLG: THREADS = 8 RUN = std shredpairs TARGETS = standard VAPI WARN ONLY = True OVERWRITE = True). The final draft assembly contained 31 contigs in 27 scaffolds. The total size of the genome is 5.8 Mb and the final assembly is based on 712.8 Mb of Illumina data, which provides an average 122.5X coverage of the genome.

Genome annotation

The complete genome sequence was annotated using the JGI Prokaryotic Automatic Annotation Pipeline [30] with additional manual review using the Integrated Microbial Genomes - Expert Review (IMG-ER) platform [31]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) non redundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [32] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [33]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [34]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes (IMG) platform [35, 36] developed by the Joint Genome Institute, Walnut Creek, CA, USA [37].

Genome properties

The 5820860 bp of genome size of DLS-10T presents 5100 protein-coding genes, 3 rRNA genes (5S, 16S, 23S RNA) and 59 tRNA genes. A G + C content of 68.9% was calculated. More genome details are listed in Tables 3 and 4.
Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)5820860100.00
DNA coding (bp)533224591.61
DNA G + C (bp)401179068.92
DNA scaffolds27100.00
Total genes5169100.00
Protein coding genes510098.67
RNA genes691.33
Pseudo genes231
Genes in internal clusters58811.38
Genes with function prediction404878.31
Genes assigned to COGs332164.25
Genes with Pfam domains421181.47
Genes with signal peptides4328.36
Genes with transmembrane helices120623.33
CRISPR repeats1
Table 4

Number of genes associated with general COG functional categories

CodeValue%Description
J1985.07Translation, ribosomal structure and biogenesis
A10.03RNA processing and modification
K39210.04Transcription
L1223.12Replication, recombination and repair
B10.03Chromatin structure and dynamics
D250.64Cell cycle control, Cell division, chromosome partitioning
V942.41Defence mechanisms
T1373.51Signal transduction mechanisms
M1443.69Cell wall/membrane biogenesis
N100.26Cell motility
U230.59Intracellular trafficking and secretion
O1213.1Posttranslational modification, protein turnover, chaperones
C2105.38Energy production and conversion
G64812.31Carbohydrate transport and metabolism
E45911.75Amino acid transport and metabolism
F912.33Nucleotide transport and metabolism
H2195.61Coenzyme transport and metabolism
I2556.53Lipid transport and metabolism
P2446.25Inorganic ion transport and metabolism
Q1543.94Secondary metabolites biosynthesis, transport and catabolism
R44311.34General function prediction only
S15844.05Function unknown
-184835.75Not in COGs
Genome statistics Number of genes associated with general COG functional categories

Conclusion

The genome of will be used to study, for the first time, its potential as bioactive natural products source and the correlation between the rare soil bacteria and their habitat. According to [38], the within-species deviation in genomic G + C content is at most 1%. The range of 70.4–74.3% given in by Kim et al. [6] is thus too broad and too deviating from the 68.9% calculated in the genome sequence, much like the value 74.3% provided by Lee et al. [9]. This calls for an emendation of the species description [38].

Emended description of (Lee et al. [9]) Kim et al. [6]

The properties are as given in the species description by Kim et al. [6] with the following emendation. Based on the genomic data the G + C content is 68.9%.
  33 in total

1.  Humicoccus flavidus gen. nov., sp. nov., isolated from soil.

Authors:  Jung-Hoon Yoon; So-Jung Kang; Seo-Youn Jung; Tae-Kwang Oh
Journal:  Int J Syst Evol Microbiol       Date:  2007-01       Impact factor: 2.747

2.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

3.  Saxeibacter lacteus gen. nov., sp. nov., an actinobacterium isolated from rock.

Authors:  Soon Dong Lee; Sang Kyu Park; Yoe-Won Yun; Dong Wan Lee
Journal:  Int J Syst Evol Microbiol       Date:  2008-04       Impact factor: 2.747

4.  Phylogeny of the class Actinobacteria revisited in the light of complete genomes. The orders 'Frankiales' and Micrococcales should be split into coherent entities: proposal of Frankiales ord. nov., Geodermatophilales ord. nov., Acidothermales ord. nov. and Nakamurellales ord. nov.

Authors:  Arnab Sen; Vincent Daubin; Danis Abrouk; Isaac Gifford; Alison M Berry; Philippe Normand
Journal:  Int J Syst Evol Microbiol       Date:  2014-08-28       Impact factor: 2.747

5.  The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

Authors:  T B K Reddy; Alex D Thomas; Dimitri Stamatis; Jon Bertsch; Michelle Isbandi; Jakob Jansson; Jyothi Mallajosyula; Ioanna Pagani; Elizabeth A Lobos; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

6.  Nakamurella panacisegetis sp. nov. and proposal for reclassification of Humicoccus flavidus Yoon et al., 2007 and Saxeibacter lacteus Lee et al., 2008 as Nakamurella flavida comb. nov. and Nakamurella lactea comb. nov.

Authors:  Kwang Kyu Kim; Keun Chul Lee; Jung-Sook Lee
Journal:  Syst Appl Microbiol       Date:  2012-06-15       Impact factor: 4.022

7.  Proposal of Nakamurella gen. nov. as a substitute for the bacterial genus Microsphaera Yoshimi et al. 1996 and Nakamurellaceae fam. nov. as a substitute for the illegitimate bacterial family Microsphaeraceae Rainey et al. 1997.

Authors:  Tian-Shen Tao; Ying-Yu Yue; Wen-Xin Chen; Wen-Feng Chen
Journal:  Int J Syst Evol Microbiol       Date:  2004-05       Impact factor: 2.747

8.  The Genomic Standards Consortium.

Authors:  Dawn Field; Linda Amaral-Zettler; Guy Cochrane; James R Cole; Peter Dawyndt; George M Garrity; Jack Gilbert; Frank Oliver Glöckner; Lynette Hirschman; Ilene Karsch-Mizrachi; Hans-Peter Klenk; Rob Knight; Renzo Kottmann; Nikos Kyrpides; Folker Meyer; Inigo San Gil; Susanna-Assunta Sansone; Lynn M Schriml; Peter Sterk; Tatiana Tatusova; David W Ussery; Owen White; John Wooley
Journal:  PLoS Biol       Date:  2011-06-21       Impact factor: 8.029

9.  Genomic Encyclopedia of Type Strains, Phase I: The one thousand microbial genomes (KMG-I) project.

Authors:  Nikos C Kyrpides; Tanja Woyke; Jonathan A Eisen; George Garrity; Timothy G Lilburn; Brian J Beck; William B Whitman; Phil Hugenholtz; Hans-Peter Klenk
Journal:  Stand Genomic Sci       Date:  2013-12-17

10.  Complete genome sequence of DSM 30083(T), the type strain (U5/41(T)) of Escherichia coli, and a proposal for delineating subspecies in microbial taxonomy.

Authors:  Jan P Meier-Kolthoff; Richard L Hahnke; Jörn Petersen; Carmen Scheuner; Victoria Michael; Anne Fiebig; Christine Rohde; Manfred Rohde; Berthold Fartmann; Lynne A Goodwin; Olga Chertkov; Tbk Reddy; Amrita Pati; Natalia N Ivanova; Victor Markowitz; Nikos C Kyrpides; Tanja Woyke; Markus Göker; Hans-Peter Klenk
Journal:  Stand Genomic Sci       Date:  2014-12-08
View more
  3 in total

1.  Sugarcane cultivars manipulate rhizosphere bacterial communities' structure and composition of agriculturally important keystone taxa.

Authors:  Muhammad Tayyab; Waqar Islam; Ali Noman; Ziqin Pang; Shiyan Li; Sheng Lin; Lin Wenxiong; Zhang Hua
Journal:  3 Biotech       Date:  2022-01-03       Impact factor: 2.406

2.  Genome-Scale Data Call for a Taxonomic Rearrangement of Geodermatophilaceae.

Authors:  Maria Del Carmen Montero-Calasanz; Jan P Meier-Kolthoff; Dao-Feng Zhang; Adnan Yaramis; Manfred Rohde; Tanja Woyke; Nikos C Kyrpides; Peter Schumann; Wen-Jun Li; Markus Göker
Journal:  Front Microbiol       Date:  2017-12-19       Impact factor: 5.640

Review 3.  The Ecology of Subaerial Biofilms in Dry and Inhospitable Terrestrial Environments.

Authors:  Federica Villa; Francesca Cappitelli
Journal:  Microorganisms       Date:  2019-09-23
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.