Literature DB >> 26918090

Draft genome sequence of Sphingomonas paucimobilis strain LCT-SP1 isolated from the Shenzhou X spacecraft of China.

Lei Pan1, Hong Zhou2, Jia Li2, Bing Huang2, Jun Guo2, Xue-Lin Zhang2, Long-Cheng Gao3, Chou Xu2, Chang-Ting Liu2.   

Abstract

Sphingomonas paucimobilis strain LCT-SP1 is a glucose-nonfermenting Gram-negative, chemoheterotrophic, strictly aerobic bacterium. The major feature of strain LCT-SP1, isolated from the Chinese spacecraft Shenzhou X, together with the genome draft and annotation are described in this paper. The total size of strain LCT-SP1 is 4,302,226 bp with 3,864 protein-coding and 50 RNA genes. The information gained from its sequence is potentially relevant to the elucidation of microbially mediated corrosion of various materials.

Entities:  

Keywords:  Sphingomonas paucimobilis; corrosion; genome sequence

Year:  2016        PMID: 26918090      PMCID: PMC4766693          DOI: 10.1186/s40793-016-0136-z

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

strain LCT-SP1 is a glucose-nonfermenting Gram-negative, chemoheterotrophic, strictly aerobic bacterium [1]. LCT-SP1, based on 16S rRNA gene sequences, is most closely related to , which is isolated from water and soil. Several studies suggest that can degrade many compounds or materials, such as ferulic acid [2], lignin [3], and biphenyl [4]. LCT-SP1 was isolated from the condensate water in the Chinese spacecraft Shenzhou X. LCT-SP1 can corrode numerous materials including epoxy resin, ester polyurethane, and ethers polyurethane. Therefore, the strain may be a suitable model for examining the properties of genes involved in microbial corrosion of materials used in aerospace applications. This study mainly aims to describe the draft genome of strain LCT-SP1 together with the genomic sequencing and annotation, which may be helpful in investigating the possible mechanisms in the microbial corrosion of materials.

Organism information

Classification and features

A phylogenetic tree was constructed with MEGA 5 [5] along with the sequences of representative members of the genus using the maximum likelihood method based on 16S rRNA gene phylogeny (Fig. 1). Figure 1 shows that LCT-SP1 is most closely related to sp.DSM 30198 (HF558376), G1Bc9 (KF465966), SKJH-30 (AY749436), and G3Cc10 (KF465968), with a sequence similarity of 100 % based on BLAST analysis. In addition, considering that the ANI is an important index in terms of phylogenetic analysis [6], the ANIs between LCT-SP1 and NBRC 13935 were also calculated. The ANI result was 99.68 %, which is greater than 95 % (the species ANI cutoff value). Therefore, LCT-SP1 is asssumed to belongs to the species of .
Fig. 1

Phylogenetic tree highlighting the position of the Sphingomonas paucimobilis strain LCT-SP1 relative to selected Sphingomonas species using the Rhizobium leguminosarum ATCC 14480 as the outgroup. The strains and their corresponding GenBank accession numbers of 16S rRNA genes are indicated. Bar: 0.01 substitutions per nucleotide position

Phylogenetic tree highlighting the position of the Sphingomonas paucimobilis strain LCT-SP1 relative to selected Sphingomonas species using the Rhizobium leguminosarum ATCC 14480 as the outgroup. The strains and their corresponding GenBank accession numbers of 16S rRNA genes are indicated. Bar: 0.01 substitutions per nucleotide position The general information of LCT-SP1 is shown in Table 1. LCT-SP1 is an aerobic, Gram-negative, rod-shaped, glucose-nonfermenting, slowly motile, and non-sporulating bacterium (Fig. 2-b). The strain grew optimally in the following conditions: pH 7.2, 35 °C, and at low salinity (NaCl range 0–1.0 %). On aerobic LB agar, LCT-SP1 formed several small, yellow-pigmented, round colonies (Fig. 2-a). LCT-SP1 was able to use a range of carbon substrates including D-glucose, maltose, lactose, sucrose, fucose, malic acid, acetic acid, and Tween-40.
Table 1

Classification and general features of Sphingomonas paucimobilis strain LCT-SP1 according to the MIGS recommendations [22]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [23]
Phylum Proteobacteria TAS [24]
Class Alphaproteobacteria TAS [25, 26]
Order Sphingomonadales TAS [25, 26]
Family Sphingomonadaceae TAS [27, 28]
Genus Sphingomonas TAS [27, 28]
Species Sphingomonas paucimobilis TAS [1, 29]
(Type) strain: LCT-SP1IDA
Gram stainNegativeTAS [1]
Cell shapeRod-shapedTAS [1]
MotilitySlow motilityTAS [1]
SporulationNon-sporulatingTAS [1]
Temperature range30-38 °CNAS
Optimum temperature35 °CNAS
pH range; Optimum6.0-7.5; 7.2IDA
Carbon sourceD-glucose, maltose, lactose, sucrose, fucose, malic acid, acetic acid, Tween-40IDA
MIGS-6HabitatSpace cabin surfaceIDA
MIGS-6.3Salinity0-1.0 % NaCl (w/v)IDA
MIGS-22Oxygen requirementAerobicTAS [1]
MIGS-15Biotic relationshipFree-livingNAS
MIGS-14PathogenicityOpportunistic pathogenTAS [1, 30, 31]
MIGS-4Geographic locationInner Mongolia, ChinaIDA
MIGS-5Sample collectionJune 5, 2013NAS
MIGS-4.1LatitudeNot recorded
MIGS-4.2LongitudeNot recorded
MIGS-4.4AltitudeNot recorded

Evidence codes -IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32]

Fig. 2

Images of the Sphingomonas paucimobilis strain LCT-SP1: (a) colonies of the strains on Luria Bertani agar plates, and (b) scanning electron micrographs of the strain

Classification and general features of Sphingomonas paucimobilis strain LCT-SP1 according to the MIGS recommendations [22] Evidence codes -IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32] Images of the Sphingomonas paucimobilis strain LCT-SP1: (a) colonies of the strains on Luria Bertani agar plates, and (b) scanning electron micrographs of the strain

Genome sequencing information

Genome project history

A summary of the main project information of the strain LCT-SP1 is shown in Table 2. This organism was isolated from the condensate water in the Shenzhou X spacecraft, and was selected for sequencing for its phylogenetic affiliation with a lineage of . The genome sequences of this organism were deposited in GenBank under accession number KR080483, which belongs to the 16s ribosomal RNA coding gene sequence of LCT-SP1.
Table 2

Project information

MIGS IDPropertyTerm
MIGS-31Finishing qualityImproved high quality draft
MIGS-28Libraries usedOne 300bp Illumina genomic library
MIGS-29Sequencing platformsIllumina HiSeq2000
MIGS-31.2Fold coverage50×
MIGS-30AssemblersSOAPdenovo 1.05
MIGS-32Gene calling methodGlimmer 3.0
Locus TagACJ66
Genbank IDLDUA01000000
Genbank Date of ReleaseJune 18, 2015
GOLD IDGs0115809
BIOPROJECTPRJNA282437
MIGS-13Source Material IdentifierLCT-SP1
Project relevanceEnvironment
Project information

Growth conditions and genomic DNA preparation

strain LCT-SP1 was grown overnight on an aerobic LB agar plate at 35 °C. The total genomic DNA was extracted from 20 mL of cells using a CTAB bacterial genomic DNA isolation method [7] with kits provided by Illumina Inc. according to the manufacturer's instructions. DNA quality and quantity was determined by spectrophotometry.

Genome sequencing and assembly

The genome of LCT-SP1 was sequenced using paired-end sequencing technology [8] with Illumina HiSeq2000 (Illumina, SanDiego, CA, USA) at Majorbio Bio-pharm Technology Co., Ltd. (Shanghai, China). Draft assemblies were based on 6,986,766 readings, totaling 1,754 Mbp of 300 bp the PCR-free library, and 3,442,511 readings, totaling 1,556 Mbp of the 6,000 bp index library. The assembly was performed using the SOAPdenovo software package version 1.05 [9]. The gaps among scaffolds were closed by custom primer walks or by PCR amplification, followed by DNA sequencing to achieve optimal assembly results. The genome contained 3,884 candidate protein-encoding genes (with an average size of 958 bp), giving a coding intensity of 87.7%. A total of 1,906 proteins were assigned to 25 COG families [10]. A total of 47 tRNA genes and 3 rRNA genes were identified.

Genome annotation

Protein-coding genes of the draft genome assemblies were established using Glimmer version 3.0 [11]. The predicted CDSs were translated and employed to search the KEGG, COG, String, NR, and GO databases. These data sources were brought together to assert a product description for each predicted protein. tRNAs and rRNAs were predicted using tRNAscan-SE [12] and RNAmmer [13], respectively. Automatic gene annotation was performed by the National Center for Biotechnology Information Prokaryotic Genomes Automatic Annotation Pipeline [14].

Genome properties

The LCT-SP1 genome consisted of 4,302,226 bp circular chromosomes with a GC content of 65.66 % (Table 3). Of the 3,934 predicted genes, 3,884 (98.73 %) were protein-coding genes, and 50 (1.27 %) were RNA genes (3 rRNA genes, and 47 tRNA genes). In addition, among the total predicted genes, 1,906 (48.45 %) represented COG functional categories. Of these, the most abundant COG category was “General function prediction only” (211 proteins) followed by “Amino acid transport and metabolism” (171 proteins), “Translation” (141 proteins), “Energy production and conversion” (140 proteins), “Replication, recombination and repair” (130 proteins), “Function unknown” (124 proteins), “Inorganic ion transport and metabolism” (210 proteins), and “Replication, recombination and repair” (201 proteins). The properties and statistics of the genome are summarized in Table 3. The draft genome map of strain LCT-SP1 is illustrated in Fig. 3, and the distribution of genes into COG functional categories is presented in Table 4.
Table 3

Genome statistics

AttributeValue% of total
Genome size (bp)4,302,226100.00
DNA coding (bp)3,772,44087.69
DNA G + C (bp)2,824,84265.66
DNA scaffolds91100.00
Total genes3,934100.00
Protein coding genes3,88498.73
RNA genes501.27
Pseudo genes00.00
Genes in internal clusters1,61040.93
Genes with function prediction3,91199.42
Genes assigned to COGs1,90648.45
Genes with Pfam domains2,57165.35
Genes with signal peptides3679.33
Genes with transmembrane helices84621.50
CRISPR repeats6-
Fig. 3

Circular map of the draft genome of the Sphingomonas paucimobilis strain LCT-SP1. From outside to the center: Genes on the forward strand (colored by the predicted coding sequences), genes on the reverse strand (colored by COG categories), RNA genes, GC content, and GC skew. The map was created using the DNAPlotter according to the method described by Carver et al. (2009) [33]. DNAPlotter reads the common sequence formats (EMBL, Genbank, GFF) using the Artemis file-reading library and displays the sequence as the circular plot. Additional feature files can be read in and overlaid on the sequence

Table 4

Number of genes associated with general COG functional categories

CodeValue% ageDescription
J1413.58Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K1152.92Transcription
L1303.30Replication, recombination and repair
B10.03Chromatin structure and dynamics
D140.36Cell cycle control, Cell division, chromosome partitioning
V270.69Defense mechanisms
T781.98Signal transduction mechanisms
M751.91Cell wall/membrane biogenesis
N270.69Cell motility
U571.45Intracellular trafficking and secretion
O892.26Posttranslational modification, protein turnover, chaperones
C1403.56Energy production and conversion
G1092.77Carbohydrate transport and metabolism
E1714.35Amino acid transport and metabolism
F471.19Nucleotide transport and metabolism
H942.39Coenzyme transport and metabolism
I852.16Lipid transport and metabolism
P1142.90Inorganic ion transport and metabolism
Q571.45Secondary metabolites biosynthesis, transport and catabolism
R2115.36General function prediction only
S1243.15Function unknown
-2,02851.55Not in COGs
Genome statistics Circular map of the draft genome of the Sphingomonas paucimobilis strain LCT-SP1. From outside to the center: Genes on the forward strand (colored by the predicted coding sequences), genes on the reverse strand (colored by COG categories), RNA genes, GC content, and GC skew. The map was created using the DNAPlotter according to the method described by Carver et al. (2009) [33]. DNAPlotter reads the common sequence formats (EMBL, Genbank, GFF) using the Artemis file-reading library and displays the sequence as the circular plot. Additional feature files can be read in and overlaid on the sequence Number of genes associated with general COG functional categories

Insights from the genome sequence

Several studies suggest that the genus can degrade many compounds or materials, such as ferulic acid [2], lignin [3], and biphenyl [4]. Arens et al. believed that the localized corrosion of copper cold-water pipes resulted from the genus , leading to surface erosions, covered tubercles, and through-wall pinhole pits on the inner surface of the pipe [15]. strain LCT-SP1 can corrode several materials including epoxy resin, ester polyurethane, and ethers polyurethane (unpublished data). LCT-SP1 was isolated from the condensation water in the Chinese spacecraft Shenzhou X. Therefore, LCT-SP1 could be a suitable model for studying the properties of genes involved in microbial corrosion of aerospace related materials. Additionally, EC 1.14.11.2, gloA, and arsC gene were present in LCT-SP1, which was identified with 100% similarity to sp. S17 [16]. EC 1.14.11.2 is categorized as a procollagen-proline catalyzing enzyme [17]. The gloA gene encodes a glyoxalase that can reduce methylglyoxal toxicity in a cell [18]. Furthermore, arsC gene produces an arsenate reductase that can convert arsenate into arsenite, which is accordingly exported from cells by an energy-dependent efflux process [19]. Therefore, the genes mentioned above are likely responsible for the ability of LCT-SP1 to degrade various recalcitrant aromatic compounds and polysaccharides. The LCT-SP1 genome also contained an NhaA-type CDS for the Na+/H+ antiporter and some subunits of the multisubunit cation antiporter (Na+/H+) [20], which suggested that this strain should be compatible with its alkaline and hypersaline environment, and could corrode metallic materials by changing the pH balance of their surface. Also, biofilms from bacteria may be beneficial for corrosion control because of the removal of corrosive agents and the generation of a protective layer by biofilms [21]. LCT-SP1 included the gene encoding biofilm dispersion protein BdlA and biofilm growth-associated repressor that could inhibit the formation of biofilm, which may explain the microbial corrosion of materials. Further studies are needed to investigate these corrosion-based gene-coding sequences to reveal the role of LCT-SP1 in the microbial corrosion of materials.

Conclusions

The genome of strain LCT-SP1 isolated from the condensate water in the Chinese spacecraft Shenzhou X was sequenced. The strain LCT-SP1 genome included numerous genes that are likely responsible for their ability to degrade various recalcitrant aromatic compounds and polysaccharides. Further study of these corrosion-based gene-coding sequences may reveal the role of LCT-SP1 in microbial corrosion of materials, especially in aerospace applications. The genome sequence has been deposited at DDBJ/EMBL/GenBank under accession number LDUA00000000.
  31 in total

1.  List of new names and new combinations previously effectively, but not validly, published.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2006-01       Impact factor: 2.747

Review 2.  Biofilms: strategies for metal corrosion inhibition employing microorganisms.

Authors:  Rongjun Zuo
Journal:  Appl Microbiol Biotechnol       Date:  2007-08-16       Impact factor: 4.813

3.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

4.  Affinity column purification of protocollagen proline hydroxylase from chick embryos and further characterization of the enzyme.

Authors:  R A Berg; D J Prockop
Journal:  J Biol Chem       Date:  1973-02-25       Impact factor: 5.157

5.  Strain identification of Mycobacterium tuberculosis by DNA fingerprinting: recommendations for a standardized methodology.

Authors:  J D van Embden; M D Cave; J T Crawford; J W Dale; K D Eisenach; B Gicquel; P Hermans; C Martin; R McAdam; T M Shinnick
Journal:  J Clin Microbiol       Date:  1993-02       Impact factor: 5.948

6.  Proposal of Sphingomonadaceae fam. nov., consisting of Sphingomonas Yabuuchi et al. 1990, Erythrobacter Shiba and Shimidu 1982, Erythromicrobium Yurkov et al. 1994, Porphyrobacter Fuerst et al. 1993, Zymomonas Kluyver and van Niel 1936, and Sandaracinobacter Yurkov et al. 1997, with the type genus Sphingomonas Yabuuchi et al. 1990.

Authors:  Y Kosako; E Yabuuchi; T Naka; N Fujiwara; K Kobayashi
Journal:  Microbiol Immunol       Date:  2000       Impact factor: 1.955

7.  Characterization of the 5-carboxyvanillate decarboxylase gene and its role in lignin-related biphenyl catabolism in Sphingomonas paucimobilis SYK-6.

Authors:  Xue Peng; Eiji Masai; Hirotaka Kitayama; Kyo Harada; Yoshihiro Katayama; Masao Fukuda
Journal:  Appl Environ Microbiol       Date:  2002-09       Impact factor: 4.792

8.  Cloning and characterization of the ferulic acid catabolic genes of Sphingomonas paucimobilis SYK-6.

Authors:  Eiji Masai; Kyo Harada; Xue Peng; Hirotaka Kitayama; Yoshihiro Katayama; Masao Fukuda
Journal:  Appl Environ Microbiol       Date:  2002-09       Impact factor: 4.792

9.  Taxonomic affiliation of new genomes should be verified using average nucleotide identity and multilocus phylogenetic analysis.

Authors:  María José Figueras; Roxana Beaz-Hidalgo; Mohammad J Hossain; Mark R Liles
Journal:  Genome Announc       Date:  2014-12-04

10.  DNAPlotter: circular and linear interactive genome visualization.

Authors:  Tim Carver; Nick Thomson; Alan Bleasby; Matthew Berriman; Julian Parkhill
Journal:  Bioinformatics       Date:  2008-11-05       Impact factor: 6.937

View more
  2 in total

1.  Complete genome sequencing and analysis of endophytic Sphingomonas sp. LK11 and its potential in plant growth.

Authors:  Sajjad Asaf; Abdul Latif Khan; Muhammad Aaqil Khan; Ahmed Al-Harrasi; In-Jung Lee
Journal:  3 Biotech       Date:  2018-08-28       Impact factor: 2.406

2.  Comprehensive Genome Analysis on the Novel Species Sphingomonas panacis DCY99T Reveals Insights into Iron Tolerance of Ginseng.

Authors:  Yeon-Ju Kim; Joon Young Park; Sri Renukadevi Balusamy; Yue Huo; Linh Khanh Nong; Hoa Thi Le; Deok Chun Yang; Donghyuk Kim
Journal:  Int J Mol Sci       Date:  2020-03-16       Impact factor: 5.923

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.