| Literature DB >> 26203334 |
Li Lin1, Chunyan Wei2, Mingyue Chen3, Hongcheng Wang3, Yuanyuan Li3, Yangrui Li1,2, Litao Yang1,2,4, Qianli An3.
Abstract
Klebsiella variicola strain DX120E (=CGMCC 1.14935) is an endophytic nitrogen-fixing bacterium isolated from sugarcane crops grown in Guangxi, China and promotes sugarcane growth. Here we summarize the features of the strain DX120E and describe its complete genome sequence. The genome contains one circular chromosome and two plasmids, and contains 5,718,434 nucleotides with 57.1% GC content, 5,172 protein-coding genes, 25 rRNA genes, 87 tRNA genes, 7 ncRNA genes, 25 pseudo genes, and 2 CRISPR repeats.Entities:
Keywords: Endophyte; Klebsiella pneumoniae; Klebsiella variicola; Nitrogen fixation; Pathogenicity; Plant growth-promoting bacteria; Sugarcane
Year: 2015 PMID: 26203334 PMCID: PMC4511632 DOI: 10.1186/s40793-015-0004-2
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Morphology of Klebsiella variicola DX120E cells. Transmission electron micrograph (left) shows a DX120E cell stained by uranyl acetate; laser scanning confocal micrograph (right) shows DX120E cells tagged by green fluorescent protein. The scale bars represent 1 μm.
Classification and general features of strain DX120E according to the MIGS recommendations[11]
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain:F2R9T (ATCC BAA-830 = DSM 15968) | TAS [ | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Non-motile | IDA | |
| Sporulation | Non-sporulating | IDA | |
| Temperature range | 4–40°C | IDA | |
| Optimum temperature | 28–32°C | IDA | |
| pH range; Optimum | 3.5–8.5; 7.0 | IDA | |
| Carbon source | Sucrose, citrate, fructose, galactose, glucose, lactose, malate, maltose, mannitol, mannose, rhamnose, & sorbitol | IDA | |
| MIGS-6 | Habitat | Soil, plants | IDA |
| MIGS-6.3 | Salinity | 0 – 5% NaCl (w/v) | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Free-living, endophytic | IDA |
| MIGS-14 | Pathogenicity | Not reported | |
| MIGS-4 | Geographic location | Daxin, Guangxi, China | TAS [ |
| MIGS-5 | Sample collection | 2008 | TAS [ |
| MIGS-4.1 MIGS-4.2 | Longitude | 107°20’E | NAS |
| Latitude | 22°80’N | NAS | |
| MIGS-4.3 | Depth | 0.1 – 0.2 m below the surface | IDA |
| MIGS-4.4 | Altitude | 320 m | NAS |
aEvidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [21].
Figure 2Phylogenetic tree of 16S rRNA gene sequences showing the position of Klebsiella variicola strain DX120E (●). Escherichia coli ATCC 11775T is used as an outgroup. The sequences were aligned with the CLUSTAL W program and were constructed with the neighbor-joining algorithm and the Kimura 2-parameter model integrated in the MEGA 5.2 program [22]. The phylogenetic tree was tested with 1,000 bootstrap replicates. Bootstrap values are shown at the nodes. The GenBank accession numbers of the sequences are indicated in parentheses. The scale bar represents a 1% nucleotide sequence divergence. Note that the genome of strain DX120E, F2R9T (DSM 15968T), or Kp342 contains eight copies of 16S rRNA genes; these gene sequences are generally not identical but phylogenetically grouped together (data not shown). The sequence of DX120E (HQ204296) used to construct the phylogenetic tree is identical to the sequence of locus tag KR75_09260 (CP009274:1935034–1936587).
Genome sequencing project information for strain DX120E
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | PacBio 4 –10Kb library |
| Illumina 500 bp library | ||
| MIGS-29 | Sequencing platforms | PacBio RS II |
| Illumina HiSeq 2000 | ||
| MIGS-31.2 | Fold coverage | PacBio 96 × |
| Illumina 106 × | ||
| MIGS-30 | Assemblers | HGAP in smrtanalysis-2.1.1SOAPdenovo 2.04 |
| MIGS-32 | Gene calling method | GeneMarkS+ |
| Locus Tag | KR75 | |
| Genbank ID | CP009274 (Chromosome) | |
| CP009275 (plasmid pKV1) | ||
| CP009276 (plasmid pKV2) | ||
| Genbank Date of Release | January 1, 2015 | |
| GOLD ID | Gi0078577 | |
| BIOPROJECT | PRJNA259590 | |
| MIGS 13 | Source Material Identifier | CGMCC 1.14935 |
| Project relevance | Agriculture, plant-microbe interactions |
Summary of genome: one chromosome and two plasmids
| Chromosome | 5,501,013 | Circular | CP009274.1 | NZ_CP009274.1 |
| Plasmid pKV1 | 162,706 | Circular | CP009275.1 | NZ_CP009275.1 |
| Plasmid pKV2 | 54,715 | Circular | CP009276.1 | NZ_CP009276.1 |
Figure 3Circular map of the chromosome and plasmids of Klebsiella variicola strain DX120E. From outside to the center: genes on forward strand, genes on reverse strand, GC content, GC skew. Circular map was generated by CGView [31].
Genome statistics
| Genome size (bp) | 5,718,434 | 100 |
| DNA coding (bp) | 4,930,539 | 86.22 |
| DNA G + C (bp) | 3,265,303 | 57.10 |
| DNA scaffolds | 3 | 100 |
| Total genes | 5,316 | 100 |
| Protein-coding genes | 5,172 | 97.29 |
| RNA genes | 112 | 2.12 |
| Pseudo genes | 25 | 0.47 |
| Genes with function prediction | 4,623 | 87.00 |
| Genes assigned to COGs | 4,398 | 82.73 |
| Genes with Pfam domains | 4,631 | 87.11 |
| Genes with signal peptides | 526 | 9.89 |
| Genes with transmembrane helices | 1,289 | 24.25 |
| CRISPR repeats | 2 | 0.04 |
Number of genes associated with general COG functional categories
| J | 198 | 3.83 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.02 | RNA processing and modification |
| K | 489 | 9.45 | Transcription |
| L | 159 | 3.07 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 43 | 0.83 | Cell cycle control, cell division, chromosomepartitioning |
| V | 71 | 1.37 | Defense mechanisms |
| T | 235 | 4.54 | Signal transduction mechanisms |
| M | 260 | 5.03 | Cell wall/membrane biogenesis |
| N | 62 | 1.20 | Cell motility |
| U | 111 | 2.15 | Intracellular trafficking and secretion |
| O | 158 | 3.05 | Posttranslational modification, protein turnover, chaperones |
| C | 342 | 6.61 | Energy production and conversion |
| G | 583 | 11.27 | Carbohydrate transport and metabolism |
| E | 538 | 10.40 | Amino acid transport and metabolism |
| F | 102 | 1.97 | Nucleotide transport and metabolism |
| H | 215 | 4.16 | Coenzyme transport and metabolism |
| I | 130 | 2.51 | Lipid transport and metabolism |
| P | 344 | 6.65 | Inorganic ion transport and metabolism |
| Q | 112 | 2.17 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 541 | 10.46 | General function prediction only |
| S | 414 | 8.00 | Function unknown |
| - | 774 | 14.97 | Not in COGs |
The total is based on the total number of protein coding genes in the genome.