| Literature DB >> 28174619 |
Jian-Yu Jiao1, Lorena Carro2, Lan Liu1, Xiao-Yang Gao3, Xiao-Tong Zhang1, Wael N Hozzein4,5, Alla Lapidus6,7, Marcel Huntemann8, T B K Reddy8, Neha Varghese8, Michalis Hadjithomas8, Natalia N Ivanova8, Markus Göker9, Manoj Pillay10, Jonathan A Eisen11, Tanja Woyke8, Hans-Peter Klenk2,9, Nikos C Kyrpides8,12, Wen-Jun Li1,13.
Abstract
Jiangella gansuensis strain YIM 002T is the type strain of the type species of the genus Jiangella, which is at the present time composed of five species, and was isolated from desert soil sample in Gansu Province (China). The five strains of this genus are clustered in a monophyletic group when closer actinobacterial genera are used to infer a 16S rRNA gene sequence phylogeny. The study of this genome is part of the GenomicEncyclopedia ofBacteria andArchaea project, and here we describe the complete genome sequence and annotation of this taxon. The genome of J. gansuensis strain YIM 002T contains a single scaffold of size 5,585,780 bp, which involves 149 pseudogenes, 4905 protein-coding genes and 50 RNA genes, including 2520 hypothetical proteins and 4 rRNA genes. From the investigation of genome sizes of Jiangella species, J. gansuensis shows a smaller size, which indicates this strain might have discarded too much genetic information to adapt to desert environment. Seven new compounds from this bacterium have recently been described; however, its potential should be higher, as secondary metabolite gene cluster analysis predicted 60 gene clusters, including the potential to produce the pristinamycin.Entities:
Keywords: Desert; GEBA; Genome; Jiangella gansuensis; Jiangellales; Taxonomic comments
Year: 2017 PMID: 28174619 PMCID: PMC5292007 DOI: 10.1186/s40793-017-0226-6
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Scanning electron micrograph of Jiangella gansuensis strain YIM 002T grown on ISP medium 2 for 14d at 28 °C. Bar size: 2 μm
Fig. 2Phylogenetic tree showing the relationship of J. gansuensis YIM 002T with some other actinobacteria based on 16S rRNA gene sequences. The Neighbour-joining tree was built using MEGA 5 [39] with the Kimura 2-parameter model. Bootstrap values (percentages of 1000 replicates) are shown at branch points. Asterisks denote nodes that were also recovered using the Maximum Likelihood method in the branch of Jiangellaceae. The Haloglycomyces albus act as the outgroup
Classification and general features of Jiangella gansuensis strain YIM 002T in accordance with the MIGS recommendations [20], List of Prokaryotic names with Standing in Nomenclature [40] and the Names for Life database [41]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain YIM 002T (=DSM 44835T) | TAS [ | ||
| Gram stain | Positive | IDA | |
| Cell shape | Filamentous | IDA | |
| Motility | Non motile | IDA | |
| Sporulation | Non-sporulating | IDA | |
| Temperature range | 10–45 °C | IDA | |
| Optimum temperature | 28 °C | IDA | |
| pH range; Optimum | 7.0–8.0 | TAS [ | |
| Carbon source | Various | IDA | |
| Energy source | Chemoorganotroph | IDA | |
| MIGS-6 | Habitat | Desert soil | IDA |
| MIGS-6.3 | Salinity | Halotolerant | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Free living | IDA |
| MIGS-14 | Pathogenicity | None | IDA |
| MIGS-4 | Geographic location | Gansu Province, China | IDA |
| MIGS-5 | Sample collection time | 2005 or before | NAS |
| MIGS-4.1 | Latitude | Not reported | NAS |
| MIGS-4.2 | Longitude | Not reported | |
| MIGS-4.4 | Altitude | Not reported |
a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [45]
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Non-contiguous Finished |
| MIGS-28 | Libraries used | Illumina Std shotgun library |
| MIGS 29 | Sequencing platforms | 454-GS-FLX-Titanium Illumina GAii |
| MIGS 31.2 | Fold coverage | Unknown |
| MIGS 30 | Assemblers | ALLPATHS v. R37654 |
| MIGS 32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| Locus Tag | JIAGA | |
| GenBank ID | AZXT00000000 | |
| GenBank Date of Release | 15-08-2013 | |
| GOLD ID | Gp0001209 | |
| BIOPROJECT | PRJNA224116, PRJNA63165 | |
| MIGS 13 | Source Material Identifier | YIM 002, DSM 44835 |
| Project relevance | Tree of Life, GEBA |
Fig. 3Graphical map of the J. gansuensis strain YIM 002T chromosome. The genome circular map was set up by the CGView Server [46]. From the outside to the center: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), GC content, GC skew, where green indicates positive values and magenta indicates negative values
Genome Statistics
| Attribute | Value | % of total a |
|---|---|---|
| Genome size (bp) | 5,585,780 | 100.00 |
| DNA coding (bp) | 4,761,339 | 85.24 |
| DNA G + C (bp) | 3,960,974 | 70.91 |
| DNA scaffolds | 1 | - |
| Total genes | 5,104 | - |
| Protein-coding genes | 4,905 | 98.03 |
| RNA genes | 50 | 0.98 |
| Pseudo genes | 149 | 2.98 |
| Genes in internal clusters | 1763 | 34.54 |
| Genes with function prediction | 2,504 | 48.86 |
| Genes assigned to COGs | 2,156 | 42.07 |
| Genes with Pfam domains | 1,734 | 33.97 |
| Genes with signal peptides | 456 | 8.69 |
| Genes with trandmembrane helices | 1230 | 23.43 |
| CPISPR repeats | 0 | - |
a The total is based on either the size of genome in base pairs or the total number of genes in the predicted genome
Number of genes associated with the general COG functional categories
| Code | Value | % age a | Description |
|---|---|---|---|
| J | 160 | 3.18 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.02 | RNA processing and modification |
| K | 230 | 4.58 | Transcription |
| L | 116 | 2.31 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 21 | 0.42 | Cell cycle control, cell division, chromosome partitioning |
| V | 60 | 1.19 | Defence mechanisms |
| T | 75 | 1.49 | Signal transduction mechanisms |
| M | 96 | 1.91 | Cell wall/membrane biogenesis |
| N | 0 | 0.00 | Cell motility |
| U | 18 | 0.36 | Intracellular trafficking, secretion, and vesicular transport |
| O | 69 | 1.37 | Posttranslational modification, protein turnover, chaperones |
| C | 160 | 3.18 | Energy production and conversion |
| G | 223 | 4.44 | Carbohydrate transport and metabolism |
| E | 298 | 5.93 | Amino acid transport and metabolism |
| F | 56 | 1.11 | Nucleotide transport and metabolism |
| H | 114 | 2.27 | Coenzyme transport and metabolism |
| I | 111 | 2.21 | Lipid transport and metabolism |
| P | 179 | 3.56 | Inorganic ion transport and metabolism |
| Q | 84 | 1.67 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 311 | 6.19 | General function prediction only |
| S | 151 | 3.00 | Function unknown |
| - | 2868 | 57.09 | Not in COGs |
aThe total is based on the total number of protein-coding genes in the genome