| Literature DB >> 30349619 |
Zhen Yu1, Guiqin Yang1, Xiaoming Liu1, Yueqiang Wang1, Li Zhuang2, Shungui Zhou3.
Abstract
The Azospirillum humicireducens strain SgZ-5T, belonging to the Order Rhodospirillales and the Family Rhodospirillaceae, was isolated from a microbial fuel cell inoculated with paddy soil. A previous work has shown that strain SgZ-5T was able to fix atmospheric nitrogen involved in plant growth promotion. Here we present the complete genome of A. humicireducens SgZ-5T, which consists of a circular chromosome and six plasmids with the total genome size of 6,834,379 bp and the average GC content of 67.55%. Genome annotations predicted 5969 protein coding and 85 RNA genes including 14 rRNA and 67 tRNA genes. By genomic analysis, we identified a complete set of genes that is potentially involved in nitrogen fixation and its regulation. This genome also harbors numerous genes that are likely responsible for phytohormones production. We anticipate that the A. humicireducens SgZ-5T genome will contribute insights into plant growth promoting properties of Azospirillum strains.Entities:
Keywords: Azospirillum humicireducens; Complete genome; Nitrogen fixation; PGPP
Year: 2018 PMID: 30349619 PMCID: PMC6192227 DOI: 10.1186/s40793-018-0322-2
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Images of the SgZ-5T. a Colonies of the strain on NA agar plate, b light microscopy and c transmission electron microscopy of the strain
Classification and general features of SgZ-5T according to the MIGS recommendations [16]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current Classification | Domain | TSA [ | |
| Phylum | TSA [ | ||
| Class | TSA [ | ||
| Order | TSA [ | ||
| Family | TSA [ | ||
| Genus | TSA [ | ||
| Species | TSA [ | ||
| Type strain SgZ-5= | TSA [ | ||
| Gram stain | Negative | TSA [ | |
| Cell shape | Spiral, straight to slightly curved rods | TSA [ | |
| Motility | Motile | TSA [ | |
| Sporulation | Nonsporulating | NSA | |
| Temperature range | 25–37 °C | TSA [ | |
| Optimum temperature | 30 °C | TSA [ | |
| pH range; Optimum | 5.5–8.5; 7.2 | TSA [ | |
| Carbon source | Acetate, L-lactate, citrate, D-ribose, L-rhamnose, D-glucose, N-Acetyl-glucosamine, meso-inositol, D-saccharose, D-maltose, suberic acid, malonate, L-serine, salicin, L-alanine, gluconate, glycogen, 2-keto-gluconate, D-mannitol, D-melibiose, L-fucose, D-sorbierite, L-arabinose, L-histidine, 3-hydroxy-butyric acid, 4-hydroxy-benzoic acid, L-proline, capric acid, adipic acid and malic acid | TSA [ | |
| MIGS-6 | Habitat | Paddy soil | TSA [ |
| MIGS-6.3 | Salinity | NaCl 0–1% ( | TSA [ |
| MIGS-22 | Oxygen requirement | Facultative anaerobic | TSA [ |
| MIGS-15 | Biotic relationship | Free living | NAS |
| MIGS-14 | Pathogenicity | Not reported | |
| MIGS-4 | Geographic location | Guangzhou City, Guangdong Province, China | NAS |
| MIGS-5 | Sample collection time | Dec 2011 | NAS |
| MIGS-4.1 | Latitude | 23.18o N | NAS |
| MIGS-4.2 | Longitude | 113.36o E | NAS |
| MIGS-4.3 | Depth | 0.1 m beneath the surface | NAS |
| MIGS-4.4 | Altitude | 40 m | NAS |
aEvidence code – IDA Inferred from direct assay, TAS Traceable author statement (i.e., a direct report exists in the literature), NAS Non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology Project [39]
Fig. 2Phylogenetic tree highlighting the position of SgZ-5T relative to other type strains within the genus . The strains and their corresponding GenBank accession numbers of 16S rRNA genes were indicated in parentheses. The sequences were aligned using Clustal W and the neighbor-joining tree was constructed based on kimura 2-paramenter distance model by using MEGA 5. Bootstrap values above 50 % were obtained from 1000 bootstrap replications. Bar, 0.01 substitutions per nucleotide position. Rhodovulum adriaticum DSM 2781T was used as an outgroup
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Complete |
| MIGS-28 | Libraries used | Three libraries (a paired-end library and two mate-pair libraries) |
| MIGS-29 | Sequencing platforms | Illumina Hiseq 2500 |
| MIGS-31.2 | Fold coverage | 259× |
| MIGS-30 | Assemblers | SOAP |
| MIGS-32 | Gene calling method | GeneMarkS+ [ |
| Locus Tag | A6A40 | |
| Genbank ID | ||
| Genbank Date of Release | April 18, 2018 | |
| GOLD ID | Gp0150267 | |
| Bioproject |
| |
| MIGS-13 | Source Material Identifier | SgZ-5T |
| Project relevance | Type strain, nitrogen fixation, plant growth promotion |
Summary of genome: one chromosome and six plasmids
| Label | Size (bp) | Topology | INSDC identifier | RefSeq ID |
|---|---|---|---|---|
| Chromosome | 3,181,617 | Circular | NZ_CP015285.1 | |
| pYZ1 | 715,112 | Circular | NA | |
| pYZ2 | 1,008,603 | Circular | NA | |
| pYZ3 | 252,411 | Circular | NA | |
| pYZ4 | 338,445 | Circular | NA | |
| pYZ5 | 626,509 | Circular | NA | |
| pYZ6 | 711,682 | Circular | NA |
Genome statistics of SgZ-5T
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of total | |
| Genome size (bp) | 6,834,379 | 100.00 |
| DNA coding (bp) | 5,902,731 | 86.37 |
| DNA G + C (bp) | 4,616,422 | 67.55 |
| DNA scaffolds | 7 | |
| Total genes | 6054 | 100.00 |
| Protein coding genes | 5969 | 98.60 |
| RNA genes | 85 | 1.40 |
| rRNA genes | 14 | 0.23 |
| tRNA genes | 67 | 1.11 |
| Pseudo genes | 194 | 3.20 |
| Genes in internal clusters | NA | |
| Genes with function prediction | 4844 | 80.01 |
| Genes assigned to COGs | 4550 | 75.16 |
| Genes with signal peptides | 425 | 7.02 |
| Genes with transmembrane helices | 1022 | 16.88 |
| CRISPR repeats | 3 | |
Fig. 3Circular map of the chromosome of SgZ-5T. From center to outside, circle 1 illustrates the GC skew. Circle 2 shows GC content (peaks out/inside the circle indicate values higher or lower than the average G+C content, respectively). Circle 3 denotes ncRNA genes. Circles 4, 5 and 6 indicate the CDSs, colored according to COG, KEGG and GO categories, respectively. Circle 7 demonstrates the predicted protein-coding sequences
Number of genes associated with general COG functional categories
| Code | Value | % of totala | Description |
|---|---|---|---|
| J | 182 | 2.98 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 357 | 5.85 | Transcription |
| L | 175 | 2.87 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 38 | 0.62 | Cell cycle control, cell division, chromosome partitioning |
| V | 80 | 1.31 | Defense mechanisms |
| T | 338 | 5.54 | Signal transduction mechanisms |
| M | 218 | 3.57 | Cell wall/membrane/envelope biogenesis |
| N | 73 | 1.20 | Cell motility |
| U | 58 | 0.95 | Intracellular trafficking, secretion, and vesicular transport |
| O | 162 | 2.65 | Posttranslational modification, protein turnover, chaperones |
| C | 342 | 5.60 | Energy production and conversion |
| G | 263 | 4.31 | Carbohydrate transport and metabolism |
| E | 448 | 7.34 | Amino acid transport and metabolism |
| F | 81 | 1.33 | Nucleotide transport and metabolism |
| H | 160 | 2.62 | Coenzyme transport and metabolism |
| I | 139 | 2.28 | Lipid transport and metabolism |
| P | 333 | 5.45 | Inorganic ion transport and metabolism |
| Q | 144 | 2.36 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 227 | 3.72 | General function prediction only |
| S | 731 | 11.97 | Function unknown |
| – | 1555 | 25.47 | Not in COGs |
aThe total is based on the total number of protein coding genes in the annotated genome
Genome statistics comparison among characterized speciesa
| Genome name | 1 | 2 | 3 | 4 | 5 | 6 | ||
|---|---|---|---|---|---|---|---|---|
| Sp 7 | Az39 | Sp245 | 4B | DSM 3675 | BV-S | B510 | SgZ-5 | |
| Genome size (Mb) | 6.6 | 7.4 | 7.5 | 6.8 | 6.5 | 7.6 | 7.6 | 6.8 |
| Plasmids | 5 | 5 | 6 | 6 | NA | 7 | 6 | 6 |
| G + C content (%) | 68.3 | 68.5 | 68.5 | 67.7 | 70.7 | 68.2 | 67.6 | 67.5 |
| Total genes | 5951 | 6713 | 6982 | 6137 | 5999 | 6684 | 6692 | 6054 |
| Protein coding genes | 5828 | 6596 | 6882 | 6027 | 5925 | 6575 | 6582 | 5969 |
| RNA genes | 123 | 117 | 100 | 110 | 74 | 109 | 110 | 85 |
| rRNA genes | 29 | 26 | 15 | 26 | 11 | 26 | 26 | 14 |
| tRNA genes | 90 | 87 | 81 | 80 | 59 | 79 | 80 | 67 |
| Pseudogenes | 98 | 188 | 375 | 77 | 128 | 126 | 195 | 194 |
| Frameshifted genes | 38 | 61 | 286 | 21 | 27 | 53 | 85 | 67 |
| CRISPR | 2 | NA | NA | 3 | 4 | NA | 5 | 3 |
athe species are numbered as: 1, [26, 27]; 2, [26]; 3, (RefSeq ID: NZ_AUCF00000000.1); 4, [3]; 5, sp. [6]; 6,
Genes of SgZ-5T involved in nitrogen fixation
| Locus Tag | Size/aa | Gene | Gene product |
|---|---|---|---|
| A6A40_02185 | 852 |
| Electron transfer flavoprotein beta subunit |
| A6A40_02190 | 1080 |
| Electron transfer flavoprotein alpha chain |
| A6A40_02195 | 1302 |
| Flavoprotein-ubiquinone oxidoreductase |
| A6A40_09085 | 210 |
| Nitrogen fixation protein |
| A6A40_02200 | 285 |
| Ferredoxin-like protein |
| A6A40_09040 | 1866 |
| Nif-specific transcriptional activator |
| A6A40_09050 | 1518 |
| Nitrogenase FeMo cofactor biosynthesis protein |
| A6A40_02900 | 1440 |
| Nitrogenase molybdenum-iron protein alpha chain |
| A6A40_02875 | 1407 |
| Nitrogenase molybdenum-cofactor biosynthesis protein |
| A6A40_02905 | 897 |
| Nitrogenase iron protein |
| A6A40_02895 | 1560 |
| Nitrogenase molybdenum-iron protein subunit beta |
| A6A40_02870 | 1371 |
| Nitrogenase molybdenum-cofactor biosynthesis protein |
| A6A40_02230 | 1206 |
| Nitrogenase metalloclusters biosynthesis protein |
| A6A40_02235 | 924 |
| Iron-sulfur cluster assembly scaffold protein |
| A6A40_02225 | 1122 |
| Homocitrate synthase |
| A6A40_02215 | 336 |
| Nitrogenase-stabilizing/protective protein |
| A6A40_02865 | 399 |
| Nitrogenase molybdenum-iron protein |
| A6A40_09070 | 333 |
| Nitrogenase P-cluster assembly |
| A6A40_09075 | 306 |
| Nitrogenase P-cluster assembly |
| A6A40_02220 | 852 |
| Serine acetyltransferase |
| A6A40_02925 | 909 |
| ADP-ribosyl-[dinitrogen reductase] hydrolase |
| A6A40_02920 | 891 |
| ADP-ribosyl-[dinitrogenase reductase] transferase |
| A6A40_07245 | 2847 |
| [Protein-PII] uridylyltransferase |
| A6A40_07685 | 339 |
| Nitrogen regulatory protein P-II |
| A6A40_05220 | 1200 |
| Nitrogen regulation sensor histidine kinase |
| A6A40_05215 | 1146 |
| Nitrogen regulation response regulator |
| A6A40_05205 | 1401 |
| Sigma-54-dependent transcriptional regulator |
| A6A40_05210 | 2319 |
| Nitrogen regulation sensor histidine kinase |
Fig. 4Organization of the nitrogen fixation gene cluster in SgZ-5T. Arrows represent genes and their respective direction of transcription. Genes are colored as depicted in the lower box