| Literature DB >> 30250634 |
Xinye Wang1, Yantao Luo2, Dongying Liu2, Jiamei Wang2, Shi Wei1, Liang Zhao1.
Abstract
Mesorhizobium amorphae CCNWGS0123 was isolated in 2006, from effective nodules of Robinia pseudoacacia L. grown in lead-zinc mine tailing site, in Gansu Province, China. M. amorphae CCNWGS0123 is an aerobic, Gram-negative, non-spore-forming rod strain. This paper characterized M. amorphae CCNWGS0123 and presents its complete genome sequence information and genome annotation. The 7,374,589 bp long genome which encodes 7136 protein-coding genes and 63 RNA coding genes, contains one chromosome and four plasmids. Moreover, a chromosome with no gaps was assembled.Entities:
Keywords: Nitrogen; Nodulation; Rhizobia; Symbiosis
Year: 2018 PMID: 30250634 PMCID: PMC6145117 DOI: 10.1186/s40793-018-0321-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1SEM (Scanning electron microscope) micrograph of M. amorphae CCNWGS0123 cells (a) and M. amorphae ACCC 19665T (b)
Fig. 2Phylogenetic tree showing the relationships of Mesorhizobium amorphae CCNWGS0123 with other root nodule bacteria based on aligned sequences of a 1296 bp internal region of the 16S rRNA gene. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 6 [15]. The tree was built using the Maximum Likelihood method based on the Tamura-Nei model [16]. Bootstrap analysis [17] with 500 replicates was performed to assess the support of the clusters
Classification and general features of Mesorhizobium amorphae CCNWGS0123
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain: Bacteria | TAS [ | |
| Phylum: | TAS [ | ||
| Class: | TAS [ | ||
| Order: | TAS [ | ||
| Family: | TAS [ | ||
| Genus: | TAS [ | ||
| Species: | TAS [ | ||
| Stain: CCNWGS0123 | |||
| Gram strain | Negative | NAS | |
| Cell shape | rod | NAS | |
| Motility | motile | NAS | |
| Sporulation | None-spore forming | NAS | |
| Temperature range | Not reported | ||
| Optimum temperature | 28 °C | NAS | |
| Carbon source | D xylose, galactose, L-arabinose, D-ribose, rhamnose, mannose, maltose, glucose, saccharose, lactose | TAS | |
| MIGS-6 | Habitat | Soil, Host-associated | TAS [ |
| MIGS-6.3 | Salinity range | Not reported | |
| MIGS-22 | Oxygen requirement | aerobic | NAS |
| MIGS-15 | Biotic relationship | Free living, Symbiont | NAS |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| MIGS-4 | Geographic location | China: Gansu, Huixian | IDA |
| MIGS-5 | Sample collection | 2006 | IDA |
| MIGS-4.1 | Latitude | 33.8 N | IDA |
| MIGS-4.2 | Longitude | 106.1E | IDA |
| MIGS-4.4 | Altitude | 1049 m | IDA |
aEvidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [27]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | High-quality,closed genome |
| MIGS-28 | Libraries used | A 10Kb library |
| MIGS-29 | Sequencing platforms | PacBio RS II |
| MIGS-31.2 | Fold coverage | 134.86× |
| MIGS-30 | Assemblers | Celera Assembler CA 8.1 |
| MIGS-32 | Gene calling method | GeneMarkS |
| Locus Tag | Mea0123 | |
| Genbank ID | Chromosome CP015318; pM0123a CP015319; pM0123b CP015320; pM0123c CP015321; pM0123d CP015322 | |
| Genbank date of Release | July 18,2016 | |
| BIOPROJECT | PRJNA318467 | |
| MIGS-13 | Source Material Identifier | CCNWGS0123 |
| Project relevance | Legume plant symbiosis |
Genome statistics
| Attribute | Value | % of total |
|---|---|---|
| Genome size (bp) | 7,343,952 | 100 |
| DNA coding (bp) | 6,378,582 | 86.85 |
| DNA G + C (bp) | 4,670,753 | 62.87 |
| DNA scaffolds | 4 | |
| Total genes | 7378 | 100 |
| Protein coding genes | 7136 | 96.45 |
| RNA genes | 63 | 0.92 |
| Pseudo genes | 179 | |
| Genes in internal clusters | NA | |
| Genes with function prediction | 6726 | 98.62 |
| Genes assigned to COGs | 4758 | 68.34 |
| Genes with Pfam domains | 5805 | 83.38 |
| Genes with signal peptides | 2239 | 35.72 |
| Genes with transmembrane helices | 1585 | 22.77 |
| CRISPR repeats | 4 | – |
Fig. 3Graphical map of Mesorhizobium amorphae CCNWGS0123 genome. From outside to the center: sequence position coordinates, coding gene, COG assignment, KEGG assignment, GO assignment, ncRNA, G + C content and G + C skew
Number of genes associated with the general COG functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 190 | 2.64 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 467 | 6.49 | Transcription |
| L | 200 | 2.78 | Replication, recombination and repair |
| B | 4 | 0.06 | Chromatin structure and dynamics |
| D | 26 | 0.36 | Cell cycle control, mitosis and meiosis |
| V | 26 | 0.36 | Defense mechanisms |
| T | 19 | 0.26 | Signal transduction mechanisms |
| M | 253 | 3.51 | Cell wall/membrane biogenesis |
| N | 43 | 0.60 | Cell motility |
| U | 101 | 1.40 | Intracellular trafficking and secretion |
| O | 181 | 2.51 | Posttranslational modification, protein turnover, chaperones |
| C | 328 | 4.56 | Energy production and conversion |
| G | 492 | 6.83 | Carbohydrate transport and metabolism |
| E | 765 | 10.63 | Amino acid transport and metabolism |
| F | 66 | 0.92 | Nucleotide transport and metabolism |
| H | 189 | 2.63 | Coenzyme transport and metabolism |
| I | 233 | 3.24 | Lipid transport and metabolism |
| P | 283 | 3.93 | Inorganic ion transport and metabolism |
| Q | 197 | 2.74 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 734 | 10.20 | General function prediction only |
| S | 485 | 6.74 | Function unknown |
| – | 1690 | 23.48 | Not in COGs |
Function annotation assignment from different databases
| Database | Assigned Number | Percent (%) |
|---|---|---|
| COG | 4758 | 66.68 |
| GO | 4163 | 58.34 |
| KEGG | 3700 | 51.85 |
| NR | 6726 | 94.25 |
| Swissprot | 2268 | 31.78 |
| TrEMBL | 6096 | 85.43 |
| Annotated | 6962 | 97.56 |
| Total | 7136 | 100 |
General Information of five Mesorhizobium genome
| CCNWGS0123 | 7653R | MAFF303099 | WSM1271 | WSM2075 | |
|---|---|---|---|---|---|
| length | 7,343,952 | 6,881,676 | 7,569,297 | 6,690,028 | 6,884,444 |
| G + C% | 62.8 | 62.86 | 62.51 | 62.56 | 62.87 |
| gene | 7378 | 6661 | 7298 | 6532 | 6576 |
| CDS | 7136 | 6235 | 7076 | 6264 | 6418 |
| RNA | 63 | 55 | 60 | 62 | 61 |
Fig. 4Core and accessory genome analysis of five Mesorhizobium strains
Fig. 5The pathway of synthesis and degradation of nitrogen