| Literature DB >> 26478786 |
Sofie E De Meyer1, Matthew Parker2, Peter Van Berkum3, Rui Tian1, Rekha Seshadri4, T B K Reddy4, Victor Markowitz5, Natalia Ivanova4, Amrita Pati4, Tanja Woyke4, Nikos Kyrpides4, John Howieson1, Wayne Reeve1.
Abstract
Cupriavidus sp. strain AMP6 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a root nodule of Mimosa asperata collected in Santa Ana National Wildlife Refuge, Texas, in 2005. Mimosa asperata is the only legume described so far to exclusively associates with Cupriavidus symbionts. Moreover, strain AMP6 represents an early-diverging lineage within the symbiotic Cupriavidus group and has the capacity to develop an effective nitrogen-fixing symbiosis with three other species of Mimosa. Therefore, the genome of Cupriavidus sp. strain AMP6 enables comparative analyses of symbiotic trait evolution in this genus and here we describe the general features, together with sequence and annotation. The 7,579,563 bp high-quality permanent draft genome is arranged in 260 scaffolds of 262 contigs, contains 7,033 protein-coding genes and 97 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.Entities:
Keywords: Betaproteobacteria; GEBA-RNB; Mimosa asperata; Nitrogen fixation; Root-nodule bacteria; Texas
Year: 2015 PMID: 26478786 PMCID: PMC4609095 DOI: 10.1186/s40793-015-0074-1
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Images of Cupriavidus sp. strain AMP6 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right)
Fig. 2Phylogenetic tree highlighting the position of Cupriavidus sp. strain AMP6 (shown in blue print) relative to other type and non-type strains in the Cupriavidus genus using a 1,024 bp internal region of the 16S rRNA gene. Several Alpha-rhizobia sequences were used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [46]. The tree was build using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [30] have the GOLD ID mentioned after the strain number, otherwise the NCBI accession number is provided. Finished genomes are designated with an asterisk
Classification and general features of Cupriavidus sp. strain AMP6 in accordance with the MIGS recommendations [28] published by the Genome Standards Consortium [47]
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| (Type) strain AMP6 | TAS [ | ||
| Gram stain | Negative | TAS [ | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-sporulating | TAS [ | |
| Temperature range | Mesophile | TAS [ | |
| Optimum temperature | 28 °C | IDA | |
| pH range; Optimum | Not reported | ||
| Carbon source | Not reported | ||
| MIGS-6 | Habitat | Soil, root nodule on host | IDA |
| MIGS-6.3 | Salinity | Not reported | |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Symbiotic | IDA |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| MIGS-4 | Geographic location | Texas, USA | TAS [ |
| MIGS-5 | Nodule collection date | 2005 | TAS [ |
| MIGS-4.1 | Longitude | −98.138 | TAS [ |
| MIGS-4.2 | Latitude | 26.0794 | TAS [ |
| MIGS-4.4 | Altitude | 30 m | IDA |
Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [55]
Genome sequencing project information for Cupriavidus sp. strain AMP6
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality permanent draft |
| MIGS-28 | Libraries used | Illumina Std PE |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Fold coverage | 117.0x Illumina |
| MIGS-30 | Assemblers | Velvet 1.1.04, ALLPATHS V.r42328 |
| MIGS-32 | Gene calling methods | Prodigal 1.4 |
| Locus Tag | K309 | |
| Genbank ID | AUFE00000000 | |
| Genbank Date of Release | December 12, 2013 | |
| GOLD ID | Gp0009812 | |
| BIOPROJECT | PRJNA195776 | |
| MIGS-13 | Source Material Identifier | AMP6 |
| Project relevance | Symbiotic N2fixation, agriculture |
Genome statistics for Cupriavidus sp. AMP6
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 7,579,563 | 100.00 |
| DNA coding (bp) | 6,545,489 | 86.36 |
| DNA G + C (bp) | 4,961,426 | 65.46 |
| DNA scaffolds | 260 | 100.00 |
| Total genes | 7,130 | 100.00 |
| Protein-coding genes | 7,033 | 98.64 |
| RNA genes | 97 | 1.36 |
| Pseudo genes | 0 | 0.00 |
| Genes in internal clusters | 538 | 7.55 |
| Genes with function prediction | 5,721 | 80.24 |
| Genes assigned to COGs | 4,791 | 67.19 |
| Genes with Pfam domains | 5,837 | 81.87 |
| Genes with signal peptides | 681 | 9.55 |
| Genes with transmembrane helices | 1,477 | 20.72 |
| CRISPR repeats | 1 |
Number of genes associated with general COG functional categories
| Code | Value | % age | COG Category |
|---|---|---|---|
| J | 182 | 3.37 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.02 | RNA processing and modification |
| K | 527 | 9.76 | Transcription |
| L | 188 | 3.48 | Replication, recombination and repair |
| B | 4 | 0.07 | Chromatin structure and dynamics |
| D | 32 | 0.59 | Cell cycle control, Cell division, chromosome partitioning |
| V | 59 | 1.09 | Defense mechanisms |
| T | 210 | 3.89 | Signal transduction mechanisms |
| M | 275 | 5.09 | Cell wall/membrane/envelope biogenesis |
| N | 96 | 1.78 | Cell motility |
| U | 119 | 2.20 | Intracellular trafficking, secretion, and vesicular transport |
| O | 164 | 3.04 | Posttranslational modification, protein turnover, chaperones |
| C | 447 | 8.28 | Energy production and conversion |
| G | 256 | 4.74 | Carbohydrate transport and metabolism |
| E | 501 | 9.28 | Amino acid transport and metabolism |
| F | 90 | 1.67 | Nucleotide transport and metabolism |
| H | 185 | 3.43 | Coenzyme transport and metabolism |
| I | 344 | 6.37 | Lipid transport and metabolism |
| P | 272 | 5.04 | Inorganic ion transport and metabolism |
| Q | 235 | 4.35 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 659 | 12.21 | General function prediction only |
| S | 552 | 10.23 | Function unknown |
| - | 2339 | 32.81 | Not in COGS |
The total is based on the total number of protein coding genes in the genome