| Literature DB >> 31719174 |
Christian M K Sieber1,2, Blair G Paul3, Cindy J Castelle1, Ping Hu4,5, Susannah G Tringe2, David L Valentine3, Gary L Andersen4,6, Jillian F Banfield7,6.
Abstract
The candidate phyla radiation (CPR) comprises a large monophyletic group of bacterial lineages known almost exclusively based on genomes obtained using cultivation-independent methods. Within the CPR, Gracilibacteria (BD1-5) are particularly poorly understood due to undersampling and the inherent fragmented nature of available genomes. Here, we report the first closed, curated genome of a gracilibacterium from an enrichment experiment inoculated from the Gulf of Mexico and designed to investigate hydrocarbon degradation. The gracilibacterium rose in abundance after the community switched to dominance by Colwellia Notably, we predict that this gracilibacterium completely lacks glycolysis, the pentose phosphate and Entner-Doudoroff pathways. It appears to acquire pyruvate, acetyl coenzyme A (acetyl-CoA), and oxaloacetate via degradation of externally derived citrate, malate, and amino acids and may use compound interconversion and oxidoreductases to generate and recycle reductive power. The initial genome assembly was fragmented in an unusual gene that is hypervariable within a repeat region. Such extreme local variation is rare but characteristic of genes that confer traits under pressure to diversify within a population. Notably, the four major repeated 9-mer nucleotide sequences all generate a proline-threonine-aspartic acid (PTD) repeat. The genome of an abundant Colwellia psychrerythraea population has a large extracellular protein that also contains the repeated PTD motif. Although we do not know the host for the BD1-5 cell, the high relative abundance of the C. psychrerythraea population and the shared surface protein repeat may indicate an association between these bacteria.IMPORTANCE CPR bacteria are generally predicted to be symbionts due to their extensive biosynthetic deficits. Although monophyletic, they are not monolithic in terms of their lifestyles. The organism described here appears to have evolved an unusual metabolic platform not reliant on glucose or pentose sugars. Its biology appears to be centered around bacterial host-derived compounds and/or cell detritus. Amino acids likely provide building blocks for nucleic acids, peptidoglycan, and protein synthesis. We resolved an unusual repeat region that would be invisible without genome curation. The nucleotide sequence is apparently under strong diversifying selection, but the amino acid sequence is under stabilizing selection. The amino acid repeat also occurs in a surface protein of a coexisting bacterium, suggesting colocation and possibly interdependence.Entities:
Keywords: BD1-5; CPR; candidate phyla radiation; genomes from metagenomes; gracilibacteria; surface proteins
Mesh:
Substances:
Year: 2019 PMID: 31719174 PMCID: PMC6851277 DOI: 10.1128/mBio.02128-19
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1Repeat locus from the BD1-5 genome. Colored arrows represent repeated sequence blocks, the sequences for which are shown in the “Repeats” insert. Sets of arrows represent reads, and reads linked within this region to paired reads are indicated by a thin connecting line.
FIG 2Diagram showing the GC skew (gray dots) and calculated cumulative GC skew (green line) across the finished BD1-5 genome. The pattern is typical of a correctly assembled genome of a bacterium that undergoes bidirectional replication from origin to terminus.
General information about the Gracilibacteria genome
| Genome statistical parameter | Result for |
|---|---|
| Genome size (Mb) | 1.343 |
| GC content (%) | 28.87 |
| Avg coverage | 42.26 |
| No. of: | |
| Protein-coding genes | 1,243 |
| tRNA genes | 33 |
| rRNA genes | 1 |
| Transcription factors | 29 |
| Secreted proteins (signal peptide) | 66 |
| Small secreted proteins (<300 aa) | 24 |
| Non-classically secreted proteins (no signal peptide) | 104 |
| Small non-classically secreted proteins (<300 aa) | 61 |
| Transmembrane proteins (>3 TM domains) | 122 |
| Transporters | 80 |
| ABC transporter-related proteins | 11 |
| Amino acid permeases | 34 |
FIG 3Phylogenetic placement of the Gracilibacteria genome from sample BD02T64 reported here. The 16S rRNA tree was constructed using the maximum likelihood method RAxML. The small black circles indicate nodes with values of >70% bootstrap support. 16S rRNA genes retrieved from genomes are indicated by green circles. Dotted circles represent published draft genomes, and the full circle indicates the finished and curated genome from this study. Colored circles indicate the type of ecosystem from which sequences were obtained. The full tree file is provided in the Data Set S1.
FIG 4Cell cartoon depicting a reconstruction of the metabolism of the gracilibacterium. Bold text indicates prominent functions, blue text indicates resources inferred to be externally derived. * indicates that reactions for biosynthesis of cofactors require a precursor compound. Abbreviations: PEP, phosphoenolpyruvate; UDP-GlcNAc, UDP-N-acetyl-α-d-glucosamine; OFOR, 2-oxoacid ferredoxin oxidoreductase; 3PG, 3-phospho-d-glycerate; 3-PoxyP, 3-phosphonooxypyruvate; 2-oxoglu, 2-oxoglutarate; FNR, ferredoxin reductase; PPPi, PPi, and Pi: phosphate compounds interconverted by inorganic pyrophosphatase; Mk-n, metaquinone; Mk-l, metaquinol; Succ. semiald., succinate semialdehyde; l-Glu, l-glutamate; R-COOH, a carboxylic acid; CPG, cyanophycin; FG, N-formyl-l-glutamate. PTD is a tripeptide repeat.