| Literature DB >> 33261555 |
Chang Ding1, Lorenz Adrian2,3.
Abstract
BACKGROUND: Anaerobic ammonium oxidizing bacteria (anammox bacteria) are contributing significantly to the nitrogen cycle and are successfully used in wastewater treatment. Due to the lack of complete genomes in the databases, little is known about the stability and variability of their genomes and how the genomes evolve in response to changing environments.Entities:
Keywords: CRISPR; Genome sequencing; Nitrite reductase; Proteomics; S-layer protein; “Ca. Kuenenia stuttgartiensis”
Mesh:
Substances:
Year: 2020 PMID: 33261555 PMCID: PMC7709395 DOI: 10.1186/s12864-020-07242-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Comparison of the genomes of “Ca. Kuenenia stuttgartiensis” strain KUST, strain MBR1, and strain CSTR1
| Strain | KUST [ | MBR1 [ | CSTR1 (this study) |
|---|---|---|---|
| Status | High quality draft (5 supercontigs) | Complete genome | Complete genome |
| Size | 4,218,325 | 4,406,132 | 4,334,932 |
| Sequencing technology | Sanger sequencing | SMRT sequencing | SMRT sequencing |
| GC % | 41.0 | 41.1 | 41.0 |
| Prediction of coding sequence | dps/orpheus, AMIGene, glimmer/rbsfinder and genemarks/genemark. hmm | Prodigal [ | AMIGene in MaGe pipeline [ |
| # Coding sequences (CDSs) | 4664 | 4043 | 4965 |
| Coding density % | 85.9 | 81.2 | 85.7 |
| Average gene length | 776 | 902 | 751 |
| # rRNA operons (5S, 16S, 23S) | 1 | 1 | 1 |
| # tRNAs | 44 | 45 | 45 |
MBR1 has much fewer predicted CDSs than the other two genomes and therefore a lower coding density and a higher average gene length due to usage of a different coding sequence prediction tool (Prodigal)
Fig. 1Visualized genome alignment of the genome of “Ca. Kuenenia stuttgartiensis” strain CSTR1 and MBR1b. In total, 20 locally collinear blocks (LCBs) were identified by Mauve and labelled in the figure. Histograms within the LCBs indicate sequence similarity within the regions
Genome sequence similarity among the genomes of “Ca. Kuenenia stuttgartiensis” KUST, strain MBR1, and strain CSTR1
| KUST | MBR1 | CSTR1 | |
|---|---|---|---|
| Tetra 0.9981 | Tetra 0.9985 | ||
| ANI 0.9940 | Tetra 0.9994 | ||
| ANI 0.9960 | ANI 0.9922 |
ANI - average nucleotide identity between genomes as calculated using the algorithm OrthoANIu [21, 22]
Tetra - correlation of tetranucleotide signatures between genomes as calculated using the program Tetra v1.02 [23]
Fig. 2Visualized alignment of the largest CRISPR locus in “Ca. Kuenenia stuttgartiensis” genomes KUST, MBR1b, and CSTR1. The CRISPR locus in KUST2012 is almost identical to that in the KUST, except that the 31st and 32nd spacers were not present in KUST2012 (counting from the leader sequence end) and is therefore not shown. Orange: identical spacers among the three genomes; Blue/Green/Purple: identical spacers between two of the three genomes; Yellow: unique spacers; White: gap in the alignment; Brown: type I-B Cas gene clusters. These Cas gene clusters span a length of ~ 6 kbp and are not drawn to scale. Arrows indicate the location of a 138-bp leader sequence
Fig. 3Amino acid sequence identity comparison among the four S-layer protein homologs in KUST (kustd1514), KUST2012 (kustd1514b), strain MBR1 (KSMBR1_1301), and strain CSTR1 (KsCSTR_09970). Shown are always comparisons of the two sequences indicated on the right side. Positions (aa) refer to the amino acid (aa) positions in the alignment of the four proteins. Window size: 20 amino acids. Step: 1 amino acid