| Literature DB >> 31898479 |
Joni Frederick1,2,3, Fritha Hennessy1, Uli Horn4, Pilar de la Torre Cortés5, Marcel van den Broek5, Ulrich Strych6,7, Richard Willson6,8, Charles A Hefer9,10, Jean-Marc G Daran5, Trevor Sewell2, Linda G Otten11, Dean Brady12,13.
Abstract
BACKGROUND: Rhodococci are industrially important soil-dwelling Gram-positive bacteria that are well known for both nitrile hydrolysis and oxidative metabolism of aromatics. Rhodococcus rhodochrous ATCC BAA-870 is capable of metabolising a wide range of aliphatic and aromatic nitriles and amides. The genome of the organism was sequenced and analysed in order to better understand this whole cell biocatalyst.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31898479 PMCID: PMC6941271 DOI: 10.1186/s12864-019-6405-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fully sequenceda and well described Rhodococcus species ranked by completion date
| Organism | Date Completedb | Group | Reference | Chromosome (Mbp) | Plasmid (Mbp) | Total Size, Mbp | G + C % | Protein coding genes |
|---|---|---|---|---|---|---|---|---|
| 2018 | This study | This paper | 5.37 | 0.53 | 5.9 | 65 | 7548e | |
| 19-03-2013 | Centre National de la Recherche Scientifique, Institut des Sciences du Vegetal, France | NZ_CP007255 [ | 6.2 | 477,915; 91,729 | 6.8 | 62 | 6130 | |
| 26-11-2012 | National Laboratory of Macromolecules, Chinese Academy of Sciences, Beijing | NZ_CP003949 [ | 8.38 | 9 plasmids | 9.17 | 67 | 8947 | |
| 10-11-2011 | Massachusetts Institute of Technology and The Broad Institute | GCF_000234335 [ | – | – | 9.27 | 67 | 7910 | |
| 21-10-2009 | IREC (International | NC_014659 [ | 5.04 | None determined | 5.04 | 69 | 4540 | |
| 24-07-2006 | Genome British Columbia, Vancouver | NC_008268 [ | 7.8 | 1,123,075; 442,536; 332,361 | 9.7 | 67 | 8690 | |
| 31-03-2005 | Sequencing Center: National Institute of Technology and Evaluation, Japan | NC_012490 [ | 6.5 | 271,577; 104,014; 3637 | 6.9 | 62 | 6321 |
aAll sequences are completed and fully assembled, except GCF_000234335, which consists of 282 contigs
bDate completed refers to genome sequence completion/submission to database; plasmids may have been completed at another time. Total genome size comprises the chromosome and the plasmid sequence. Genome information of strains other than BAA-870 is obtained from the NCBI database
cTwo separate references, therefore 2 entries
dR. equi is renamed to R. hoagii
eBased on BASys annotation
Fig. 1Phylogenetic tree created using rhodococcal 16S rRNA ClustalW sequence alignments. Neighbour joining, phylogenetic cladogram created using Phylogeny in ClustalW, and ClustalO multiple sequence alignment of R. rhodochrous ATCC BAA-870 16S rRNA genes and other closely matched genes from rhodococcal species. R. rhodochrous ATCC BAA-870 contains four copies of the 16S rRNA gene (labelled RNA_1 to RNA_4) and are indicated with an asterisk. For clarity, only closely matched BLAST results with greater than 95.5% sequence identity and those with complete 16S rRNA gene sequences, or from complete genomes, are considered. Additionally, 16S rRNA gene sequences (obtained from the NCBI gene database) from R. jostii RHA1, R. fascians A44A and D188, R. equi 103S, R. erythropolis CCM2595, and R. aetherivorans strain IcdP1 are included for comparison. Strain names are preceded by their NCBI accession number, as well as sequence position if there are multiple copies of the 16S rRNA gene in the same species
Fig. 2BASys bacterial annotation summary view of the Rhodococcus rhodochrous ATCC BAA-870 genome. BASys visual representation of a the 5,370,537 bp chromosome, with a breakdown of the 6871 genes encoded, and b the 533,288 bp linear plasmid, with a breakdown of the 677 genes encoded. Different colours indicate different subsystems for catabolic and anabolic routes
Fig. 3Protein function breakdown of Rhodococcus rhodochrous ATCC BAA-870 based on BASys annotation COG classifications. Unknown proteins form the majority of proteins in the BASys annotated genome, and make up 55 and 59% respectively of genes in the a chromosome and b plasmid. For simplicity, functional categories less than 0.02% are not included in the graphic. Letters refer to COG functional categories, with one-letter abbreviations: C - Energy production and conversion; D - Cell division and chromosome partitioning; E - Amino acid transport and metabolism; F - Nucleotide transport and metabolism; G - Carbohydrate transport and metabolism; H - Coenzyme metabolism; I - Lipid metabolism; J - Translation, ribosomal structure and biogenesis; K - Transcription; L - DNA replication, recombination and repair; M - Cell envelope biogenesis, outer membrane; N - Secretion, motility and chemotaxis; O - Posttranslational modification, protein turnover, chaperones; P - Inorganic ion transport and metabolism; Q - Secondary metabolites biosynthesis, transport and catabolism; R - General function prediction only; S - COG of unknown function; T - Signal transduction mechanisms
Fig. 4RAST annotation summary of the Rhodococcus rhodochrous ATCC BAA-870 genome. RAST annotation results show a the subsystem coverage, b the subsystem coverage breakdown, and c organisation of the subsystems by cellular process as a percentage showing the distribution of annotations across defined structural and functional subsystem roles. RAST uses a subsystem approach, in which annotations are assigned to groups with similar functional or structural roles. For R. rhodochrous ATCC BAA-870, 26% of annotated genes belong to an identified functional role, or subsystem. The coverage breakdown shows the percentage of hypothetical and non-hypothetical annotations for genes assigned to subsystems and those for which a known functional role is not assigned (i.e. those not in the subsystem)
Comparison of nitrile converting enzymes in different Rhodococcus species
| Organism | Nitrilase | Nitrile Hydratase | NHase regulators | Amidase | Amidase Regulators | NCBI Assembly Reference |
|---|---|---|---|---|---|---|
| 2 (pl) | 1 (pl) | 4 (pl) | 7 (chr) 2 (pl) | 2 (pl) | this study | |
| – | 1 | 4 | 12 | 1 | GCF_000010105 [ | |
| – | 1 | – | 2 | – | GCF_000174835 (no reference) | |
| – | – | – | 11 | – | GCF_000196695 [ | |
| – | – | – | 11 | – | GCF_000164155 (no reference) | |
| 1 | 1 | – | 14 (chr) 1 (pl) | – | GCF_000014565 [ | |
| – | – | – | 13 | – | GCF_000010805 (no reference) | |
| – | 2 | – | 13 | 2 | GCF_000599555 GCF_000234335 [ | |
| – | 2 | 1 | 9 | – | GCF_001890475 [ | |
| 1 | 2 | 1 | 13 | 1 | GCF_001653035 [ |
Number of enzymes on the chromosome. If multiple enzymes are present on different genomic elements, the location is mentioned: chr chromosome or pl plasmids