| Literature DB >> 28066333 |
Phillip J Brumm1, Krishne Gowda2, Frank T Robb3, David A Mead4.
Abstract
Here we report the complete genome sequence of the chemoorganotrophic, extremely thermophilic bacterium, Dictyoglomus turgidum, which is a Gram negative, strictly anaerobic bacterium. D. turgidum and D. thermophilum together form the Dictyoglomi phylum. The two Dictyoglomus genomes are highly syntenic, and both are distantly related to Caldicellulosiruptor spp. D. turgidum is able to grow on a wide variety of polysaccharide substrates due to significant genomic commitment to glycosyl hydrolases, 16 of which were cloned and expressed in our study. The GH5, GH10, and GH42 enzymes characterized in this study suggest that D. turgidum can utilize most plant-based polysaccharides except crystalline cellulose. The DNA polymerase I enzyme was also expressed and characterized. The pure enzyme showed improved amplification of long PCR targets compared to Taq polymerase. The genome contains a full complement of DNA modifying enzymes, and an unusually high copy number (4) of a new, ancestral family of polB type nucleotidyltransferases designated as MNT (minimal nucleotidyltransferases). Considering its optimal growth at 72°C, D. turgidum has an anomalously low G+C content of 39.9% that may account for the presence of reverse gyrase, usually associated with hyperthermophiles.Entities:
Keywords: DNA polymerase; Dictyoglomi; Dictyoglomus turgidum; biomass degradation; glucanase; phage; reverse gyrase; thermophile
Year: 2016 PMID: 28066333 PMCID: PMC5167688 DOI: 10.3389/fmicb.2016.01979
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Genome map of . From outside to the center: genes on forward strand (color by COG categories); genes on reverse strand (color by COG categories); RNA genes (tRNAs green, rRNAs red, other RNAs black); GC content; GC skew.
Number of genes associated with general COG functional categories.
| J | 168 | 11.0% | Translation, ribosomal structure and biogenesis |
| K | 76 | 5.0% | Transcription |
| L | 61 | 4.0% | Replication, recombination and repair |
| B | 1 | 0.1% | Chromatin structure and dynamics |
| D | 19 | 1.2% | Cell cycle control, Cell division, chromosome partitioning |
| V | 40 | 2.6% | Defense mechanisms |
| T | 48 | 3.1% | Signal transduction mechanisms |
| M | 87 | 5.7% | Cell wall/membrane biogenesis |
| N | 20 | 1.3% | Cell motility |
| U | 18 | 1.2% | Intracellular trafficking and secretion |
| O | 61 | 4.0% | Posttranslational modification, protein turnover, chaperones |
| C | 79 | 5.2% | Energy production and conversion |
| E | 170 | 11.1% | Amino acid transport and metabolism |
| F | 60 | 3.9% | Nucleotide transport and metabolism |
| H | 73 | 4.8% | Coenzyme transport and metabolism |
| I | 44 | 2.9% | Lipid transport and metabolism |
| P | 77 | 5.0% | Inorganic ion transport and metabolism |
| Q | 18 | 1.2% | Secondary metabolites biosynthesis, transport and catabolism |
| R | 130 | 8.5% | General function prediction only |
| S | 58 | 3.8% | Function unknown |
| – | 511 | 27.4% | Not in COGs |
Highlighted in bold, COG class G. The fraction of the genes annotated as members of this class is greater than the fraction observed for 95% of genomes in the MicrobesOnline database.
Figure 2Synteny plot of selected genomes. MUMmer (Delcher et al., 2003) was used to generate the dotplot diagram between sets of two genomes. The six frame amino acid translation of the DNA input sequences were used for comparing genomes using PROmer software. Clockwise from top (A) genomes of D. turgidum and D. thermophilum; (B) genomes of T. thermophilus and T. aquaticus; (C) genomes of C. bescii and C. saccharolyticus.
Figure 3Molecular phylogenetic analysis of . Molecular phylogenetic analysis by Maximum Likelihood method was detailed in the Material and Methods Section. The bootstrap consensus tree inferred from 550 replicates [2] is taken to represent the evolutionary history of the taxa analyzed. Branches corresponding to partitions reproduced in less than 50% bootstrap replicates are collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (550 replicates) are shown next to the branches. Sequences used for the analysis are: Dictyoglomus turgidum strain DSM 6724; NR_074885; Dictyoglomus thermophilum strain H-6-12, NR_029235.1; Fervidicola ferrireducens strain Y170, NR_044504.1; Thermosediminibacter oceani strain DSM 16646; NR_074461.1; Caldicellulosiruptor saccharolyticus strain DSM 8903; NR_074845.1; Caldicellulosiruptor hydrothermalis strain 108, NR_074767.1; Caldicellulosiruptor bescii strain DSM 6725; NR_074788.1; Desulfotomaculum kuznetsovii strain DSM 6115; NR_075068.1; Thermovirga lienii strain DSM 17291; NR_074606.1; Thermotoga petrophila strain RKU-10, NR_042374.1; Thermotoga naphthophila strain RKU-10, NR_112092.1; Thermotoga maritima strain MSB-8, NR_029163.1; and Geobacillus thermoglucosidasius strain ATCC 43742; NR_112058.1.
Annotated secreted polysaccharide-degrading enzymes.
| Dtur_0097 | GH 44 | β-mannanase | Calkro_0851 | 70.1% |
| Dtur_0172 | GH 28 | pectinase | Cphy_3310 | 47.1% |
| Dtur_0243 | GH 11 | xylanase | Calkro_0081 | 83.7% |
| Dtur_0276 | GH 5 | cellulase | Mahau_0466 | 59.9% |
| Dtur_0277 | GH 26 | β-mannanase | BG52_11385 | 52.7% |
| Dtur_0430 | PL 1 | pectate lyase | SNOD_03765 | 42.1% |
| Dtur_0431 | PL 1 | pectate lyase | M769_0111315 | 60.1% |
| Dtur_0432 | PLNC | pectate lyase | CSE_02370 | 57.3% |
| Dtur_0433 | CE 8 | pectin esterase | Calkro_0154 | 56.0% |
| Dtur_0628 | GH 12 | curdlanase | CTN_1107 | 48.4% |
| Dtur_0669 | GH 5 | cellulase | Mahau_0466 | 54.7% |
| Dtur_0675 | GH 57 | α-amylase | ANT_11030 | 41.3% |
| Dtur_0676 | CBM9 | α-amylase | COCOR_00322 | 39.7% |
| Dtur_0857 | GH 53 | β-galactanase | TRQ7_08325 | 56.5% |
| Dtur_1586 | GH 5 | cellulase | BSONL12_10711 | 41.5% |
| Dtur_1675 | GH 13 | α-amylase | CAAU_0986 | 51.6% |
| Dtur_1715 | GH 10 | xylanase | Pmob_0231 | 46.9% |
| Dtur_1729 | GH 43 | β-xylosidase | Csac_1560 | 67.9% |
| Dtur_1739 | GH 51 | β-xylosidase | Calhy_1625 | 58.9% |
| Dtur_1740 | GH 39 | β-xylosidase | TRQ7_03440 | 38.3% |
Annotated intracellular polysaccharide-degrading enzymes.
| Dtur_0081 | GH 2 | β-galactosidase | Calhy_1828 | 60.9% |
| Dtur_0157 | GH 4 | α-glucosidase | Mc24_02443 | 47.5% |
| Dtur_0171 | GH 31 | α-glucosidase | A500_11654 | 44.9% |
| Dtur_0219 | GH 3 | β-glucosidase | 67.3% | |
| Dtur_0222 | GH 20 | β-hexosaminidase | CDSM653_01797 | 67.2% |
| Dtur_0242 | CE NC | feruloyl esterase | TM_0033 | 55.1% |
| Dtur_0265 | CE 7 | acetyl xylan esterase | Tmari_0074 | 66.4% |
| Dtur_0289 | GH 3 | β-glucosidase | Cst_c03130 | 66.8% |
| Dtur_0315 | GH 29 | α-fucosidase | Tthe_0662 | 60.7% |
| Dtur_0320 | GH 31 | α-glucosidase | Csac_1354 | 65.9% |
| Dtur_0321 | GH 3 | β-glucosidase | Cst_c12090 | 50.1% |
| Dtur_0384 | GH 4 | α-glucosidase | CTER_5006 | 48.4% |
| Dtur_0435 | PL 1 | pectate lyase | MB27_42800 | 36.0% |
| Dtur_0440 | GH 4 | α-galacturonidase | BTS2_1711 | 61.6% |
| Dtur_0450 | CE 4 | deacetylase | Tnap_0743 | 67.4% |
| Dtur_0451 | GH 16 | curdlanase | TRQ7_04835 | 50.9% |
| Dtur_0462 | GH 1 | β-glucosidase | CLDAP_02840 | 48.5% |
| Dtur_0490 | GH 31 | α-glucosidase | Tbis_2416 | 45.4% |
| Dtur_0502 | GH 127 | β-L-arabinofuranosidase | CTN_0404 | 56.3% |
| Dtur_0505 | GH 42 | β-galactosidase | Mahau_1293 | 59.2% |
| Dtur_0523 | GH 18 | chitinase | Bccel_2454 | 50.1% |
| Dtur_0551 | GH 32 | invertase | Calhy_2186 | 47.6% |
| Dtur_0629 | GH 26 | β-mannanase | Calkro_1144 | 54.5% |
| Dtur_0650 | GH 31 | α-glucosidase | TheetDRAFT_1156 | 45.2% |
| Dtur_0658 | GH 130 | α-D-mannosyltransferase | X274_02975 | 41.2% |
| Dtur_0670 | GH 5 | cellulase | Mahau_0466 | 61.7% |
| Dtur_0671 | GH 5 | cellulase | TM_1752 | 58.7% |
| Dtur_0770 | GH 57 | α-amylase | BROSI_A0626 | 37.3% |
| Dtur_0794 | GH 13 | α-amylase | AC812_10325 | 35.3% |
| Dtur_0852 | GH 3 | β-glucosidase | M164_2324 | 58.2% |
| Dtur_0895 | GH 57 | α-amylase | TSIB_1115 | 46.5% |
| Dtur_0896 | GH 57 | α-amylase | Calab_2422 | 40.9% |
| Dtur_1539 | GH 2 | β-glucuronidase | Calkro_0120 | 60.7% |
| Dtur_1647 | GH 10 | xylanase | PaelaDRAFT_3013 | 51.2% |
| Dtur_1670 | GH 36 | α-galactosidase | Calla_1244 | 77.7% |
| Dtur_1677 | GH 4 | β-glucosidase | L21TH_1859 | 47.0% |
| Dtur_1714 | GH 67 | α-glucuronidase | Mc24_01903 | 69.4% |
| Dtur_1723 | GH 3 | β-glucosidase | 46.7% | |
| Dtur_1735 | GH 51 | β-xylosidase | COB47_1422 | 70.2% |
| Dtur_1749 | GH 4 | α-glucosidase | TRQ7_00895 | 68.7% |
| Dtur_1758 | GH 38 | α-mannosidase | CTN_0786 | 41.3% |
| Dtur_1799 | GH 1 | β-glucosidase | Hore_15280 | 57.7% |
| Dtur_1800 | GH 43 | β-xylosidase | Athe_2555 | 82.9% |
| Dtur_1802 | GH 2 | β-galactosidase | Thewi_0408 | 42.2% |
Enzymatic activity of cloned gene products.
| Dtur_0462 | GH 1 | β-glucosidase | MUA, MUC, MUG, MUX, XG |
| Dtur_1799 | GH 1 | β-glucosidase | MUA, MUC, MUG, MUX, XG |
| Dtur_0852 | GH 3 | β-glucosidase | MUA, MUC, MUG, MUX |
| Dtur_1723 | GH 3 | β-glucosidase | MUA, MUG, MUX |
| Dtur_0276 | GH 5 | Cellulase | BG |
| Dtur_0669 | GH 5 | Cellulase | BG, GM |
| Dtur_0670 | GH 5 | Cellulase | AX, BG, GM, HEC, XG |
| Dtur_0671 | GH 5 | Cellulase | GM |
| Dtur_1647 | GH 10 | Xylanase | AX, ARA, BG, HEC, MUG, MUX |
| Dtur_1715 | GH 10 | Xylanase | AX |
| Dtur_1670 | GH 36 | α-galactosidase | XAG |
| Dtur_0505 | GH 42 | β-galactosidase | MUA, MUX |
| Dtur_1729 | GH 43 | α-arabinase | ARA |
| Dtur_0857 | GH 53 | β-galactanase | MUA, XAG |
| Dtur_0675 | GH 57 | α-amylase | PUL |
| Dtur_1714 | GH 67 | α-glucuronidase | Xylan, xylooligosaccharides |
Legend: MUC, 4-methylumbelliferyl-β-D-cellobioside; MUX, 4-methylumbelliferyl-β-D–xylopyranoside; MUG, 4-methylumbelliferyl-β-D-glucoyranoside; MUA, 4-methylumbelliferyl-α-D-arabinofuranoside; XAG, 5-Bromo-4-chloro-3-indolyl α-D-galactopyranoside; XG, 5-Bromo-4-chloro-3-indolyl β-D-galactopyranoside; AR, AZCL-arabinan; AX, AZCL-arabinoxylan; BG, AZCL-β-glucan; GL, AZCL-galactan; GM, AZCL-galactomannan; HEC, AZCL-hydroxyethyl cellulose; PUL, AZCL-pullulan; XG, AZCL-xyloglucan.
.
| 0745 | ATP-dependent DNA helicase | |
| 1397 | ATP-dependent DNA helicase PcrA | |
| 1140 | ATP-dependent DNA helicase RecG | |
| 0780 | ATP-dependent DNA ligase | uncultured |
| 1514 | Bacterial nucleoid DNA-binding protein | |
| 1151 | Cell division protein FtsK | |
| 0001 | Chromosomal replication initiator protein DnaA | |
| 1468 | Chromosome segregation and condensation protein ScpA | |
| 1467 | Chromosome segregation and condensation protein ScpB | |
| 0947 | Chromosome segregation protein SMC | |
| 1104 | Competence protein ComE | |
| 1103 | Competence protein ComEA | |
| 1614 | Crossover junction endodeoxyribonuclease RuvC | |
| 0771 | Deoxyinosine 3'endonuclease (endonuclease V) | bacterium JGI-24/54 |
| 1268 | DNA and RNA Helicase | candidate division |
| 1547 | DNA gyrase B subunit | |
| 1263 | DNA gyrase, A subunit | |
| 1478 | DNA integrity scanning protein DisA | |
| 1078 | DNA mismatch repair protein MutL | |
| 1077 | DNA mismatch repair protein MutS | |
| 0884 | DNA or RNA helicase of superfamily II (UvrB) | |
| 0104 | DNA polymerase beta domain-containing protein | |
| 0317 | DNA polymerase beta domain-containing protein | marine sediment metagenome/38 |
| 0545 | DNA polymerase beta domain-containing protein | |
| 1295 | DNA polymerase beta domain-containing protein* | |
| 0882 | DNA polymerase I 3'-5' exonuclease & polymerase domains | |
| 1391 | DNA polymerase III alpha subunit | |
| 1105 | DNA polymerase III delta subunit | |
| 0257 | DNA polymerase III gamma/tau subunits | |
| 0789 | DNA polymerase III gamma/tau subunits | marine sediment metagenome/39 |
| 1551 | DNA polymerase III sliding clamp subunit beta | |
| 1600 | DNA Polymerase X | |
| 1316 | DNA primase N | |
| 1527 | DNA protecting protein DprA | |
| 1549 | DNA recombination protein RecF | |
| 1625 | DNA repair exonuclease | |
| 0327 | DNA repair photolyase | |
| 0463 | DNA repair photolyase | |
| 1479 | DNA repair protein RadA | |
| 0881 | DNA repair protein RADC | |
| 1047 | DNA repair protein RecN | |
| 1321 | DNA repair protein RecO | |
| 0015 | DNA replication and repair protein RecF | |
| 1526 | DNA topoisomerase type I | |
| 1522 | Double-stranded DNA repair protein Rad50 | |
| 1626 | Double-stranded DNA repair protein Rad50 | |
| 0264 | Endonuclease IV | |
| 1485 | Excinuclease ABC subunit C | |
| 0885 | Excinuclease ATPase subunit (ABC-ATPase UvrA) | |
| 1613 | Holliday junction DNA helicase RuvA | |
| 1612 | Holliday junction DNA helicase RuvB | |
| 0792 | Holliday junction resolvasome helicase subunit | |
| 1284 | Integrase* | |
| 0886 | Methylated DNA-protein cysteine methyltransferase | |
| 1227 | Mg-dependent Dnase—deoxyribonuclease TatD | |
| 1308 | Mismatch repair ATPase (MutS family) | |
| 1294 | Modification methylase, type III R/M system* | |
| 0341 | N6-adenine-specific methylase | |
| 1141 | N6-adenine-specific methylase | |
| 1497 | Poly(A) polymerase | |
| 0683 | Predicted EndoIII-related endonuclease | |
| 0846 | Predicted EndoIII-related endonuclease | |
| 1024 | Predicted endonuclease involved in recombination | |
| 1530 | Predicted endonuclease related to Holliday junction resolvase | |
| 1709 | Predicted exonuclease | marine sediment metagenome/61 |
| 0102 | Predicted nucleic acid-binding protein contains PIN domain | |
| 1441 | Primosomal protein N' (replication factor Y) | human gut metagenome/29 |
| 1507 | Putative chromosome partioning protein | |
| 1162 | RecA/RadA recombinase | |
| 1524 | Recombinase XerD | Symbiobacterium thermophilum/47 |
| 0259 | Recombination protein RecR | |
| 1358 | Replicative DNA helicase | |
| 0014 | Reverse gyrase | |
| 0708 | Ribonuclease HII | |
| 1531 | Ribonuclease HII | |
| 0118 | Single stranded nucleic acid binding protein (R3H domain) | |
| 1602 | Single-stranded DNA exonuclease RecJ | |
| 1362 | Single-stranded DNA-binding protein | |
| 0786 | Single-stranded nucleic acid binding protein (R3H domain) | |
| 0880 | Smc1 chromosome segregation protein, putative | marine sediment metagenome/28 |
| 0202 | Thermostable 8-oxoguanine DNA glycosylase | |
| 1511 | Transcription-repair coupling factor | marine sediment metagenome/40 |
| 1297 | Type III restriction endonuclease subunit R* | bacterium JGI-6/67 |
| 1473 | Tyrosine recombinase XerD | |
| 1393 | Uracil-DNA glycosylase |
Prophage related genes are marked.
Figure 4Multiple sequence alignment of all five . From top to bottom: Dtur_0104, 114AA; Dtur_0317, 121AA, Dtur_0545, 135AA, Dtur_1295, 99AA, Dicth_0227, 114AA. Dtur_1295 is located in the prophage region of D. turdigum.
Figure 5Comparison of PCR efficacy between Dtur_0882 and Taq DNAP (A) and 3′-5′ exonuclease activity between Dtur_0882; Taq and OmniAmp DNAP (B). PCR amplicons of 0.9 kb (lane A2), 2.8 kb (lane A3), 5 kb (lane A4), or 10 kb (lane A5) were produced by Taq (T) or Dtur (D) DNAP. To assess exonuclease activity (lanes B2-9) lambda DNA restriction digested with Hind III was incubated with 5U of Taq (T), Dtur (D) or OmniAmp (A) DNAP (in duplicate) overnight at 37°C in PCR buffer. Lane 1 is a 1 kb DNA ladder (Promega).
.
| Dtur_0606 | CRISPR-associated protein, TM1812 family | Dicth_0458 |
| Dtur_0607 | CRISPR-associated protein DxTHG motif protein | Dicth_0187 |
| Dtur_0608 | CRISPR-associated RAMP protein, Csm5 family | Dicth_0186 |
| Dtur_0609 | CRISPR-associated RAMP protein, Csm4 family | Dicth_0185 |
| Dtur_0610 | CRISPR-associated RAMP protein, Csm3 family | Dicth_0184 |
| Dtur_0611 | CRISPR-associated RAMP protein, Csm2 family | Dicth_0183 |
| Dtur_0612 | CRISPR-associated RAMP protein, Csm1 family | Dicth_0182 |
| Dtur_0613 | CRISPR-associated protein, Csx3 family | Dicth_0181 |
| Dtur_0618 | CRISPR-associated protein, Cas2 family | Dicth_0165 |
| Dtur_0619 | CRISPR-associated protein, Cas1 family | Dicth_0164 |
| Dtur_0620 | CRISPR-associated endonuclease, Cas4 family | Dicth_0163 |
| Dtur_0621 | CRISPR-associated helicase, Cas3 family | Dicth_0162 |
| Dtur_0622 | CRISPR-associated protein, Cas5h family | Dicth_0161 |
| Dtur_0623 | CRISPR-associated protein, Csh2 family | Dicth_0160 |
| Dtur_0624 | CRISPR-associated protein, Csh1 family | Dicth_0159 |
| Dtur_0625 | CRISPR-associated protein, Cas6 family | Dicth_0158 |
Figure 6Evolutionary relationships of reverse gyrase. The evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model (Tamura et al., 2011) The tree with the highest log likelihood (−22290.6302) is shown. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then selecting the topology with superior log likelihood value. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 7 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 1122 positions in the final dataset. Evolutionary analyses were conducted in MEGA7 (Kumar et al., 2016). UniPpot sequences used for the analysis were: B8DYH3, Dictyoglomus turgidum strain DSM 6724; A7HMS7, Fervidobacterium nodosum strain DSM 5306; H9UDK4, Fervidobacterium pennivorans strain DSM 9078; C1DT23, Sulfurihydrogenibium azorense strain DSM 15241; B2V6S9, Sulfurihydrogenibium sp. strain YO3AOP; F8C2X1, Thermodesulfobacterium geofontis strain OPF15, and P95479; Pyrococcus furiosus strain DSM 3638.
Figure 7Evolutionary relationships of SpoVS proteins. The evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model. The bootstrap consensus tree inferred from 550 replicates is taken to represent the evolutionary history of the taxa analyzed. Branches corresponding to partitions reproduced in less than 50% bootstrap replicates are collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (550 replicates) are shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then selecting the topology with superior log likelihood value. The analysis involved 22 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 86 positions in the final dataset.