| Literature DB >> 26594307 |
Andrew F Billings1, Julian L Fortney2, Terry C Hazen3, Blake Simmons4, Karen W Davenport5, Lynne Goodwin5, Natalia Ivanova6, Nikos C Kyrpides6, Konstantinos Mavromatis6, Tanja Woyke6, Kristen M DeAngelis1.
Abstract
Tolumonas lignolytica BRL6-1(T) sp. nov. is the type strain of T. lignolytica sp. nov., a proposed novel species of the Tolumonas genus. This strain was isolated from tropical rainforest soils based on its ability to utilize lignin as a sole carbon source. Cells of Tolumonas lignolytica BRL6-1(T) are mesophilic, non-spore forming, Gram-negative rods that are oxidase and catalase negative. The genome for this isolate was sequenced and returned in seven unique contigs totaling 3.6Mbp, enabling the characterization of several putative pathways for lignin breakdown. Particularly, we found an extracellular peroxidase involved in lignin depolymerization, as well as several enzymes involved in β-aryl ether bond cleavage, which is the most abundant linkage between lignin monomers. We also found genes for enzymes involved in ferulic acid metabolism, which is a common product of lignin breakdown. By characterizing pathways and enzymes employed in the bacterial breakdown of lignin in anaerobic environments, this work should assist in the efficient engineering of biofuel production from lignocellulosic material.Entities:
Keywords: Anaerobic lignin degradation; Tolumonas; Tropical forest soil isolate
Year: 2015 PMID: 26594307 PMCID: PMC4653933 DOI: 10.1186/s40793-015-0100-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Tolumonas lignolytica BRL6-1
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | |||
| Strain BRL6-1T | |||
| Gram stain | negative | IDA | |
| Cell shape | rod | IDA | |
| Motility | motile | IDA | |
| Sporulation | non-sporulating | IDA | |
| Temperature range | 15–37 °C | IDA | |
| Optimum temperature | 30 °C | IDA | |
| pH range; Optimum | 4.5–8.5; 7 | ||
| Carbon source | glucose, lactose, others (Table | IDA | |
| MIGS-6 | Habitat | Tropical forest soil | TAS [ |
| MIGS-6.3 | Salinity | 1 % NaCl | IDA |
| MIGS-22 | Oxygen requirement | Facultative aerobe | IDA |
| MIGS-15 | Biotic relationship | Free-living | IDA |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| MIGS-4 | Geographic location | Soil collected from a subtropical lower montane wet forest in the Luquillo Experimental Forest, part of the NSF- sponsored Long-Term Ecological Research program in Puerto Rico | IDA |
| MIGS-5 | Sample collection | July 2009 | IDA |
| MIGS-4.1 | Latitude | 18.268 N | IDA |
| MIGS-4.2 | Longitude | 65.760 W | IDA |
| MIGS-4.4 | Altitude | 375 m | IDA |
aEvidence codes - IDA Inferred from Direct Assay, TAS, Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [53]
Fig. 1Photomicrograph of Tolumonas lignolytica BRL6-1T. The sample was prepared by growing a culture overnight in 10 % Tryptic Soy Broth (TSB), then adhering to microscope slide coverslips that were treated with poly-lysine to facilitate attachment of cells. The image was taken with a Nikon E500 Fluorescence Microscope
Fig. 2Phylogenetic tree highlighting the position of Tolumonas lignolytica BRL6-1T among the Aeromonadales. a The phylogenetic tree based on 16S ribosomal RNA gene sequence was inferred using the Neighbor-Joining method [54] within MEGA6 [55]. Bootstrap values of 1000 replicate trees are shown at the branches [56]. The tree is drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances were computed using the Jukes-Cantor method [57] and are in units of the number of base substitutions per site. All positions containing gaps and missing data were eliminated, creating a total of 1234 positions in the final dataset. GenBank accession numbers are shown in parentheses after strain numbers. Type strains are indicated with a superscript T. Organisms with genomes available are indicated by an asterisk before the name. b Whole genomes were hierarchically clustered based on COG profiles using tools in IMG [58]. T. lignolytica BRL6-1T is indicated in bold in both phylogenetic trees
Sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | Three libraries: Illumina std shotgun library, Illumina long insert mate pair library, Pacbio SMRTbell™ library |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000, PacBio RS |
| MIGS-31.2 | Fold coverage | 2680X, 1157X, 33X |
| MIGS-30 | Assemblers | AllpathsLG |
| MIGS-32 | Gene calling method | Prodigal, GenePRIMP |
| Locus Tag | H027 | |
| GenBank ID | AZUK00000000.1 | |
| GenBank Date of Release | May 23, 2014 | |
| GOLD ID | Gi0037033 | |
| BIOPROJECT | PRJNA186459 | |
| MIGS-13 | Source Material Identifier | In submission |
| Project relevance | Biofuels, Environmental |
Fig. 3Z-curve graph of T. lignolytica BRL6-1T contig 1. The Z-curve graph was made using Ori-Finder software. The sequence is rotated such that the beginning and ending are the maximum of the GC disparity curve. The short vertical red line indicates the location of the indicator gene dnaA, while purple peaks with diamonds indicate DnaA box clusters. The shift in base pair disparity coincides with the location of the dnaA gene and a large cluster of DnaA boxes, which together strongly suggest the location of the origin of replication. MK disparity represents the ratio of amino (A,C) to keto (G,T) bases, while RY disparity represents the ratio of purine (A,G) to pyrimidine (C,T) bases. Finally, the AT and GC disparities represent the ratio of weak to strong hydrogen bond forming bases, respectively
Summary of genome: one chromosome and six other contigs
| Label | Size (kb) | Topology | INSDC identifier |
|---|---|---|---|
| Chromosome 1 | 3187 | linear | AZUK01000001 |
| Contig 2 | 316 | linear | AZUK01000002 |
| Contig 3 | 44 | linear | AZUK01000003 |
| Contig 4 | 24 | linear | AZUK01000004 |
| Contig 6 | 12 | linear | AZUK01000006 |
| Contig 7 | 11 | linear | AZUK01000007 |
| Contig 8 | 10 | linear | AZUK01000008 |
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of total |
|---|---|---|
| Genome size (bp) | 3,607,052 | 100.00 |
| DNA coding (bp) | 3,268,621 | 89.99 |
| DNA G+C (bp) | 1,715,377 | 47.56 |
| DNA scaffolds | 7 | 100.00 |
| Total genes | 3427 | 100.00 |
| Protein coding genes | 3296 | 96.18 |
| RNA genes | 131 | 3.82 |
| Pseudo genes | 24 | 0.70 |
| Genes in internal clusters | 2431 | 70.94 |
| Genes with function prediction | 2571 | 75.02 |
| Genes assigned to COGs | 2797 | 81.62 |
| Genes with Pfam domains | 2916 | 85.09 |
| Genes with signal peptides | 273 | 7.97 |
| Genes with transmembrane helices | 755 | 22.03 |
| CRISPR repeats | 1 | – |
The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 168 | 5.10 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03 | RNA processing and modification |
| K | 217 | 6.58 | Transcription |
| L | 129 | 3.91 | Replication, recombination and repair |
| B | 0 | 0.00s | Chromatin structure and dynamics |
| D | 35 | 1.06 | Cell cycle control, cell division, chromosome partitioning |
| V | 37 | 1.12 | Defense mechanisms |
| T | 171 | 5.19 | Signal transduction mechanisms |
| M | 159 | 4.82 | Cell wall/membrane biogenesis |
| N | 112 | 3.40 | Cell motility |
| U | 88 | 2.67 | Intracellular trafficking and secretion |
| O | 106 | 3.22 | Posttranslational modification, protein turnover, chaperones |
| C | 176 | 5.34 | Energy production and conversion |
| G | 247 | 7.49 | Carbohydrate transport and metabolism |
| E | 241 | 7.31 | Amino acid transport and metabolism |
| F | 73 | 2.21 | Nucleotide transport and metabolism |
| H | 144 | 4.37 | Coenzyme transport and metabolism |
| I | 50 | 1.52 | Lipid transport and metabolism |
| P | 126 | 3.82 | Inorganic ion transport and metabolism |
| Q | 31 | 0.94 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 259 | 7.86 | General function prediction only |
| S | 245 | 7.43 | Function unknown |
| – | 930 | 28.21 | Not in COGs |
The total is based on the total number of protein coding genes in the annotated genome
Fig. 4Oxidase and peroxidase enzyme activity for three Tolumonas organisms. Oxidative enzyme assays were performed by incubating isolates in nutrient broth with 25 mM L-DOPA. Peroxidase enzyme assays were performed in the same manner, with the addition of 3 % hydrogen peroxide. A negative control without inoculum was also measured and subtracted from experimental values
Carbon sources utilized by Tolumonas lignolytica BRL6-1T anaerobically
| Carbon source | Omnilog units |
|---|---|
| D-Trehalose | 168 |
| L-Arabinose | 166 |
| D-Fructose | 164 |
| D-Sorbitol | 161 |
| D-Gluconic Acid | 161 |
| a-D-Glucose | 161 |
| Maltotriose | 156 |
| N-Acetyl-D-Glucosamine | 145 |
| Sucrose | 131 |
| D-Galactose | 128 |
| Tween 20 | 128 |
| Maltose | 128 |
| D-Mannose | 108 |
| Tween 40 | 106 |
| D-Melibiose | 102 |
| D-Mannitol | 95 |
| D-Ribose | 91 |
| Tween 80 | 62 |
| L-Lyxose | 36 |
Carbon sources differentially utilized by Tolumonas species anaerobically
| Carbon source |
|
|
|
|---|---|---|---|
| D-Arabinose | + | − | + |
| D-Lactose | − | + | + |
| D-Xylose | − | − | + |
| Glycerol | − | − | + |
| Fumarate | − | + | − |
| Pyruvate | − | + | − |
Enzymes involved in lignin degradation with homologues in BRL6-1T genome
| Enzyme | Gene name | Characterized in | # of copies in genome | E value | Gene annotation in BRL6-1T genome |
|---|---|---|---|---|---|
| Cα-dehydrogenase involved in b-aryl ether cleavage | ligD |
| 2 | 6e-12; 4e-10 | 1)3-oxoacyl-[acyl-carrier-protein] reductase (EC 1.1.1.100) |
| Beta-etherase | ligF |
| 1 | 4e-11 | Glutathione S-transferase |
| Feruloyl-CoA synthetase | fcs |
| 1 | 1e-10 | Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
| Feruloyl-CoA hydratase/lyase | ferB |
| 1 | 7e-15 | 1,4-Dihydroxy-2-naphthoyl-CoA synthase (EC 4.1.3.36) |
| Feruloyl-CoA hydratase/lyase | ferB2 |
| 1 | 3e-32 | 1,4-Dihydroxy-2-naphthoyl-CoA synthase (EC 4.1.3.36) |
| Vanillin dehydrogenase | ligV |
| 5 | 4e-66; 3e-65; 2e-62; 2e-35; 8e-08 | 1) succinate semialdehyde dehydrogenase (EC 1.2.1.16) |
| Vanillate O-demethylase oxidoreductase | vanB |
| 3 | 9e-18; 7e-14; 9e-12 | 1)Hemoglobin-like flavoprotein |
| 3-carboxymuconate cycloisomerase | pcaB |
| 2 | 9e-13; 5e-09 | 1)Adenylosuccinate lyase (EC 4.3.2.2) |
| 4-carboxymuconolactone decarboxylase | pcaC |
| 3 | 6e-30; 7e-27; 1e-06 | Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit (all 3) |
| Β-ketoadipate enol-lactone hydrolase | pcaD |
| 2 | 5e-15; 5e-13 | 1)carboxylesterase BioH (pimeloyl-CoA synthesis) (EC 3.1.1.85) |
Enzyme protein sequences were obtained from UniProt. E values are based on blastp results performed in img/er against the T. lignolytica BRL6-1T genome
Fig. 5Growth curve of Tolumonas lignolytica BRL6-1T in mGV medium. Solid lines depict the growth (left axis) of the organism with and without lignin amendment, with error bars showing the standard error of triplicate samples. The dashed line shows lignin concentration (right axis) throughout the growth curve. Lignin concentration was measured by removing 1 ml of culture from anaerobic septum bottles, diluting 1:10 in distilled water, filtering out cells, then measuring the absorbance at 310 nm. These values were compared to a standard curve of known concentrations of lignin in mGV medium measured at this wavelength