| Literature DB >> 29225727 |
Laura T Morales1, Laura N González-García2, María C Orozco1, Silvia Restrepo2, Martha J Vives1,3.
Abstract
Crude oil contamination of soils and waters is a worldwide problem, which has been actively addressed in recent years. Sequencing genomes of microorganisms involved in the degradation of hydrocarbons have allowed the identification of several promoters, genes, and degradation pathways of these contaminants. This knowledge allows a better understanding of the functional dynamics of microbial degradation. Here, we report a first draft of the 44.2 Mbp genome assembly of an environmental strain of the fungus Scedosporium apiospermum. The assembly consisted of 178 high-quality DNA scaffolds with 1.93% of sequence repeats identified. A total of 11,195 protein-coding genes were predicted including a diverse group of gene families involved in hydrocarbon degradation pathways like dioxygenases and cytochrome P450. The metabolic pathways identified in the genome can potentially degrade hydrocarbons like chloroalkane/alkene, chorocyclohexane, and chlorobenzene, benzoate, aminobenzoate, fluorobenzoate, toluene, caprolactam, geraniol, naphthalene, styrene, atrazine, dioxin, xylene, ethylbenzene, and polycyclic aromatic hydrocarbons. The comparison analysis between this strain and the previous sequenced clinical strain showed important differences in terms of annotated genes involved in the hydrocarbon degradation process.Entities:
Keywords: Genome; Hydrocarbon degradation; Scedosporium apiospermum
Year: 2017 PMID: 29225727 PMCID: PMC5716253 DOI: 10.1186/s40793-017-0287-6
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Micrograph of Scedosporium apiospermum. a Optical microscopy of hyphae and conidia from a PDA culture, at 100× total magnification. Lactophenol cotton blue wet mount preparation. b Scanning electron microscopy of hyphae and conidia from a liquid culture grown in minimal salt medium plus crude oil as the sole carbon and energy source
Fig. 2Phylogenetic Analysis of S. apiospermum HDO1. Estimated relationships of S. apiospermum HDO1 with S. apiospermum IHEM 14462 and other species from the Microascaceae family. The tree shows the concatenated analysis of the Internal Transcribed Spacer, the Large Subunit and the Elongation factor gene regions. Sequences from reference strains were used (Additional file 1:Table S3). Support values represent Bootstrap support values (Maximum Likelihood)
Classification and general features of Scedosporium apiospermum strain HDO1
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family – Microascaceae | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| strain: HDO1 | |||
| Gram stain | n/a | ||
| Cell shape | Mycelium with septae | IDA | |
| Motility | non-motile | IDA | |
| Sporulation | Conidia | TAS [ | |
| Temperature range | 25–42 °C | TAS [ | |
| Optimum temperature | 30–37 °C | TAS [ | |
| pH range; Optimum | 5.5–8.5: 7.3 | TAS [ | |
| Carbon source | Glucose, sucrose, urea, ethanol, ribitol, xylitol, L-arabinitol, phenol, p-cresol, hydroquinone, 1,2,4-benzenetriol, catechol, 4-hydroxybenzylalcohol, 4-hydroxybenzaldehyde,, 4-hydroxybenzoate, protocatechuate, 3-oxoadipate, phenylbenzoate, naphthalene, pyrene, phenantrene, crude oil. | TAS [ | |
| MIGS-6 | Habitat | Soil and polluted water | TAS [ |
| MIGS-6.3 | Salinity | 1.7–2.8/Up to 5% in vitro | TAS [ |
| MIGS-22 | Oxygen requirement | Aerobic/Tolerate low pressure of O2 | TAS [ |
| MIGS-15 | Biotic relationship | free-living | IDA |
| MIGS-14 | Pathogenicity | Pathogenic | TAS [ |
| MIGS-4 | Geographic location | Bogotá, Colombia | IDA |
| MIGS-5 | Sample collection | 20 May 2008 | IDA |
| MIGS-4.1 | Latitude | 4.600659 | IDA |
| MIGS-4.2 | Longitude | −74.065592 | IDA |
| MIGS-4.4 | Altitude | 2658 m | IDA |
aEvidence codes: IDA inferred from direct assay (first time in publication); TAS traceable author statement (i.e., a direct report exists in the literature); NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These codes are from the Gene Ontology project [56]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Draft |
| MIGS-28 | Libraries used | One 250 pb Paired-End library, one 5 kb Mate pair library |
| MIGS 29 | Sequencing platforms | Illumina HiSeq2500 |
| MIGS 31.2 | Fold coverage | 540 x |
| MIGS 30 | Assemblers | Abyss 1.9.0.20 |
| MIGS 32 | Gene calling method | Augustus 3.0.3 |
| Locus Tag | BTW05 | |
| Genbank ID | MVOQ00000000 | |
| GenBank Date of Release | May 26, 2017 | |
| GOLD ID | NA | |
| BIOPROJECT | PRJNA357602 | |
| MIGS 13 | Source Material Identifier | Strain HDO1 Museo de Historia Natural Andes |
| Project relevance | Biotechnological |
Genomic statistics
| Attribute | Value | % of total |
|---|---|---|
| Genome size (bp) | 44,188,879 | 100 |
| DNA coding (bp) | 18,219,288 | 41.2 |
| DNA G + C (bp) | 22,057,939 | 49.91 |
| DNA scaffolds | 178 | 100 |
| Total genes | 11,278 | 100 |
| Protein coding genes | 11,184 | 99.16 |
| RNA genes | 92 | 0.81 |
| Pseudo genes | 2 | 0,1 × 10−1 |
| Genes in internal clusters | ND | ND |
| Genes with function prediction | 8575 | 76.0 |
| Genes assigned to COGs | 4789 | 42.4 |
| Genes with Pfam domains | 7978 | 70.8 |
| Genes with signal peptides | 1333 | 11.8 |
| Genes with transmembrane helices | 2293 | 20.3 |
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 234 | 2.09 | Translation, ribosomal structure and biogenesis |
| A | 42 | 0.37 | RNA processing and modification |
| K | 163 | 1.46 | Transcription |
| L | 126 | 1.12 | Replication, recombination and repair |
| B | 19 | 0.17 | Chromatin structure and dynamics |
| D | 32 | 0.29 | Cell cycle control, Cell division, chromosome partitioning |
| V | 32 | 0.29 | Defense mechanisms |
| T | 153 | 1.37 | Signal transduction mechanisms |
| M | 126 | 0.44 | Cell wall/membrane biogenesis |
| N | 2 | 0.02 | Cell motility |
| U | 241 | 2.15 | Intracellular trafficking and secretion |
| O | 294 | 2.63 | Posttranslational modification, protein turnover, chaperones |
| C | 216 | 1.93 | Energy production and conversion |
| G | 399 | 3.56 | Carbohydrate transport and metabolism |
| E | 257 | 2.30 | Amino acid transport and metabolism |
| F | 65 | 0.58 | Nucleotide transport and metabolism |
| H | 63 | 0.56 | Coenzyme transport and metabolism |
| I | 140 | 1.25 | Lipid transport and metabolism |
| P | 131 | 1.17 | Inorganic ion transport and metabolism |
| Q | 186 | 1.66 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 0 | 0 | General function prediction only |
| S | 1918 | 17.14 | Function unknown |
| – | 6402 | 57.2 | Not in KOGs |
Fig. 3Distribution of the hydrocarbon degradation genes in KEEG pathways. The bars represent the number of genes mapped in KEEG pathways related to hydrocarbon degradation. Most of the genes were mapped to the benzoate and its derivate compounds as aminobenzoate and fluorobenzoate
Genomic features comparison between HDO1 strain and IHEM 14462 strain [22]
| Parameter | IHEM 14462 | HDO1 |
|---|---|---|
| Size (Mb) | 43.44 | 44.19 |
| Content G-C (%) | 50.4 | 49.91 |
| Predicted genes | 10,919 | 11,278 |
| Predicted proteins | 8.375 | 11,184 |
Fig. 4MAUVE [62] alignment of draft genome sequence of HDO1 strain and draft genome sequence of IHEM 14462 strain. The figure represents the locally contiguous blocks (LCBs) that both sequences share, connected by lines to show their positions in the genomes. At the top the sequence of HDO1 strain is visualized and at the bottom the re-ordered sequence of the IHEM 14462 strain appears [23]. Blocks that are shown below indicate regions that have the reverse sequence related to the HDO1 sequence
Fig. 5Dot plot analysis comparing the HDO1 and IHEM14462 strains’ genomes. a Comparison at the nucleotide level. b Comparison at the protein level. It shows the alignment of the genome sequence of IHEM 14462 strains (y axis) against HDO1 genome sequence (x axis). The red color lines and dots represent the forward matches between the both genome sequences while the blue color ones represent reverse matches
Annotated genes involved in hydrocarbons degradation pathways
| Gene | Pathway | # of genes in HDO1 | # of genes in IHEM 14462 [ |
|---|---|---|---|
| Cytochrome P450 monooxygenase (EC:1.14.13.12) | PAHs degradation, alkane biodegradation [ | 79 | 44 |
| Phenol hydroxylase | Phenol degradation [ | 4 | 4 |
| Epoxide hydrolase (EC:3.3.2.9) | PAHs degradation [ | 3 | 2 |
| Oxidoreductase | Organic compound metabolism [ | 13 | 8 |
| Salicylate hydroxylase (EC:1.14.13.24) | Naphthalene degradation [ | 13 | 4 |
| Laccase (EC:1.10.3.2) | PAHs degradation [ | 2 | 2 |
| Catechol 1,2-dioxygenase (EC:1.13.11.1) | Phenol degradation [ | ||
| 2,4-dichlorophenol 6-monooxygenase (EC:1.14.13.7; EC:1.14.13.20) | Chlorinated phenols degradation [ | 5 | 5 |
| 2,3-dihydroxybenzoate decarboxylase (EC:4.1.1.46) | 2,3-dihydroxybenzoate degradation [ | 1 | 1 |
| Carboxy-cis,cis-muconate cyclase | Phenol degradation [ | 4 | 2 |
| Phenylacetate 2-hydroxylase | Homogentisate degradation [ | 1 | 2 |
| 2-nitropropane dioxygenase (EC:1.13.12.16) | Nitroalquene oxidation [ | 4 | 4 |
| Biphenyl-2,3-diol 1,2-dioxygenase | Byphenyl degradation [ | 1 | 2 |
| Dienelactone hydrolase | Chloroaromatic degradation [ | 2 | 7 |
| Vanillyl-alcohol oxidase (EC:1.1.3.38) | Aromatic degradation [ | 4 | 6 |
| Cyclopentanone 1,2-monooxygenase (EC:1.14.13.8; EC:1.14.13.16) | Cyclopentanol degradation [ | 2 | 2 |
| Tyrosinase | Phenolic compounds degradation [ | 1 | 3 |
| Lignostilbene dioxygenase (EC:1.13.11.43) | Lignin degradation [ | 2 | 1 |
| Total number of genes | 145 | 103 |
Annotated genes found only in the HDO1 strain
| Genes in HDO1 | Pathway | # of genes |
|---|---|---|
| 3-oxoadipate enol-lactonase | Phenol degradation [ | 5 |
| 5-carboxymethyl-2-hydroxymuconate isomerase | Homoprotocatechuate degradation pathway [ | 1 |
| Trihydroxytoluene oxygenase | 2,4-dinitrotoluene degradation [ | 1 |
| Benzoate 4-monooxygenase (EC:3.6.1.3) | Benzoate degradation [ | 5 |
| Diphenol oxidase | Phenolic compounds degradation [ | 1 |
| Cyclohexanone monooxygenase (EC:1.14.13.8) | Cyclohexane degradation [ | 4 |
| Gentisate 1,2-dioxygenase | PAHs degradation [ | 2 |
| 2-keto-4-pentenoate hydratase (EC:3.7.1.5) | Benzoate degradation [ | 1 |
| carboxymuconolactone decarboxylase | Protocatechuate degradation [ | 1 |
| 3-(3-hydroxy-phenyl)propionate 3-hydroxycinnamic acid hydroxylase | Phenyl propionate degradation [ | 2 |
| 3-hydroxybenzoate 6-hydroxylase | Xylenol [ | 1 |
| 3-hydroxyisobutyrate dehydrogenase (EC:1.1.1.44; EC:2.1.1.43) | Aromatic compounds metabolism [ | 5 |
| Total number of genes | 30 |