| Literature DB >> 24030592 |
Naseer Sangwan1, Helianthous Verma1, Roshan Kumar1, Vivek Negi1, Simon Lax2, Paramjit Khurana3, Jitendra P Khurana3, Jack A Gilbert2, Rup Lal1.
Abstract
Over the last 60 years, the use of hexachlorocyclohexane (HCH) as a pesticide has resulted in the production of >4 million tons of HCH waste, which has been dumped in open sinks across the globe. Here, the combination of the genomes of two genetic subspecies (Sphingobium japonicum UT26 and Sphingobium indicum B90A; isolated from two discrete geographical locations, Japan and India, respectively) capable of degrading HCH, with metagenomic data from an HCH dumpsite (∼450 mg HCH per g soil), enabled the reconstruction and validation of the last-common ancestor (LCA) genotype. Mapping the LCA genotype (3128 genes) to the subspecies genomes demonstrated that >20% of the genes in each subspecies were absent in the LCA. This includes two enzymes from the 'upper' HCH degradation pathway, suggesting that the ancestor was unable to degrade HCH isomers, but descendants acquired lin genes by transposon-mediated lateral gene transfer. In addition, anthranilate and homogentisate degradation traits were found to be strain (selectively retained only by UT26) and environment (absent in the LCA and subspecies, but prevalent in the metagenome) specific, respectively. One draft secondary chromosome, two near complete plasmids and eight complete lin transposons were assembled from the metagenomic DNA. Collectively, these results reinforce the elastic nature of the genus Sphingobium, and describe the evolutionary acquisition mechanism of a xenobiotic degradation phenotype in response to environmental pollution. This also demonstrates for the first time the use of metagenomic data in ancestral genotype reconstruction, highlighting its potential to provide significant insight into the development of such phenotypes.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24030592 PMCID: PMC3906814 DOI: 10.1038/ismej.2013.153
Source DB: PubMed Journal: ISME J ISSN: 1751-7362 Impact factor: 10.302
Sequencing and assembly statistics
| Sequence data | 450 Mbp (320 bp) | 1.6 Gbp (75 bp) |
| Reads after quality filtration | 42 044 290 | 20 111 630 |
| No. of contigs | 149 (>500 bp) | 1 216 300 (>200 bp) |
| Average contig coverage | 80 | 3 |
| Reads used in assembly | 33 635 432 (80%) | 1 608 930(8%) |
| N50 | 95 Kbp | 253 bp |
| Max. contig size | ∼253 Kbp | ∼3 Kbp |
| GC content | 65 | 61 |
Abbreviation: N50 is the length of the smallest contig in a bin that contains largest contigs representing at least half of the total assembly length.
Calculated by mapping of reads to contigs with the criteria of 95% identity over 90% of the read length.
Based on minimum contig length criteria, that is, genome ⩾500 bp and metagenome ⩾200 bp.
Figure 1Predicted ancestral genotype, its metagenomic fingerprints and genetic rearrangements across S. indicum B90A and S. japonicum UT26. (a) Metagenomic fingerprints of ancestral gene content; Pyrosequence reads from the previous dumpsite metagenome survey (Sangwan ) and illumina reads from this study were plotted on the predicted ancestral gene content (yellow shade) and genome sequences of the S. indicum B90A (gray shade) and S. japonicum UT26 (pink shade). Each dot on the graph represents an individual sequence read aligned along its reference genotypes. x and y axis represents the genomic coordinate (Mbp) and percentage sequence identity, respectively. (b) Genetic breakpoints between ancestor genotype and HCH-degrading genetic subspecies from outside towards centre; outermost circle: metagenomic contigs, circle 2: draft assembly of S. indicum B90A, circle 3: complete genome of S. japonicum UT26 mapped (BLASTN) over ancestral genotype (black color intensity represent the percentage identity, that is, darker the shade higher is the sequence identity), circle 4: lin genes homology between ancestor and subspecies, circle 5: metagenomic read coverage plot of ancestral gene content (5 × as coverage cutoff), outermost circle: GC plot of the predicted ancestral gene content.
Figure 2Metagenomic validation of de-novo assembled strain-specific mobile genetic elements. Total 8 transposons (a–h) were selected after reconstructed from de-novo assembly and graph-based clustering of genome (S. indicum B90A) and HCH dumpsite metagenomes. For each transposon; purple color dots represents the genomic reads of contigs (from graph-based clustering) assigned to a transposon and overlapping yellow reads represents the metagenomic validation with minimum sequence identity of 97%.
Figure 3Genotypic synteny and metagenomic recruitment of reference genotypes. Whole-genome alignments between reference genomes and their reconstructed genotypes: (a) pSLPG plasmid of Sphingobium sp. SYK-6, (b) pSPHCH01 plasmid of Sphingobium chlorophenolicum L1 and (c) chromosome 2 of S. japonicum UT26. Overlaid the synteny plots are the metagenomic read coverage on a numerical scale. Mc, metagenome contigs.