| Literature DB >> 28409115 |
Saurav Das1,2, Sudipta Sankar Bora1, R N S Yadav2, Madhumita Barooah1.
Abstract
Metagenomic approach was used to understand the structural and functional diversity present in arsenic contaminated groundwater of the Ganges Brahmaputra Delta aquifer system. A metagene dataset (coded as TTGW1) of 89,171 sequences (totaling 125,449,864 base pairs) with an average length of 1406 bps was annotated. About 74,478 sequences containing 101,948 predicted protein coding regions passed the quality control. Taxonomical classification revealed abundance of bacteria that accounted for 98.3% of the microbial population of the metagenome. Eukaryota had an abundance of 1.1% followed by archea that showed 0.4% abundance. In phylum based classification, Proteobacteria was dominant (62.6%) followed by Bacteroidetes (11.7%), Planctomycetes (7.7%), Verrucomicrobia (5.6%), Actinobacteria (3.7%) and Firmicutes (1.9%). The Clusters of Orthologous Groups (COGs) analysis indicated that the protein regulating the metabolic functions constituted a high percentage (18,199 reads; 39.3%) of the whole metagenome followed by the proteins regulating the cellular processes (22.3%). About 0.07% sequences of the whole metagenome were related to genes coding for arsenic resistant mechanisms. Nearly 50% sequences of these coded for the arsenate reductase enzyme (EC. 1.20.4.1), the dominant enzyme of ars operon. Proteins associated with iron acquisition and metabolism were coded by 2% of the metagenome as revealed through SEED analysis. Our study reveals the microbial diversity and provides an insight into the functional aspect of the genes that might play crucial role in arsenic geocycle in contaminated ground water of Assam.Entities:
Keywords: Arsenic; Assam; Bacteria; Groundwater; Metagenomic; Proteobacteria; Siderophore
Year: 2017 PMID: 28409115 PMCID: PMC5379903 DOI: 10.1016/j.gdata.2017.03.013
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Fig. 1Analysis strategy performed to analyze microbial diversity prevalent in the arsenic contaminated groundwater sample. DNA from composite groundwater sample was used for Whole Genome Shotgun (WGS) sequencing.
Physicochemical parameter of the contaminated groundwater sample collected for metagenomics analysis.
| Sl. no. | Parameter | Ground-water sample 1 | Ground-water sample 2 | Ground-water sample 3 | Ground-water sample 4 | Ground-water sample 5 |
|---|---|---|---|---|---|---|
| 1. | pH | 6.4 | 6.2 | 7.1 | 5.9 | 6.8 |
| 2. | Electrolytic conductivity (μS/m) | 1783 | 1532 | 1572 | 1770 | 1814 |
| 3. | Temperature (°C) | 22.0 | 24.0 | 22.0 | 21.6 | 22.0 |
| 4. | Dissolved oxygen (mg/l) | 8.4 | 7.8 | 7.6 | 8.2 | 8.7 |
| 5. | Redox (mv) | 187 | 172 | 167 | 183 | 181 |
| 6. | Arsenic concentration (μg/l) | 217 | 50 | 20 | 156 | 112 |
Statistical analysis of the raw and processed sequences of the metagenome.
| Raw data uploaded | |
|---|---|
| Number of base pair uploaded | 125,449,864 bp |
| Coding sequence count | 89,171 |
| Mean sequence length | 1406 ± 6901 |
| Mean GC percent | 58 ± 10% |
Fig. 2Graphical representation of sequence analysis chart.
Fig. 3Graphical representation of taxon abundance. Proteobacteria showed the highest population in the whole metagenome followed by Bacteroidetes, Planctomycetes, Verucomicrobia, Actinobacteria, and Firmcutes etc.
Fig. 4Refrection curve of species richness.
Fig. 5Cluster based orthologous classification of proteins.
Fig. 6Functional prediction of annotated proteins.
Fig. 7Graphical representation of genes identified in the metagenome responsible for iron acquisition and siderophore activity (Krona Chart).
Fig. 8Genes involved in arsenic resistance mechanisms identified from metagenome.