| Literature DB >> 25452698 |
Raphael D Isokpehi1, Udensi K Udensi2, Shaneka S Simmons3, Antoinesha L Hollman4, Antia E Cain5, Samson A Olofinsae6, Oluwabukola A Hassan6, Zainab A Kashim6, Ojochenemi A Enejoh6, Deborah E Fasesan6, Oyekanmi Nashiru6.
Abstract
The influence of environmental chemicals including arsenic, a type 1 carcinogen, on the composition and function of the human-associated microbiota is of significance in human health and disease. We have developed a suite of bioinformatics and visual analytics methods to evaluate the availability (presence or absence) and abundance of functional annotations in a microbial genome for seven Pfam protein families: As(III)-responsive transcriptional repressor (ArsR), anion-transporting ATPase (ArsA), arsenical pump membrane protein (ArsB), arsenate reductase (ArsC), arsenical resistance operon transacting repressor (ArsD), water/glycerol transport protein (aquaporins), and universal stress protein (USP). These genes encode function for sensing and/or regulating arsenic content in the bacterial cell. The evaluative profiling strategy was applied to 3,274 genomes from which 62 genomes from 18 genera were identified to contain genes for the seven protein families. Our list included 12 genomes in the Human Microbiome Project (HMP) from the following genera: Citrobacter, Escherichia, Lactobacillus, Providencia, Rhodococcus, and Staphylococcus. Gene neighborhood analysis of the arsenic resistance operon in the genome of Bacteroides thetaiotaomicron VPI-5482, a human gut symbiont, revealed the adjacent arrangement of genes for arsenite binding/transfer (ArsD) and cytochrome c biosynthesis (DsbD_2). Visual analytics facilitated evaluation of protein annotations in 367 genomes in the phylum Bacteroidetes identified multiple genomes in which genes for ArsD and DsbD_2 were adjacently arranged. Cytochrome c, produced by a posttranslational process, consists of heme-containing proteins important for cellular energy production and signaling. Further research is desired to elucidate arsenic resistance and arsenic-mediated cellular energy production in the Bacteroidetes.Entities:
Keywords: Bacteroides; Bacteroidetes; Human Microbiome Project; arsenate; arsenic; arsenite; bioinformatics; genomes; gut microbiota; heavy metal transport; human symbiont; mercuric transport; secondary data analysis; visual analytics
Year: 2014 PMID: 25452698 PMCID: PMC4230230 DOI: 10.4137/MBI.S18076
Source DB: PubMed Journal: Microbiol Insights ISSN: 1178-6361
Position of Pfam protein family annotation in genome binary profile.
| Pfam ID | Pfam NAME AND ABBREVIATION | POSITION IN BINARY DIGIT |
|---|---|---|
| Pfam02374 | Anion-transporting ATPase [ArsA] | 1 |
| Pfam02040 | Arsenical pump membrane protein [ArsB] | 2 |
| Pfam03960 | Arsenate reductase and related proteins, glutaredoxin family [ArsC] | 3 |
| Pfam06953 | Arsenical resistance operon trans-acting repressor, [ArsD] | 4 |
| Pfam01022 | As(III)-responsive transcriptional repressor [ArsR] | 5 |
| Pfam00230 | Major intrinsic protein family [MIP/AQP] | 6 |
| Pfam00582 | Universal stress protein domain [Usp] | 7 |
Figure 1Visualization of binary-encoded matrix for relevance of genomes with genes for arsenic operon, aquaporin, and universal stress protein. Data were obtained from the Integrated Microbial Genomes. Black square, relevance annotated for genome; white square, relevance not annotated for genome. When relevance data were not available, we entered a “Not_Reported” for data processing.
Locus tags for genes encoding selected arsenic-associated protein families in five Human Microbiome Project reference genomes.
| Pfam FAMILY | |||||
|---|---|---|---|---|---|
| arsR (Pfam01022) | CSAG_00049 | HMPREF9540_00434 | HMPREF9552_00168 | HMPREF0511_0214 | PstuA_020100015920 |
| CSAG_00058 | HMPREF9540_00675 | HMPREF9552_02803 | HMPREF0511_1131 | PstuA_020100016320 | |
| CSAG_00761 | HMPREF9540_01104 | HMPREF9552_02903 | HMPREF0511_1475 | PstuA_020100017025 | |
| CSAG_02502 | HMPREF9540_04804 | HMPREF9552_02908 | |||
| CSAG_04185 | HMPREF9540_04813 | ||||
| CSAG_04189 | |||||
| CSAG_04238 | |||||
| CSAG_04297 | |||||
| arsD (Pfam06953) | CSAG_00050 | HMPREF9540_04807 | HMPREF9552_02907 | HMPREF0511_1134 | PstuA_020100016315 |
| CSAG_00055 | HMPREF9540_04812 | ||||
| CSAG_04239 | |||||
| arsA (Pfam02374) | CSAG_00051 | HMPREF9540_04808 | HMPREF9552_02906 | HMPREF0511_1132 | PstuA_020100007500 |
| CSAG_00054 | HMPREF9540_04811 | PstuA_020100016310 | |||
| CSAG_04240 | |||||
| CSAG_04243 | |||||
| arsB (Pfam02040) | CSAG_00052 | HMPREF9540_00433 | HMPREF9552_02905 | HMPREF0511_1133 | PstuA_020100016305 |
| CSAG_04241 | HMPREF9540_04810 | ||||
| arsC (Pfam03960) | CSAG_00053 | HMPREF9540_00432 | HMPREF9552_01835 | HMPREF0511_0280 | PstuA_020100013100 |
| CSAG_02267 | HMPREF9540_04809 | HMPREF9552_01863 | HMPREF0511_0923 | PstuA_020100013225 | |
| CSAG_02283 | HMPREF9540_05016 | HMPREF9552_02904 | HMPREF0511_0962 | PstuA_020100016300 | |
| CSAG_04242 | HMPREF9540_05044 | ||||
| Aqp (Pfam00230) | CSAG_01847 | HMPREF9540_02867 | HMPREF9552_00087 | HMPREF0511_1378 | PstuA_020100002768 |
| CSAG_01948 | HMPREF9540_03890 | HMPREF9552_03971 | |||
| CSAG_04569 | HMPREF9540_04727 | ||||
| Usp (Pfam00582) | CSAG_00404 | HMPREF9540_00348 | HMPREF9552_01620 | HMPREF0511_0613 | PstuA_020100008480 |
| CSAG_00475 | HMPREF9540_00443 | HMPREF9552_02920 | HMPREF0511_1339 | PstuA_020100010015 | |
| CSAG_01459 | HMPREF9540_00492 | HMPREF9552_03271 | HMPREF0511_1387 | PstuA_020100010755 | |
| CSAG_01471 | HMPREF9540_01169 | HMPREF9552_03967 | HMPREF0511_1569 | PstuA_020100010765 | |
| CSAG_01741 | HMPREF9540_02863 | HMPREF9552_04168 | HMPREF0511_1702 | PstuA_020100011480 | |
| CSAG_03714 | HMPREF9540_04247 | HMPREF9552_05110 | PstuA_020100015380 | ||
| CSAG_03977 | HMPREF9540_04414 | HMPREF9552_05202 | PstuA_020100019714 | ||
| CSAG_04126 | |||||
| CSAG_00328 | |||||
Comparison of gene function order in arsenic resistance operon in selected Human Microbial Project reference genomes.
| GENOME | GENE CLUSTER IDENTIFIER | GENE ORDER | ||||
|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | ||
| CSAG_00049-CSAG_00053 | arsR | arsD | arsA | arsB | arsC | |
| CSAG_00054-CSAG_00055 | arsA | arsD | ||||
| CSAG_04238-CSAG_04242 | arsR | arsD | arsA | arsB | arsC | |
| CSAG_04243-CSAG_04243 | arsA | |||||
| HMPREF9540_00432-HMPREF9540_00434 | arsC | arsB | arsR | |||
| HMPREF9540_04807-HMPREF9540_04810 | arsD | arsA | arsC | arsB | ||
| HMPREF9540_04811-HMPREF9540_04813 | arsA | arsD | arsR | |||
| HMPREF9552_02903-HMPREF9552_02907 | arsR | arsC | arsB | arsA | arsD | |
| HMPREF9552_02908-HMPREF9552_02908 | arsR | |||||
| HMPREF0511_1131-HMPREF0511_1134 | arsR | arsA | arsB | arsD | ||
| PstuA_020100016300-PstuA_020100016320 | arsC | arsB | arsA | arsD | arsR | |
Note:
Start and end genes are used to identify gene clusters.
Figure 2Genomes in the Human Microbiome Project (HMP) genomes collection with genes for arsenic operon, aquaporin, and universal stress protein.
Notes: Binary code is based on the presence of seven protein families: Pfam02374 [anion-transporting ATPase (ArsA)]; Pfam02040 [arsenical pump membrane protein (ArsB)]; Pfam03960 [arsenate reductase and related proteins, glutaredoxin family (ArsC)]; Pfam06953 [arsenical resistance operon trans-acting repressor (ArsD)]; Pfam01022 [As(III)-responsive transcriptional repressor (ArsR)]; Pfam00230 [major intrinsic protein family (MIP/AQP)]; and Pfam00582 [universal stress protein domain (Usp)].
Figure 3Abundance of genes for arsenic-associated genes in genomes of Bacteroides species.
Notes: The horizontal axis has the count for arsenic-associated genes in the genome. The scale for each Pfam family varies according to the maximum abundance observed in the genomes evaluated. The genes and their Pfam encoding are arsA, Pfam02374 (anion-transporting ATPase); arsB, Pfam02040 (arsenical pump membrane protein); arsC, Pfam03960 (arsenate reductase and related proteins, glutaredoxin family); arsD, Pfam06953 (arsenical resistance operon trans-acting repressor); and arsR, Pfam01022 (As(III)-responsive transcriptional repressor).
Figure 4Transcription units and functional associations of arsenic resistance operon in Bacteroides thetaiotaomicron VPI-5482. The web pages for the transcription units are http://biocyc.org/BTHE226186/NEW-IMAGE?type=OPERON&object=TUJXV-83 and http://biocyc.org/BTHE226186/NEW-IMAGE?type=OPERON&object=TUJXV-442.