| Literature DB >> 28273103 |
Jiujun Cheng1, Tatyana Romantsov1, Katja Engel1, Andrew C Doxey1, David R Rose1, Josh D Neufeld1, Trevor C Charles1.
Abstract
The techniques of metagenomics have allowed researchers to access the genomic potential of uncultivated microbes, but there remain significant barriers to determination of gene function based on DNA sequence alone. Functional metagenomics, in which DNA is cloned and expressed in surrogate hosts, can overcome these barriers, and make important contributions to the discovery of novel enzymes. In this study, a soil metagenomic library carried in an IncP cosmid was used for functional complementation for β-galactosidase activity in both Sinorhizobium meliloti (α-Proteobacteria) and Escherichia coli (γ-Proteobacteria) backgrounds. One β-galactosidase, encoded by six overlapping clones that were selected in both hosts, was identified as a member of glycoside hydrolase family 2. We could not identify ORFs obviously encoding possible β-galactosidases in 19 other sequenced clones that were only able to complement S. meliloti. Based on low sequence identity to other known glycoside hydrolases, yet not β-galactosidases, three of these ORFs were examined further. Biochemical analysis confirmed that all three encoded β-galactosidase activity. Lac36W_ORF11 and Lac161_ORF7 had conserved domains, but lacked similarities to known glycoside hydrolases. Lac161_ORF10 had neither conserved domains nor similarity to known glycoside hydrolases. Bioinformatic and structural modeling implied that Lac161_ORF10 protein represented a novel enzyme family with a five-bladed propeller glycoside hydrolase domain. By discovering founding members of three novel β-galactosidase families, we have reinforced the value of functional metagenomics for isolating novel genes that could not have been predicted from DNA sequence analysis alone.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28273103 PMCID: PMC5342196 DOI: 10.1371/journal.pone.0172545
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Bacterial strains, plasmids, and cosmids.
| Bacteria, plasmids, cosmids | Characteristics | References |
|---|---|---|
| Rm1021 | SU47 | [ |
| RmF728 | Rm1021 derivative ( | [ |
| DH5α | F | [ |
| DH5α (RifR) | A spontaneous RifR mutant of DH5α, RifR | [ |
| HB101 | F-
| [ |
| BL21(DE3)pLysS | F-
| Novagen |
| MT616 | [ | |
| pET-30a(+) | Expression vector, KmR | Novagen |
| pET-30b(+) | Expression vector, KmR | Novagen |
| pK19mobsacB | Cloning vector, KmR | [ |
| pSRKGm | pBBR1MCS-5 derivative, GmR | [ |
| pRK600 | Conjugation helper plasmid, CmR | [ |
| pJC8 | Low-copy broad-host-range Gateway® entry cosmid, TcR GmR | [ |
| pJC97 | pJC98 carrying Lac36W-ORF11, GmR | This work |
| pJC98 | pSRKGm derivative carrying a His-tag region from pET-30b(+), GmR | This work |
| pJC102 | pJC98 carrying Lac161-ORF7, GmR | This work |
| pTR5 | pET-30a(+) carrying Lac161-ORF10, KmR | This work |
Fig 1Functional metagenomics selection/screen and characterization of novel β-galactosidases.
12AC metagenomic clones complementing E. coli DH5α (lacZYA) and S. meliloti RmF728 (lacEFGZ1K1) grown in M9-lactose medium.
| Lac+ clones ID | Metagenomic DNA (bp) | GC content (%) | Numbers of predicted ORFs | Taxonomic origin | GenBank accession number |
|---|---|---|---|---|---|
| Lac13 | 34,117 | 63.6 | 34 | KF796593 | |
| Lac16 | 32,464 | 65.9 | 21 | KF796594 | |
| Lac20 | 34,092 | 60.9 | 35 | KF796595 | |
| Lac24B | 34,753 | 61.5 | 31 | KF796596 | |
| Lac35B | 34,179 | 61.7 | 31 | KF255992 | |
| Lac36B | 35,369 | 61.0 | 31 | KF796597 | |
| Lac36W | 34,259 | 67.3 | 33 | KF255993 | |
| Lac71 | 35,712 | 58.8 | 28 | KF796598 | |
| Lac82 | 34,035 | 65.8 | 30 | KF796599 | |
| Lac84 | 15,763 | 65.8 | 17 | KF796600 | |
| Lac100B_102 | 8,025 | 52.8 | 8 | KU728997 | |
| Lac100B_103 | 18,097 | 65.3 | 19 | KU728998 | |
| Lac111 | 30,066 | 65.9 | 23 | KF796601 | |
| Lac112W_102 | 24,084 | 58.5 | 18 | KU728999 | |
| Lac112W_103 | 13,528 | 60.6 | 14 | KU729000 | |
| Lac121 | 29,178 | 64.3 | 27 | KF796602 | |
| Lac127 | 31,850 | 69.9 | 21 | Bacteria | KF796603 |
| Lac146 | 25,797 | 62.0 | 15 | KF796604 | |
| Lac153 | 35,985 | 67.4 | 31 | KF796605 | |
| Lac160 | 36,235 | 69.1 | 12 | KF796606 | |
| Lac161 | 35,906 | 59.1 | 29 | KF255994 | |
| Lac172 | 37,868 | 59.6 | 36 | KF796607 | |
| Lac193 | 35,861 | 63.3 | 25 | KF796608 | |
| Lac224_103 | 13,505 | 68.5 | 7 | KU729001 | |
| Lac224_102 | 4,259 | 70.9 | 3 | KU729002 | |
| LacEc1 | 36,404 | 61.4 | 31 | KF796609 | |
| LacEc104 | 39,079 | 62.6 | 37 | KF796610 | |
| LacEc123 | 34,035 | 65.8 | 34 | KF796611 |
a Taxonomic origin was analyzed using PhylopythiaS.
Fig 2Conserved domains (E < 0.01) in β-galactosidases isolated from 12AC metagenomic library clones.
Biochemical characterization of novel β-galactosidases from 12AC Lac+ metagenomic clones complementing S. meliloti RmF728 (lac).
The β-galactosidase activity of purified proteins was assayed using lactose as substrate. Released glucose was quantified with a glucose oxidase activity kit.
| ORFs | Proteins | Molecular weight (kDa) | pI | Optimal tempe-rature (°C) | Optimal pH | |||
|---|---|---|---|---|---|---|---|---|
| Lac36W_ORF11 | β-Galactosidase (GenBank, AGW45517) | 79.6 | 8.2 | 2.5 | 10.4 | 3.7 × 103 | 42 | 6.0 |
| Lac161_ORF7 | β-Galactosidase (GenBank, AGW45552) | 109.0 | 6.7 | 1.8 | 13.2 | 7.3 × 104 | 50 | 6.0 |
| Lac161_ORF10 | β-Galactosidase (GenBank, AGW45555) | 63.7 | 9.1 | 3.2 | 8.6 | 2.7 × 103 | 37 | 6.5 |
Fig 3Biochemical characterization of novel β-galactosidases.
pH profiles of Lac161_ORF10 (A), Lac161_ORF7 (C), Lac36W_ORF11 (E). Temperature profiles of Lac161_ORF10 (B), Lac161_ORF7 (D), Lac36W_ORF11 (F).
Fig 4Bioinformatic characterization of a putative glycosyl hydrolase domain in Lac161_ORF10.
(A) The NCBI Conserved Domain Database (CDD) predicts Lac161_ORF10 as a divergent member of the GH_J clan of glycosyl hydrolases. (B) Structural model of Lac161_ORF10 generated by Phyre 2.0, with a predicted cluster of 8 ligand-binding residues highlighted in yellow. The putative binding site was predicted by 3dLigandSite based on the Phyre model with PDB ID 1vkd (chain A) as the template. A NAG ligand is shown in red, which approximates the location of a lactose molecule in Lac161_ORF10. (C) An alignment of Lac161_ORF10 with the most similar members of the CDD’s GH_J sequence cluster (Genbank accession numbers are included on the right of the tree). The most conserved columns are coloured light blue. A predicted active site feature (D197) is highlighted in yellow, and is consistent with 3dLigandSite’s predicted cluster of ligand-binding residues.
Fig 5Protein homology searches of novel β-galactosidase sequences of Lac161_ORF10, Lac161_ORF7 and Lac36W_ORF11 against aquatic, human gut, and soil metagenomic databases, normalized to the rpoB gene.