| Literature DB >> 30851083 |
Dayana Calderon1, Luis Peña2, Angélica Suarez1, Carolina Villamil1, Adan Ramirez-Rojas1, Juan M Anzola3, Juan C García-Betancur4, Martha L Cepeda1, Daniel Uribe5, Patricia Del Portillo1, Alvaro Mongui1,6.
Abstract
The vast microbial diversity on the planet represents an invaluable source for identifying novel activities with potential industrial and therapeutic application. In this regard, metagenomics has emerged as a group of strategies that have significantly facilitated the analysis of DNA from multiple environments and has expanded the limits of known microbial diversity. However, the functional characterization of enzymes, metabolites, and products encoded by diverse microbial genomes is limited by the inefficient heterologous expression of foreign genes. We have implemented a pipeline that combines NGS and Sanger sequencing as a way to identify fosmids within metagenomic libraries. This strategy facilitated the identification of putative proteins, subcloning of targeted genes and preliminary characterization of selected proteins. Overall, the in silico approach followed by the experimental validation allowed us to efficiently recover the activity of previously hidden enzymes derived from agricultural soil samples. Therefore, the methodology workflow described herein can be applied to recover activities encoded by environmental DNA from multiple sources.Entities:
Keywords: Environmental microbiology; Functional genomics; Metagenomics; Microbial genomics
Mesh:
Substances:
Year: 2019 PMID: 30851083 PMCID: PMC6460280 DOI: 10.1002/mbo3.572
Source DB: PubMed Journal: Microbiologyopen ISSN: 2045-8827 Impact factor: 3.139
Figure 1Pipeline overview. Isolated metagenomic clones are pooled in one sample for a massive sequencing analysis and independently analyzed by Sanger sequencing, in order to map the metagenomic inserts to their corresponding bacterial clones. After DNA assembly and clone assignment processes, ORF predictions and functional characterization of predicted putative proteins (e.g., PP1 and PP2) are performed. Selected coding sequences (e.g., CDS1 and CDS2) associated with the enzymatic activities of interest are matched to the original metagenomic clones or subcloned for independent maintenance in plasmid vectors. Finally, functional analyses on subclones expressing the predicted proteins allow the recovery of several enzymatic activities not identified in traditional functional metagenomic assays
Assembly statistics from metagenomic reads
| Number of contigs | 3,811 |
|---|---|
| Total size of contigs (nt) | 2'853,727 |
| Size of longest contig (nt) | 37,904 |
| Number of contigs > 1 kb | 343 |
| Number of contigs > 10 kb | 37 |
| Mean contig size (nt) | 749 |
| N50 contig size (nt) | 1006 |
| L50 contig count (nt) | 337 |
Figure 2Gene Ontology functions of the annotated fraction of the metagenome. Proteins with associated PFAM domains were mapped to Gene Ontology terms (GOSlim). Most of the terms are associated with energy metabolism and transport in and out of the cell. Proteins can be binned into more than one category and therefore the total number of annotations is higher than the total number of proteins
Metagenomic‐derived coding genes for putative lipases/esterases and proteases
| Enzymes | Fosmid ID | Contig ID | Putative Gene | Size (nt) | Protein Size (aa) |
|---|---|---|---|---|---|
| Lipases/Esterases | F2 | C14 | Consensus_gene_329 | 1,116 | 371 |
| F2 | C14 | Consensus_gene_353 | 852 | 283 | |
| F2 | C14 | Consensus_gene_354 | 288 | 95 | |
| F5 | C17 | Consensus_gene_420 | 2,115 | 704 | |
| F6 | C8 | Consensus_gene_211 | 981 | 326 | |
| F6 | C8 | Consensus_gene_212 | 651 | 216 | |
| F8 | C18 | Consensus_gene_436 | 2,028 | 675 | |
| F19 | U17 | U_42 | 1,086 | 361 | |
| F25 | C16 | Consensus_gene_396 | 636 | 211 | |
| F27 | C3 | Consensus_gene_87 | 792 | 263 | |
| F28 | U36 | U_195 | 1,575 | 524 | |
| F36 | C25 | Consensus_gene_553 | 600 | 199 | |
| Proteases | F5 | C17 | Consensus_gene_420 | 2,115 | 704 |
| F8 | C18 | Consensus_gene_436 | 2,028 | 675 | |
| F11 | C20 | Consensus_gene_472 | 435 | 144 | |
| F11 | C20 | Consensus_gene_473 | 828 | 275 | |
| F14 | C15 | Consensus_gene_359 | 1,098 | 365 | |
| F21 | U26 | U_145 | 645 | 214 | |
| F22 | C5 | Consensus_gene_122 | 1,278 | 425 | |
| F22 | C5 | Consensus_gene_126 | 1,404 | 467 | |
| F27 | C3 | Consensus_gene_62 | 1,707 | 568 | |
| F27 | C3 | Consensus_gene_85 | 1,377 | 458 | |
| F35 | U21 | U_70 | 849 | 282 | |
| F36 | C9 | Consensus_gene_224 | 1,902 | 633 | |
| F36 | C9 | Consensus_gene_232 | 1,146 | 381 | |
| F38 | C21 | Consensus_gene_496 | 1,101 | 366 |
Gene encoding for protein denominated as LipM.
Gene encoding for protein denominated as Prot1.
Gene encoding for protein denominated as Prot2.
Figure 3Bacterial enzymatic activity. (a) Proteolytic activity determination by the colorimetric method of Folin Ciocalteu reagent using casein as substrate of the reaction together with bacterial extracts from E. coli EPI300 metagenomic clones F8_C18 (harboring Prot1 CDS) or F38_C21 (harboring Prot2 CDS). (b) Lipolytic activity detection by p‐Nitrophenyl butyrate degradation of the bacterial extract derived from E. coli EPI300 metagenomic clone F5_C17 (harboring LipM CDS). In (a) and (b) E. coli EPI300 was used as a negative control of the enzymatic activities. (c) Proteolytic activity determination of bacterial extracts derived from E. coli LMG‐194 clones harboring either pBAD_Prot1 or pBAD_Prot2. (D) Lipolytic activity detection of the bacterial extract derived from E. coli BL21 harboring pET100_LipM plasmid. In (c) and (d), the respective nontransformed E. coli strains were used as negative controls of enzymatic activity. Error values represent standard deviations from three replicates in each case. *Indicates a significant difference in the proteolytic activity from clones pBAD_Prot1 and pBAD_Prot2 (p‐value < .05) compared with negative control. **Indicates a significant difference in the lipolytic activity of the bacterial extract derived from clone pET100_LipM (p‐value < .05) compared with negative control
Figure 4Partial protease characterization. (a) Effect of temperature on protease activities of Prot1 and Prot2. (b) Effect of pH on protease activities of Prot1 and Prot2. (c) Effect of metal ions and inhibitor (EDTA) on the enzymatic activities of Prot1 and Prot2