| Literature DB >> 21261880 |
Manuel Ferrer1, Ana Beloqui, José María Vieites, María Eugenia Guazzaroni, Ilana Berger, Amir Aharoni.
Abstract
In recent years, the application of approaches for harvesting DNA from the environment, the so-called, 'metagenomic approaches' has proven to be highly successful for the identification, isolation and generation of novel enzymes. Functional screening for the desired catalytic activity is one of the key steps in mining metagenomic libraries, as it does not rely on sequence homology. In this mini-review, we survey high-throughput screening tools, originally developed for directed evolution experiments, which can be readily adapted for the screening of large libraries. In particular, we focus on the use of in vitro compartmentalization (IVC) approaches to address potential advantages and problems the merger of culture-independent and IVC techniques might bring on the mining of enzyme activities in microbial communities.Entities:
Mesh:
Year: 2008 PMID: 21261880 PMCID: PMC3815420 DOI: 10.1111/j.1751-7915.2008.00057.x
Source DB: PubMed Journal: Microb Biotechnol ISSN: 1751-7915 Impact factor: 5.813
Figure 1Flow chart illustrating the identification, isolation and further engineering of novel enzymes using metagenomic and directed evolution approaches. Newly identified enzymes from metagenomic libraries can serve as an ideal starting point for the directed evolution of enzymes with improved properties. High‐throughput screening for enzymatic activity can be used, both for screening large metagenomic libraries and subsequently, for directed evolution experiments.
Figure 2Mining genomes and metagenomes for novel enzymes. A gene library is created from environmental samples (Step 1–3) and used to screen for novel genes (Step 4) cloned into bacteria which can be sequenced (Step 5a). The encoded proteins expressed in appropriate host are then subjected to structure‐function analyses (central panel). Alternatively, large‐scale sequencing of bulk DNA is used for archiving and sequence homology screening purposes to capture the largest amount of the available genetic resources present in environmental samples (Step 5b).
Figure 3Selections by FACS sorting of double emulsion droplets. A gene library is transformed into bacteria, and the encoded proteins are expressed in the cytoplasm, the periplasm, or on the surface of the cells (Step 1). The bacteria are dispersed to form a water‐in‐oil (w/o) emulsion, with typically one cell per aqueous microdroplet. Alternatively, an in vitro transcription/translation reaction mixture containing a library of genes is dispersed to form a w/o emulsion with typically one gene per aqueous microdroplet. The genes are transcribed and translated within the microdroplets (Step 2). Proteins with enzymatic activity convert the non‐fluorescent substrate into a fluorescent product and the w/o emulsion is converted into a water‐in‐oil‐in‐water (w/o/w) emulsion (Step 3). Fluorescent microdroplets are separated from non‐fluorescent microdroplets using a fluorescence activated cell sorter (FACS) (Step 4). Bacteria or genes from fluorescent microdroplets which encode active enzymes are recovered and the bacteria are propagated or the DNA is amplified using the polymerase chain reaction. These bacteria or genes can then be re‐compartmentalized for further rounds of selection.
Analysis of microbial communities through shotgun metagenomic sequencing (only environmental samples are shown).
| Sample | Library size | Host or vector system used | Average insert size (kbp) | Biodiversity | References |
|---|---|---|---|---|---|
| Sargasso Sea | 1 985 561 | Bst XI linearized pBR322 derivative | 2–6 | Samples were dominated by genes from | |
| Soil | 1129 (Bacteria) | pCR®2.1‐TOPO (Bacterial, Archaeal and Fungal) pSMART (viral) | 0.49 (Viruses) | This is the first study to use sequencing to characterize soil viral communities. Within each of the four microbial groups, data showed minimal taxonomic overlap between sites, suggesting that soil archaea, bacteria, fungi and viruses are globally as well as locally diverse. | |
| Acid mine drainage biofilm | 103 462 | pUC18 | 3.2 | Authors report the reconstruction of near‐complete genomes of | |
| Global Ocean | 7 697 926 | Bst XI linearized pBR322 derivative | 2 | Authors report a metagenomic study of the marine planktonic microbiota in which surface (mostly marine) water samples were analysed as part of the Sorcerer II Global Ocean Sampling expedition. The resulting 7.7 million sequencing reads form 41 samples provide an unprecedented look at the great diversity and heterogeneity in naturally occurring microbial populations. | |
| Soil | 1 186 200 | pJN105/pCF430 (small inserts) pBeloBAC11 (big inserts) | 2.7–45 | Authors designed a metagenomic analysis to isolate antibiotic resistance genes from 6 libraries of soil. They identified nine clones expressing resistance to aminoglycoside antibiotics and one expressing tetracycline resistance. |
Number of reads produced or
b. independent clones.
Metagenomic populations characterized through the 454 pyrosequencing technology (only environmental samples are shown).
| Sample | 454‐library size | Average length of reads | Biodiversity | References |
|---|---|---|---|---|
| Solar saltern | 582 681 | ≈100 bp | 151 genomic fragments were dominated by different halophilic archaea and by | |
| Soudan Mine | 334 386 (Red sample) 388 627 (Black sample) | 106 bp (RS) 99.1 bp (BS) | 76 16S rDNA | |
| Coral Porites astreoides | 316 279 | 102 bp | The most prominent bacterial groups were | |
| North Atlantic Deep Water and Axial Seamount | 118 778 | < 120 bp | Nearly 50% of the population corresponds to divergent | |
| Ocean surface waters | 414 323 (DNA) 128.324 (cDNA) | 110 bp (DNA) 114 bp (cDNA) | The genus | |
| Global soil | 314 041 | 96.4 bp | Results indicate that crenarchaeota may be the most abundant ammonia‐oxidizing organisms in soil ecosystems on Earth. | |
| Marine virome of four oceanic regions | 1 768 297 | 102 bp | Metagenomic analyses of 184 viral assemblages collected over a decade and representing 68 sites in four major oceanic regions. This work provides evidence that the composition of viral assemblages varies in different geographic regions. | |
| Northwest Atlantic & Eastern Tropical Pacific Seawater | ≈100 bp | Analysis of 7.7 million sequencing reads (6.3 billion bp) from the microbes collected across a several‐thousand km marine transects. | ||
| Surface and hypersaline marine, freshwater samples | ≈100 bp | Metagenomic analysis of 37 samples. Results showed that most of the 154 662 viral peptide sequences identified were not similar to those in the current database and that only few thousands genes encoding metabolic and cellular functions could be unambiguously identified. |
Number of reads.