| Literature DB >> 25281543 |
Claudia Bragalini1, Céline Ribière2, Nicolas Parisot2, Laurent Vallon3, Elsa Prudent3, Eric Peyretaillade2, Mariangela Girlanda4, Pierre Peyret2, Roland Marmeisse1, Patricia Luis5.
Abstract
Eukaryotic microbial communities play key functional roles in soil biology and potentially represent a rich source of natural products including biocatalysts. Culture-independent molecular methods are powerful tools to isolate functional genes from uncultured microorganisms. However, none of the methods used in environmental genomics allow for a rapid isolation of numerous functional genes from eukaryotic microbial communities. We developed an original adaptation of the solution hybrid selection (SHS) for an efficient recovery of functional complementary DNAs (cDNAs) synthesized from soil-extracted polyadenylated mRNAs. This protocol was tested on the Glycoside Hydrolase 11 gene family encoding endo-xylanases for which we designed 35 explorative 31-mers capture probes. SHS was implemented on four soil eukaryotic cDNA pools. After two successive rounds of capture, >90% of the resulting cDNAs were GH11 sequences, of which 70% (38 among 53 sequenced genes) were full length. Between 1.5 and 25% of the cloned captured sequences were expressed in Saccharomyces cerevisiae. Sequencing of polymerase chain reaction-amplified GH11 gene fragments from the captured sequences highlighted hundreds of phylogenetically diverse sequences that were not yet described, in public databases. This protocol offers the possibility of performing exhaustive exploration of eukaryotic gene families within microbial communities thriving in any type of environment.Entities:
Keywords: glycoside hydrolase family GH11; metatranscriptomics; sequence capture; soil RNA; soil eukaryotes
Mesh:
Substances:
Year: 2014 PMID: 25281543 PMCID: PMC4263301 DOI: 10.1093/dnares/dsu030
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Summary statistics from Illumina MiSeq sequencing of GH11 PCR fragments amplified, for each four cDNA samples, before (H0) or after one (H1) or two (H2) hybridization capture
| Sample | Total no. of sequences | Total no. of clustersa (95%) | No. of clusters encompassing ≥90% of the sequences | Shannon diversity index (H′)b | No. of shared clusters between H0–H1–H2b |
|---|---|---|---|---|---|
| PUE_H0 | 12,960 | 298 | 52 (17%) | 3.819 | 70 (11%) |
| PUE_H1 | 24,565 | 227 | 51 (22%) | 4.015 | |
| PUE_H2 | 25,053 | 291 | 46 (16%) | 3.912 | |
| BRE_H0 | 13,538 | 87 | 9 (10%) | 2.254 | 11 (5%) |
| BRE_H1 | 42,000 | 140 | 5 (4%) | 1.651 | |
| BRE_H2 | 46,626 | 112 | 6 (5%) | 1.73 | |
| BRH_H0 | 2,765 | 26 | 3 (12%) | 1.061 | 5 (4%) |
| BRH_H1 | 28,366 | 51 | 3 (6%) | 1.234 | |
| BRH_H2 | 17,322 | 159 | 18 (11%) | 2.135 | |
| BEW_H0 | 41,799 | 214 | 15 (7%) | 2.761 | 38 (6%) |
| BEW_H1 | 42,308 | 249 | 10 (4%) | 2.496 | |
| BEW_H2 | 36,859 | 205 | 6 (3%) | 2.196 |
aIncluding singletons.
bShannon diversity indices and shared clusters were calculated after rarefying the different data sets from the same soil to the same sequencing depth of 12,960, 13,538, 2,765 and 36,859 sequences for PUE, BRE, BRH and BEW, respectively.
Figure 1.Electrophoretic separation of cDNAs obtained following two consecutive solution hybridization selection. Captured cDNAs from the four soil samples PUE, BRH, BRE and BEW were run on an Agilent DNA 12000 microfluidic chip. Each band could encompass one or several unique but abundant GH11 cDNAs.
Figure 2.Semi-quantitative PCR amplification of a 281-bp GH11 fragment using different quantities (from 10 to 0.01 ng) of BRH cDNA obtained before (H0) and after one (H1) or two (H2) cycles of hybridization. Before capture, PCR products could only be obtained using 10 ng of input cDNA. Amplifications of the PUE, BRE and BEW samples are illustrated in Supplementary Fig. S3.
Cloning and characterization of captured GH11 cDNAs
| Samples | PUE | BRH | BRE | BEW |
|---|---|---|---|---|
| No. of captured cDNAs cloned in | 6,770 | 2,020 | 5,720 | 5,880 |
| No. of | 40 | 40 | 40 | 40 |
| Positive amplification of a GH11 fragment (%) | 37 (92.5) | 33 (82.5) | 35 (87.5) | 36 (90) |
| No. of inserts sequenced | 12 | 13 | 16 | 14 |
| No. of GH11 inserts (%) | 11 (92) | 12 (92) | 16 (100) | 14 (100) |
| No. of putative full-length GH11 (%) | 9 (82) | 9 (75) | 11 (69) | 9 (64) |
| % of | 1.5 | 25 | 12 | 6 |
Figure 3.Selectivity of the SHS capture. (A) Rank-abundance distribution of the most abundant GH11 nucleotide sequence clusters identified before (H0), or after one (H1) or two (H2) cycles of hybridization on the PUE cDNAs. Only clusters encompassing 80% of the sequences in the H0, H1 or H2 samples are shown. ‘C’ or ‘Y’ letters above bars indicate sequences obtained by random sequencing of plasmid inserts or which could be functionally expressed in yeast, respectively. (B) Venn diagram showing the number of unique or shared GH11 sequence clusters, before (H0), or after one (H1) or two (H2) cycles of hybridization on the PUE cDNAs. As in (A), only the most abundant clusters, encompassing 90% of the sequences, were used for the calculation. GH11 PCR sequences were clustered using a nucleotide sequence identity threshold of 95%. Similar Venn diagrams for the BRH, BRE and BEW samples are illustrated in Supplementary Fig. S5.
Figure 4.Phylogenetic diversity of the GH11 partial amino acid sequences obtained from PUE cDNA samples. 0, 1 and 2 translated PCR sequences obtained before or after one or two cycles of hybridization. PUE sequences are scattered over the entire tree that includes representative reference sequences from Ascomycota and Basidiomycota. c, sequences obtained from Escherichia coli clones; y, sequences functionally expressed in yeast clones. PhyML tree calculation was based on an alignment of ca. 80-amino-acid-long GH11 partial sequences. Thicker internal black branches indicate bootstrap value ≥60% (1,000 replications). Full species names and accession numbers of the reference sequences are given in Supplementary Fig. S6A. Similar trees drawn using the sequences from sites BRE, BRH and BEW are illustrated in Supplementary Fig. S6 B, C and D, respectively. This figure appears in colour in the online version of DNA Research.