| Literature DB >> 25973144 |
Sevasti Filippidou1, Thomas Junier2, Tina Wunderlin1, Chien-Chi Lo3, Po-E Li3, Patrick S Chain3, Pilar Junier1.
Abstract
Microbial diversity studies based on metagenomic sequencing have greatly enhanced our knowledge of the microbial world. However, one caveat is the fact that not all microorganisms are equally well detected, questioning the universality of this approach. Firmicutes are known to be a dominant bacterial group. Several Firmicutes species are endospore formers and this property makes them hardy in potentially harsh conditions, and thus likely to be present in a wide variety of environments, even as residents and not functional players. While metagenomic libraries can be expected to contain endospore formers, endospores are known to be resilient to many traditional methods of DNA isolation and thus potentially undetectable. In this study we evaluated the representation of endospore-forming Firmicutes in 73 published metagenomic datasets using two molecular markers unique to this bacterial group (spo0A and gpr). Both markers were notably absent in well-known habitats of Firmicutes such as soil, with spo0A found only in three mammalian gut microbiomes. A tailored DNA extraction method resulted in the detection of a large diversity of endospore-formers in amplicon sequencing of the 16S rRNA and spo0A genes. However, shotgun classification was still poor with only a minor fraction of the community assigned to Firmicutes. Thus, removing a specific bias in a molecular workflow improves detection in amplicon sequencing, but it was insufficient to overcome the limitations for detecting endospore-forming Firmicutes in whole-genome metagenomics. In conclusion, this study highlights the importance of understanding the specific methodological biases that can contribute to improve the universality of metagenomic approaches.Entities:
Keywords: Endospores; Metagenomics; Profile analysis; gpr; spo0A
Year: 2015 PMID: 25973144 PMCID: PMC4427659 DOI: 10.1016/j.csbj.2015.04.002
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1A. Validation of the profiles created for the genes spo0A and gpr compared to a selection of genomes of endospore-forming Firmicutes (blue bars) and non spore-forming genomes (red bars). In endospore-forming Firmicutes a single hit with a score above 2000 (Spo0A) and 2500 (Gpr) distinguish between positive and negative hits. Strco = Streptomyces coelicolor; Rhime = Rhizobium melliloti; Nosaz: Nostoc azollae; Lacac = Lactobacillus acidophilus; Escco = Escherichia coli; Desre = Desulfotomaculum reducens; Desha = Desulfitobacterium hafniense; Clobo = Clostridium botulinum; Bacha = Bacillus halodurans; Aliac = Alicyclobacillus acidocaldarius. B. The same analysis was repeated using all 59 endospore-forming genomes retrieved from IMG and CMR databases (see supplementary Table 1).
Fig. 2Profile similarity hits for Spo0A and Gpr protein profiles in metagenomes from different origins. The color code identifying different environments is presented under the results. The genomes included in profile testing (see Fig. 1A) were also included in the analysis and are presented in white (endospore-formers) and gray (non-spore formers).
Fig. 3Analysis of pyrosequencing results obtained from 16S rRNA gene and spo0A amplicons, from an environmental sample with high prevalence of endospore-forming Firmicutes (Nea Apollonia, NAP). (A) Total 16S rRNA gene community composition to the phylum level. (B) Firmicute fraction of the total community (16S rRNA gene) to the genus level. (C). Cladogram representing the community composition of Firmicutes using the spo0A gene. Sequences color coded by genus.
Prevalence of Firmicutes in 16S rRNA gene amplicon sequencing and shotgun metagenomic sequencing applied to the NAP sample. Different prediction tools were used to establish the five most frequent Phyla in the samples. With the exception of the 16S rRNA gene amplicon sequencing, the relative percentage indicated corresponded to the fraction of the sequences that could be classified and not to the frequency of any of the groups for the total reads generated after sequencing.
| Prediction tool | Top 5 Phyla | Frequency | Relative % | |
|---|---|---|---|---|
| 16S RNA gene amplicon pyrosequencing (QIIME) | 1 | 41.70 | 41.70% | |
| 2 | Proteobacteria | 26.14 | 26.14% | |
| 3 | Bacteroidetes | 10.55 | 10.55% | |
| 4 | Planctomycetes | 5.35 | 5.35% | |
| 5 | Chlorobi | 3.88 | 3.88% | |
| Kraken (mini database) | 1 | Proteobacteria | 16644 | 82.71% |
| 2 | Actinobacteria | 1744 | 8.67% | |
| 3 | 322 | 1.60% | ||
| 4 | Bacteroidetes | 298 | 1.48% | |
| 5 | Cyanobacteria | 192 | 0.95% | |
| MetaPhlAn | 1 | Proteobacteria | 82.01061 | 82.01% |
| 2 | Chloroflexi | 9.24158 | 9.24% | |
| 3 | Actinobacteria | 2.32449 | 2.32% | |
| 4 | Bacteroidetes | 2.08071 | 2.08% | |
| 5 | Acidobacteria | 1.54098 | 1.54% | |
| BWA | 1 | Proteobacteria | 452 | 75.21% |
| 2 | 32 | 5.32% | ||
| 3 | Thaumarchaeota | 28 | 4.66% | |
| 4 | Actinobacteria | 26 | 4.33% | |
| 5 | Bacteroidetes | 17 | 2.83% | |
| LMAT | 1 | Ascomycota | 425 | 35.68% |
| 2 | Cyanobacteria | 385 | 32.33% | |
| 3 | Proteobacteria | 190 | 15.95% | |
| 4 | Thaumarchaeota | 145 | 12.17% | |
| 5 | Basidiomycota | 20 | 1.68% | |