Literature DB >> 31189463

Multiple levels of the unknown in microbiome research.

Andrew Maltez Thomas1, Nicola Segata2.   

Abstract

Metagenomics allows exploration of aspects of a microbial community that were inaccessible by cultivation-based approaches targeting single microbes. Many new microbial taxa and genes have been discovered using metagenomics, but different kinds of "unknowns" still remain in a microbiome experiment. We discuss here whether and how it is possible to deal with them.

Entities:  

Mesh:

Year:  2019        PMID: 31189463      PMCID: PMC6560723          DOI: 10.1186/s12915-019-0667-z

Source DB:  PubMed          Journal:  BMC Biol        ISSN: 1741-7007            Impact factor:   7.431


Our understanding of the microbial communities that inhabit the human body and other environments has greatly improved in the past decade due to both biotechnological and computational advances in the metagenomic field [1]. Of particular note are the successful efforts to identify and genetically describe new microbial species that were previously part of the set of unknown micro-organisms occasionally referred to as “microbial dark matter”. However, in a typical microbiome experiment, several aspects of microbial communities still remain inaccessible. This inability to fully explore the diversity of a microbiome in a sample occurs at multiple distinct levels (Fig. 1) and should be acknowledged to avoid mis- and over-interpretation.
Fig. 1.

The current knowns and unknowns in the human microbiome. Numbers of known and unknown members of the human gut microbiome taken from a population-wide and multi-bodysite large-scale metagenomic assembly study [2]. Numbers marked with asterisks refer to genes from the Integrated Gene Catalogue (IGC) of the human gut microbiome and are derived from human fecal samples and mapping to the eggNOG database [3]

The current knowns and unknowns in the human microbiome. Numbers of known and unknown members of the human gut microbiome taken from a population-wide and multi-bodysite large-scale metagenomic assembly study [2]. Numbers marked with asterisks refer to genes from the Integrated Gene Catalogue (IGC) of the human gut microbiome and are derived from human fecal samples and mapping to the eggNOG database [3] At the deepest level of hidden diversity there are those members of the community that are not captured at all by the experiment, the undetected unknowns. These include low-abundance but potentially crucial taxa, whose genetic material is not sampled by sequencing techniques due to being present below the level of detection. Exactly where this threshold lies depends in part on experimental choices and specific techniques; for example, the dominance of host cells and DNA in the sample (e.g., biopsies from the intestinal mucosa) makes microbial taxa harder to detect and is a common problem in metagenomics experiments. Cultivation is less sensitive to the microbial concentrations in the sample than sequencing-based approaches and has contributed significantly to characterizing low-abundance taxa, especially when applied in a high-throughput setting [4]. However, available isolation protocols are unavoidably biased towards certain classes of microbes and are successful only for a fraction of a microbiome’s biodiversity. Bacteriophages are particularly prone to being under-sampled due to their short genomes and biochemical properties (e.g., having an RNA or single-stranded DNA genome) that are typically not considered by standard sample preparation protocols. Although virome enrichment protocols have been developed and applied, viruses remain perhaps the most neglected class of members of microbial communities. Microbiome taxa whose DNA is at least partially sequenced in the microbiome experiment but have not been described before and are phylogenetically far from genomes deposited in public databases represent another level of uncharacterized diversity. It was for such hard-to-profile hidden taxa that the term “microbial dark matter”, inspired by physics, was initially coined [5, 6]. This analogy has, however, come under question [7], since the dark matter in physics is thought to be a different form of matter while in microbiology undiscovered microbes have the same molecular basis as the known ones. This type of microbial hidden diversity is efficiently targeted by large-scale isolate sequencing and metagenomic assembly efforts that have recently uncovered many previously unexplored taxa [2, 8]. As a result of integrating the new taxa in the set of reference genomes, microbiomes can then be more comprehensively analyzed because the fraction of reads from a shotgun sequencing experiment that match a catalogued microbial genome—i.e., the metagenome’s mappability—increases. Our knowledge of the overall diversity of the human gut microbiome has indeed been greatly improved by large-scale metagenomic assembly efforts. For example in our study [2], mappability rates of gut metagenomes reach averages above 85% (median close to 95%), while previous rates were in the 50–70% range. Independent efforts based on both metagenomics [9, 10] and large-scale cultivation [8] have confirmed this trend. The mappability of metagenomes from human body sites other than the gut, such as the skin and the oral cavity, was similarly increased [2], and also for more diverse non-human environments these approaches have proven to be efficient and promising [11]. However, organizing large numbers of draft genomes from uncharacterized taxa is challenging, and while performing well for bacteria, assembly-based metagenomic tools are less effective when targeting new eukaryotic microbes and viruses. Intra-species genomic diversity can be extensive in bacteria and archaea. Indeed, several isolate-sequencing studies on (potential) pathogens highlighted how the set of genes that are present in some but not all the strains of a given species (i.e., the accessory or variable genome) can be more than ten times larger than the set of “core” genes that are always present in all strains of the species. Because the majority of microbiome species have few (if any) available genomes, the accessory genome of many species is underrepresented and thus the fraction of unmappable genetic material in a microbiome belonging to regions other than the core genome can be extensive. This is highlighted by the ~ 8% increased mappability that was observed when gut metagenomes are aligned against all > 154,000 newly recovered metagenomically assembled genomes rather than the 4930 single genome representatives of each candidate species (both known and newly defined). This increase ranged from 1.7% in vaginal samples to 23.8% in stool samples from non-Westernized populations [2]. To make further progress in uncovering hidden strain-level diversity, it is thus crucial to reconstruct sample-specific assemblies from the analyzed metagenomes and to include as many genomes as possible for each species in reference databases. Because species have pangenomes that are likely to be “open” (i.e., without an upper bound on the size of the accessory genome) mostly due to extensive horizontal gene transfer, it seems technically impossible to recover all strain-level diversity of a species across samples, but continuing the effort of cataloguing strain variants remains crucial for an in-depth understanding of the functional potential of a microbiome. The functional potential encoded in the overall microbiome and in its single microbial constituents is key to the understanding of microbial communities. The functional unknowns of a microbiome are, however, much more extensive and difficult to tackle than their taxonomic counterpart. This inaccessibility to functions stems from our limited understanding of the genes and pathways in a microbial genome, especially for non-model organisms, and from the wide phylogenetic diversity of microbiome members causing sequence homology to only partially capture functional similarity. Functional- and gene-centric efforts to characterize metagenomes include the creation of the Integrated Gene Catalogue (IGC) of the human gut microbiome, which comprises almost 10 million genes [3]. This is a non-redundant resource grouping genes at an identity threshold of ≥ 95% with ≥ 90% overlap, thus collapsing into gene-families the otherwise extremely large set of unique genes in the human microbiome (more than 316 million) [2]. Interestingly, 39.6% of genes present in the IGC catalogue were unmapped to functional databases. And the ability to match a gene against a target in functional databases is, however, only a partial step towards annotating its function; for instance, out of the 60.4% of genes that were annotated in the IGC, 15–20% are genes that have been observed before but are labeled as “function unknown” [3]. These numbers demonstrate how little is still known regarding both the genes that are present in microbial communities and their function. And whereas for taxonomic and phylogenetic diversity the latest high-throughput techniques are quickly decreasing the fraction of inaccessible taxa, experimental functional characterization of genes is inherently difficult to scale in high-throughput and cost-effective systems and is not receiving sufficient research investments. Although comparative analysis of the functional potential of metagenomes in different conditions can help in prioritizing genes for experimental functional characterization, it is very likely that the functional understanding of microbiomes cannot substantially improve in the short term and this appears to be one of the main limiting factors in the field. Current and future efforts to uncover the unexplored aspects of microbiomes will have direct consequences on several applications. Fecal microbiome transplantation is one such example, as a more complete profiling of gut microbiome samples can allow better and safer selection of donor samples and an improved understanding of which taxa contribute the most to the success of this medical practice. Uncovering the currently inaccessible microbiome members can also be crucial to expand disease-predictive taxonomic and functional microbiome signatures [12], and to better characterize populations and environments that are less studied and thus exhibit larger fractions of unexplored diversity. Several new phyla with intriguing phylogenetic placement in the whole tree-of-life have been recently described using metagenomics [13], and such continued expansion of the catalogued microbial diversity may also aid in our understanding of several biological aspects, including, for example, the process of eukaryogenesis, the origin of the eukaryotic cell [14]. The microbiome field is ready to embrace new and improved technologies to continue current efforts of reducing the effect of the different levels of unknowns in a microbiome experiment. These range from high-throughput cultivation [4] to single cell sequencing [6], but also improved computational methods are needed to more deeply explore metagenomic datasets, especially at a large scale. Functional understanding of the microbiome remains, however, the biggest challenge, and although low-throughput experiments targeting specific genes are irreplaceable, technology can again provide complementary solutions. These include integrated high-throughput profiling of the microbial transcriptome, metabolome, and proteome, and the automation of cultivation-based assays to scale-up the screening of multiple taxa and genes for phenotypes of interest. There are thus the conditions to substantially uncover the currently inaccessible microbiome, but specific differences and challenges are connected with each of the different kinds of the unknown outlined here.
  13 in total

1.  Dissecting biological "dark matter" with single-cell genetic analysis of rare and uncultivated TM7 microbes from the human mouth.

Authors:  Yann Marcy; Cleber Ouverney; Elisabeth M Bik; Tina Lösekann; Natalia Ivanova; Hector Garcia Martin; Ernest Szeto; Darren Platt; Philip Hugenholtz; David A Relman; Stephen R Quake
Journal:  Proc Natl Acad Sci U S A       Date:  2007-07-09       Impact factor: 11.205

2.  Insights into the phylogeny and coding potential of microbial dark matter.

Authors:  Christian Rinke; Patrick Schwientek; Alexander Sczyrba; Natalia N Ivanova; Iain J Anderson; Jan-Fang Cheng; Aaron Darling; Stephanie Malfatti; Brandon K Swan; Esther A Gies; Jeremy A Dodsworth; Brian P Hedlund; George Tsiamis; Stefan M Sievert; Wen-Tso Liu; Jonathan A Eisen; Steven J Hallam; Nikos C Kyrpides; Ramunas Stepanauskas; Edward M Rubin; Philip Hugenholtz; Tanja Woyke
Journal:  Nature       Date:  2013-07-14       Impact factor: 49.962

3.  Microbial culturomics: paradigm shift in the human gut microbiome study.

Authors:  J-C Lagier; F Armougom; M Million; P Hugon; I Pagnier; C Robert; F Bittar; G Fournous; G Gimenez; M Maraninchi; J-F Trape; E V Koonin; B La Scola; D Raoult
Journal:  Clin Microbiol Infect       Date:  2012-10-03       Impact factor: 8.067

Review 4.  Shotgun metagenomics, from sampling to analysis.

Authors:  Christopher Quince; Alan W Walker; Jared T Simpson; Nicholas J Loman; Nicola Segata
Journal:  Nat Biotechnol       Date:  2017-09-12       Impact factor: 54.908

5.  Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life.

Authors:  Donovan H Parks; Christian Rinke; Maria Chuvochina; Pierre-Alain Chaumeil; Ben J Woodcroft; Paul N Evans; Philip Hugenholtz; Gene W Tyson
Journal:  Nat Microbiol       Date:  2017-09-11       Impact factor: 17.745

6.  A new genomic blueprint of the human gut microbiota.

Authors:  Alexandre Almeida; Alex L Mitchell; Miguel Boland; Samuel C Forster; Gregory B Gloor; Aleksandra Tarkowska; Trevor D Lawley; Robert D Finn
Journal:  Nature       Date:  2019-02-11       Impact factor: 49.962

7.  Metagenomic analysis of colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and a link with choline degradation.

Authors:  Andrew Maltez Thomas; Paolo Manghi; Francesco Asnicar; Edoardo Pasolli; Federica Armanini; Moreno Zolfo; Francesco Beghini; Serena Manara; Nicolai Karcher; Chiara Pozzi; Sara Gandini; Davide Serrano; Sonia Tarallo; Antonio Francavilla; Gaetano Gallo; Mario Trompetto; Giulio Ferrero; Sayaka Mizutani; Hirotsugu Shiroma; Satoshi Shiba; Tatsuhiro Shibata; Shinichi Yachida; Takuji Yamada; Jakob Wirbel; Petra Schrotz-King; Cornelia M Ulrich; Hermann Brenner; Manimozhiyan Arumugam; Peer Bork; Georg Zeller; Francesca Cordero; Emmanuel Dias-Neto; João Carlos Setubal; Adrian Tett; Barbara Pardini; Maria Rescigno; Levi Waldron; Alessio Naccarati; Nicola Segata
Journal:  Nat Med       Date:  2019-04-01       Impact factor: 87.241

8.  1,520 reference genomes from cultivated human gut bacteria enable functional microbiome analyses.

Authors:  Yuanqiang Zou; Wenbin Xue; Guangwen Luo; Ziqing Deng; Panpan Qin; Ruijin Guo; Haipeng Sun; Yan Xia; Suisha Liang; Ying Dai; Daiwei Wan; Rongrong Jiang; Lili Su; Qiang Feng; Zhuye Jie; Tongkun Guo; Zhongkui Xia; Chuan Liu; Jinghong Yu; Yuxiang Lin; Shanmei Tang; Guicheng Huo; Xun Xu; Yong Hou; Xin Liu; Jian Wang; Huanming Yang; Karsten Kristiansen; Junhua Li; Huijue Jia; Liang Xiao
Journal:  Nat Biotechnol       Date:  2019-02-04       Impact factor: 54.908

9.  Exploring microbial dark matter to resolve the deep archaeal ancestry of eukaryotes.

Authors:  Jimmy H Saw; Anja Spang; Katarzyna Zaremba-Niedzwiedzka; Lina Juzokaite; Jeremy A Dodsworth; Senthil K Murugapiran; Dan R Colman; Cristina Takacs-Vesbach; Brian P Hedlund; Lionel Guy; Thijs J G Ettema
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2015-09-26       Impact factor: 6.237

10.  Extensive Unexplored Human Microbiome Diversity Revealed by Over 150,000 Genomes from Metagenomes Spanning Age, Geography, and Lifestyle.

Authors:  Edoardo Pasolli; Francesco Asnicar; Serena Manara; Moreno Zolfo; Nicolai Karcher; Federica Armanini; Francesco Beghini; Paolo Manghi; Adrian Tett; Paolo Ghensi; Maria Carmen Collado; Benjamin L Rice; Casey DuLong; Xochitl C Morgan; Christopher D Golden; Christopher Quince; Curtis Huttenhower; Nicola Segata
Journal:  Cell       Date:  2019-01-17       Impact factor: 41.582

View more
  30 in total

Review 1.  Tools for Analysis of the Microbiome.

Authors:  Jessica Galloway-Peña; Blake Hanson
Journal:  Dig Dis Sci       Date:  2020-03       Impact factor: 3.199

Review 2.  Mutualistic interplay between bacteriophages and bacteria in the human gut.

Authors:  Andrey N Shkoporov; Christopher J Turkington; Colin Hill
Journal:  Nat Rev Microbiol       Date:  2022-06-30       Impact factor: 60.633

3.  Phylogenomic Analyses of Snodgrassella Isolates from Honeybees and Bumblebees Reveal Taxonomic and Functional Diversity.

Authors:  Luc Cornet; Ilse Cleenwerck; Jessy Praet; Raphaël R Leonard; Nicolas J Vereecken; Denis Michez; Guy Smagghe; Denis Baurain; Peter Vandamme
Journal:  mSystems       Date:  2022-05-23       Impact factor: 7.324

4.  Unifying the known and unknown microbial coding sequence space.

Authors:  Chiara Vanni; Matthew S Schechter; Silvia G Acinas; Albert Barberán; Pier Luigi Buttigieg; Emilio O Casamayor; Tom O Delmont; Carlos M Duarte; A Murat Eren; Robert D Finn; Renzo Kottmann; Alex Mitchell; Pablo Sánchez; Kimmo Siren; Martin Steinegger; Frank Oliver Gloeckner; Antonio Fernàndez-Guerra
Journal:  Elife       Date:  2022-03-31       Impact factor: 8.713

5.  Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3.

Authors:  Francesco Beghini; Lauren J McIver; Aitor Blanco-Míguez; Leonard Dubois; Francesco Asnicar; Sagun Maharjan; Ana Mailyan; Paolo Manghi; Matthias Scholz; Andrew Maltez Thomas; Mireia Valles-Colomer; George Weingart; Yancong Zhang; Moreno Zolfo; Curtis Huttenhower; Eric A Franzosa; Nicola Segata
Journal:  Elife       Date:  2021-05-04       Impact factor: 8.140

Review 6.  Diet and the Microbiota-Gut-Brain Axis: Sowing the Seeds of Good Mental Health.

Authors:  Kirsten Berding; Klara Vlckova; Wolfgang Marx; Harriet Schellekens; Catherine Stanton; Gerard Clarke; Felice Jacka; Timothy G Dinan; John F Cryan
Journal:  Adv Nutr       Date:  2021-07-30       Impact factor: 8.701

Review 7.  The lung microbiome: progress and promise.

Authors:  Samantha A Whiteside; John E McGinniss; Ronald G Collman
Journal:  J Clin Invest       Date:  2021-08-02       Impact factor: 19.456

8.  Parabacteroides pekinense sp. nov.: a new bacterium isolated from the stool of a healthy man living in China.

Authors:  Z Li; X Zhou; W Xu; R Chen; B Zhao; C Wu
Journal:  New Microbes New Infect       Date:  2022-03-17

9.  Quantifying and Cataloguing Unknown Sequences within Human Microbiomes.

Authors:  Sejal Modha; David L Robertson; Joseph Hughes; Richard J Orton
Journal:  mSystems       Date:  2022-03-08       Impact factor: 7.324

10.  Precise phylogenetic analysis of microbial isolates and genomes from metagenomes using PhyloPhlAn 3.0.

Authors:  Francesco Asnicar; Andrew Maltez Thomas; Francesco Beghini; Claudia Mengoni; Serena Manara; Paolo Manghi; Qiyun Zhu; Mattia Bolzan; Fabio Cumbo; Uyen May; Jon G Sanders; Moreno Zolfo; Evguenia Kopylova; Edoardo Pasolli; Rob Knight; Siavash Mirarab; Curtis Huttenhower; Nicola Segata
Journal:  Nat Commun       Date:  2020-05-19       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.