| Literature DB >> 31607556 |
Adrian Tett1, Kun D Huang2, Francesco Asnicar3, Hannah Fehlner-Peach4, Edoardo Pasolli3, Nicolai Karcher3, Federica Armanini3, Paolo Manghi3, Kevin Bonham5, Moreno Zolfo3, Francesca De Filippis6, Cara Magnabosco7, Richard Bonneau8, John Lusingu9, John Amuasi10, Karl Reinhard11, Thomas Rattei12, Fredrik Boulund13, Lars Engstrand14, Albert Zink14, Maria Carmen Collado15, Dan R Littman4, Daniel Eibach16, Danilo Ercolini6, Omar Rota-Stabelli17, Curtis Huttenhower18, Frank Maixner14, Nicola Segata19.
Abstract
Prevotella copri is a common human gut microbe that has been both positively and negatively associated with host health. In a cross-continent meta-analysis exploiting >6,500 metagenomes, we obtained >1,000 genomes and explored the genetic and population structure of P. copri. P. copri encompasses four distinct clades (>10% inter-clade genetic divergence) that we propose constitute the P. copri complex, and all clades were confirmed by isolate sequencing. These clades are nearly ubiquitous and co-present in non-Westernized populations. Genomic analysis showed substantial functional diversity in the complex with notable differences in carbohydrate metabolism, suggesting that multi-generational dietary modifications may be driving reduced prevalence in Westernized populations. Analysis of ancient metagenomes highlighted patterns of P. copri presence consistent with modern non-Westernized populations and a clade delineation time pre-dating human migratory waves out of Africa. These findings reveal that P. copri exhibits a high diversity that is underrepresented in Western-lifestyle populations.Entities:
Keywords: Iceman; Prevotella copri; Westernization; ancient DNA; bacterial pangenome; bacterial phylogenetics; comparative microbial genomics; gut microbes; human microbiome; metagenomic assembly; metagenomics
Mesh:
Year: 2019 PMID: 31607556 PMCID: PMC6854460 DOI: 10.1016/j.chom.2019.08.018
Source DB: PubMed Journal: Cell Host Microbe ISSN: 1931-3128 Impact factor: 21.023
Figure 1The Four Distinct Clades of the P. copri Complex
(A) Whole-genome phylogenetic tree of a representative subset of the four P. copri clades comprising the P. copri complex in relation to other sequenced members of the genera Prevotella, Alloprevotella, and Paraprevotella. Magenta circles indicate P. copri isolate sequences (built using 400 universal bacterial marker gene sequences, see STAR Methods). The phylogeny containing all P. copri genomes is available as Figure S1B and http://segatalab.cibio.unitn.it/data/Pcopri_Tett_et_al.html (see Data and Code Availability; Method Details).
(B) Genetic distances within a clade (intra-clade), between clades (inter-clade), and between clades and other species (denoted as OS) of Prevotella, Alloprevotella, and Paraprevotella (inter-species), shown as pairwise average nucleotide identity distances (ANI distance). The dotted line denotes 5% ANI distance.
(C) Pairwise SNV distances based on core gene alignment within and between clades (see STAR methods).
(D) Jaccard distance based on pairwise gene content (see STAR Methods) between and within the P. copri clades.
Figure 2Phylogenetic Representation of All 1,023 P. copri Genomes Separated for Each Clade of the P. copri Complex
Outer ring is colored by continent of origin and inner ring is colored by country. Radial gray bars indicate recently sequenced isolate genomes, and publicly available reference genomes are denoted by black stars.
Figure 3Prevalence of the P. copri Complex and Its Association with Non-Westernized Populations
(A) P. copri prevalence in non-Westernized and Westernized datasets. “All” refers to the prevalence of any of the four clades being present.
(B) Percentage of individuals harboring multiple P. copri clades.
(C) P. copri complex pangenome sizes for non-Westernized individuals by dataset compared to Westernized individuals.
Figure 4Functional Diversity of the P. copri Complex
(A) Presence and absence of eggNOG functions significantly different between the four P. copri clades (yellow, present; black, absent) (see STAR Methods).
(B) Multidimensional scaling (MDS) ordination based on CAZy families present in each genome showing distinct inter- and intra-clustering in the P. copri complex.
(C) All CAZy families significantly enriched (left) or depleted (right) in at least one clade relative to each of the other three (see STAR Methods). Prevalence is defined as the percentage of genomes in that clade for which at least one gene belongs to the given CAZy family. For full list of CAZy prevalence in each clade, see Table S3.
Figure 5Ancient Microbiomes and the Evolutionary History of the P. copri Complex
(A) Ancient Mexican coprolite samples and intestinal and lung tissue sampled from the Iceman, a natural ice mummy.
(B) Percentage of positive P. copri clade-specific markers identified in each ancient metagenomic sample.
(C) Time-resolved phylogenetic tree of the P. copri complex; magenta star indicates the ancient coprolite sample, 2180 (see STAR Methods).
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Faecal samples from a Ethiopian cohort | This study | N/A |
| Faecal samples from a Ghanaian cohort | This study | N/A |
| Faecal samples from a Tanzanian cohort | This study | N/A |
| PowerSoil DNA isolation kit | Qiagen | Cat No./ID: 12888-100 |
| TruSeq DNA PCR-free Library Prep Kit | Illumina, California, USA | 20015962 |
| NexteraXT DNA Library Preparation Kit | Illumina, California, USA | FC-131-1096 |
| Raw sequencing data (Ethiopia) | NCBI-SRA BioProject: PRJNA504891 | |
| Raw sequencing data (Ghana) | This study | NCBI-SRA BioProject: PRJNA529124 |
| Raw sequencing data (Tanzania) | This study | NCBI-SRA BioProject: PRJNA529400 |
| All | This study | |
| Raw sequencing data (Ancient metagenomics samples) | This study | NCBI-SRA BioProject: PRJEB31971 |
| metaSPAdes (version 3.10.1) | ||
| MEGAHIT (version 1.1.1) | ||
| anvi’o (version 2.3.2) | ||
| MetaPhlAn2 (version 2.0) | ||
| Bowtie2 (version 2.2.9) | ||
| blastn (version 2.6.0+) | ||
| CMseq (version dev commit 41082ef) | ||
| Prokka (version 1.11) | ||
| Pyani (version 0.2.6) | ||
| Roary (version 3.11) | ||
| SciPy | ||
| PhyloPhlAn (version dev commit 7c38e19) | ||
| Diamond (version v0.9.9.110) | ||
| MAFFT (version v7.310) | ||
| trimAl (version 1.2rev59) | ||
| FastTree (version 2.1.10) | ||
| RAxML (version 8.1.15) | ||
| GraPhlAn (version 1.1.3) | ||
| EggNOG mapper (version 1.0.3) | ||
| HMMSEARCH (version 3.1b2) | ||
| DeDup | ||
| mapDamage2 | ||
| SAMtools | ||
| HaploGrep2 | ||
| PileupCaller | ||
| BEAST (version 2.5.1) | ||
| Tracer (version 1.7) | ||
| Schmutzi | ||
| ANGSD | ||
| PRANK (version 140603) | ||
| Prodigal (version 2.6.3) | ||
| Barrnap (version 0.9) | ||