| Literature DB >> 34923897 |
Simone Bihl1, Marcus de Goffau2,3, Daniel Podlesny1, Nicola Segata4, Fergus Shanahan5, Jens Walter5, W Florian Fricke1,6.
Abstract
There is an ongoing controversy around the existence of a prenatal, fetal microbiome in humans, livestock, and other animals. The 'in utero microbial colonization' hypothesis challenges the clinical paradigm of the 'sterile womb' but has been criticized for its reliance on DNA-based evidence to detect microbiomes and the failure to conciliate the routine experimental derivation of germ-free animals from surgically resected embryos with a thriving fetal microbiome. In order to avoid the propagation of misinformation in the scientific literature, a critical assessment and careful review of newly published studies, particularly those that challenge the convincing current clinical dogma of the sterile womb, is of critical importance.We read with interest a recent publication that postulated the presence of a fetal microbiome in sheep, but questioned the plausibility of the reported findings and their meaningfulness to prove "microbial colonisation of the fetal gut […] in utero". We reanalyzed the published metagenomic and metatranscriptomic sequence data from the original publication and identified evidence for different types of contamination that affected all samples alike and could explain the reported findings without requiring the existence of a fetal microbiome.Our reanalysis challenges the reported findings as supportive of a prenatal fetal lamb microbiome. The shortcomings of the original analysis and data interpretation highlight common problems of low-biomass microbiome projects. We propose genomic independence of separate biological samples, i.e. distinctive profiles at the microbial strain level, as a potential new microbiome marker to increase confidence in metagenomics analyses of controversial low-biomass microbiomes.Entities:
Keywords: Fetal microbiome; contamination; metagenomics; metatranscriptomics; microbial strains; phiX; sheep
Mesh:
Year: 2022 PMID: 34923897 PMCID: PMC8726709 DOI: 10.1080/19490976.2021.2005751
Source DB: PubMed Journal: Gut Microbes ISSN: 1949-0976
Overview of the original sequence data and the results of their re-analysis
| Sample/template | Low-quality and host DNA removal | Taxonomic analysis | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Raw sequence data1 | Low quality read filtering2 | Filtered sequence data | Phi X7 | ||||||
| C1 | DNA | 94,721,122 | 7,504,441/7.9% | 87,048,840/99.8% | 167,841 / | 25,670/15.3% | 4,792/2.9% | 25/<0.1% | 48,122/28.7% |
| RNA | 8,554,736 | 1,326,647/15.5% | 6,741,078/93.3% | 487,011/5.7% | 261,609/53.7% | 117,237/24.1% | 7,375/1.5% | 23,255/4.8% | |
| C2 | DNA | 106,325,910 | 7,925,261/7.4% | 97,731,410/99.3% | 669,239/0.6% | 38,358/5.7% | 39,768/5.9% | 1,157/0.2% | 87,724/13.1% |
| RNA | 8,153,304 | 1,258,545/15.4% | 6,392,044/92.7% | 502,715/6.2% | 17,564/3.5% | 268,923/53.5% | 46,912/9.3% | 89,231/17.7% | |
| C3 | DNA | 110,048,306 | 100,991,077// 91.8% | 100,761,731/99.8% | 229,346/0.2% | 29,312/12.8% | 4,835/2.1% | 16/<0.1% | 93,480/40.8% |
| RNA | 39,103,292 | 33,979,905/86.9% | 33,645,902/99.0% | 334,003 / | 135,772/40.7% | 81,504/24.4% | 3,353/1.0% | 54,540/16.3% | |
| C4 | DNA | 95,215,914 | 87,445,966/91.8% | 87,230,854/99.8% | 215,112/0.2% | 36,214/16.8% | 4,226/2.0% | 56/<0.1% | 36,821/17.1% |
| RNA | 8,656,196 | 7,313,727/84.5% | 6,210,900/84.9% | 1,102,827/12.7% | 594,911/53.9% | 391,563/35.5% | 18,067/1.6% | 11,739/1.1% | |
| C5 | DNA | 117,291,656 | 107,795,471/91.9% | 107,520,176/99.7% | 275,112/0.2% | 75,921/27.6% | 5,838 /2.1% | 19/<0.1% | 96,498/35.1% |
| RNA | 30,009,606 | 26,535,222/88.4% | 26,345,106/99.3% | 190,116/0.6% | 54,327/28.6% | 30,887/16.2% | 1,586/0.8% | 80,009/42.1% | |
| C6 | DNA | 149,365,318 | 137,894,749/92.3% | 137,772,043/99.9% | 122,706/0.1% | 6,872/5.6% | 9,435/7.7% | 72/0.1% | 89,478/72.9% |
| RNA | 19,015,404 | 16,674,129/87.7% | 16,219,742/97.3% | 454,387/2.4% | 236,101/52.0% | 92,953/20.5% | 5,045/1.1% | 47,720/10.5% | |
| Negative control | DNA | 4,956,844 | 429,994/8.7% | 2,953,335/65.2% | 1,573,515/31.7% | 321,548/20.4% | 62,139/3.9% | 1,129/<0.1% | 16,879/1.1% |
| RNA | 7,707,198 | 1,283,593/6.7% | 5,866,837/91.3% | 556,768/7.2% | 266,483/47.9% | 189,912/34.1% | 11,187/2.0% | 16,819/3.0% | |
| Positive control | DNA | 118,227,748 | 8,647,781/7.3% | 32,281,742/29.5% | 77,298,225// 65.4% | 690,722/0.9% | 74,205,991 | 3,128/<0.1% | 69,080/0.1% |
| RNA | 51,212,242 | 3,641,662/7.1% | 21,274,220/44.7% | 26,296,360/51.3% | 34,128/0.1% | 25,793,455/98.1% | 877/<0.1% | 26,780/0.1% | |
1Raw sequence data from the original manuscript were downloaded from https://www.ncbi.nlm.nih.gov/sra/ (PRJNA601636; PRJNA598075);
2Sequence regions where the base quality fell below Q20 within a 4-nucleotide sliding window were trimmed and reads that were truncated by more than 30% removed (SLIDINGWINDOW:4:20, MINLEN:70) with KneadData v0.6.1 (https://huttenhower.sph.harvard.edu/kneaddata);
3Host DNA was removed by mapping trimmed reads to the sheep genome (Ovis aries; GCF_002742125.1) with the Burrows-Wheeler Aligner (BWA; http://bio-bwa.sourceforge.net/);
4−9For taxonomic assignments, reads were successively mapped to the different eukaryotic, bacterial, and viral reference genomes with BWA. After each alignment, mapped reads were filtered out and only the remaining reads compared to the next reference, in the order O. aries -> H. sapiens -> E. coli -> C. marimammalium -> phiX.
4Homo sapiens ((GRCh37/hg19));
5Escherichia coli K12 DH10B (GCF_000019425.1);
6Catellicoccus marimammalium M35/04/3 (GCF_000313915.1);
7Escherichia virus phiX174 (GCF_000819615.1).
Figure 1.Comparable relative abundance profiles in different DNA and RNA samples and controls. Relative abundances were calculated based on the number of reads mapped to reference genomes with BWA, either relative to the total number of quality-filtered reads (for O. aries) or relative to the number of qualify-filtered reads after removal of sheep sequence data (all others). Reads were mapped iteratively using a filtering approach, i.e. only those reads that did not map to one genome were used as input for the mapping to the next genome, in the order O. aries -> H. sapiens -> E. coli -> C. marimammalium -> phiX.
Figure 2.Shared single-nucleotide variants in fetal lamb metagenomes and positive control identify the same strain. Two screenshots show trimmed and filtered metagenomic reads mapped with BWA to E. coli K12 DH10B (GCF_000019425.1). Alignments were visualized with the Integrative Genomic Viewer (IGV; https://software.broadinstitute.org/software/igv/). Based on unique and shared single nucleotide variation (SNV) profiles the positive control (p) contains two strains. With respect to the depicted genome region, one strain is identical to DH10B whereas the other strain is also found in three fetal lamb samples (C1, C4, C5)
Figure 3.Sample and control metagenomes and metatranscriptomes contain the same phiX strain. A screenshot shows trimmed and filtered metagenomic and metatranscriptomic reads mapped with BWA to phiX174(GCF_000819615.1). Sequencing coverage in all datasets extends over the entire phage genome (100%), including non-transcribed regions around the origin of replication