| Literature DB >> 33957869 |
Anna Cuscó1, Daniel Pérez2, Joaquim Viñes3,2, Norma Fàbregas3, Olga Francino2.
Abstract
BACKGROUND: Long-read sequencing in metagenomics facilitates the assembly of complete genomes out of complex microbial communities. These genomes include essential biologic information such as the ribosomal genes or the mobile genetic elements, which are usually missed with short-reads. We applied long-read metagenomics with Nanopore sequencing to retrieve high-quality metagenome-assembled genomes (HQ MAGs) from a dog fecal sample.Entities:
Keywords: Canine microbiome; Dog microbiome; Fecal microbiome; Gastrointestinal microbiome; Long-read metagenomics; Long-reads; Metagenome-assembled genomes; Nanopore; Sutterella
Year: 2021 PMID: 33957869 PMCID: PMC8103633 DOI: 10.1186/s12864-021-07607-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Experimental design overview. A single fecal sample from a healthy dog was extracted using a HMW and a non-HMW DNA extraction. Samples were sequenced using nanopore sequencing. Raw reads were basecalled and corrected prior to assembly. Four different data subsets were assembled to retrieve the maximum number of high-quality MAGs. These MAGs were frameshift-corrected and further analyzed
Fig. 2HMW-DNA vs. non-HMW DNA metagenomics assembly from the fecal sample of a healthy dog. Bandage plots of a) HMW-DNA assembly and b) non-HMW DNA assembly. HMW-DNA allows the recovery of long, circular contigs, which can potentially represent complete closed MAGs. We report the longest contigs in both datasets (Mb)
High quality (HQ) and medium quality (mq) single-contig MAGs retrieved in each metagenome assembly. Taxonomy assigned using the GTDB database release 95. Q is the MAG quality. Cov. is the coverage from Flye. *Blautia_A sp900541345 and *g__Sutterella HQ MAGs after correction of the indels
| Taxonomy (GTDB) | HMW data | 100% data | 75% data | 50% data | ||||
|---|---|---|---|---|---|---|---|---|
| Q | Cov. | Q | Cov. | Q | Cov. | Q | Cov. | |
| | HQ | 47X | HQ | 101X | mq | 82X | HQ | 50X |
| | mq | 95X | mq | 159X | HQ | 123X | mq | 87/80X |
| | HQ | 394X | HQ | 577X | HQ | 430X | HQ | 282X |
| | HQ | 87X | HQ | 205X | HQ | 155X | mq | 98X |
| | HQ | 13X | mq | 24X | HQ | 17X | mq | 11X |
| | – | – | mq | 38X | HQ | 31X | mq | 18X |
| | HQ | 17X | HQ | 42X | HQ | 31X | HQ | 22X |
| | HQ | 44X | – | – | mq | 45X | – | – |
| | mq | 126X | mq | 234X | mq | 168X | – | – |
| | mq | 206X | mq | 368X | mq | 282X | mq | 196X |
| | – | – | mq | 271X | – | – | – | – |
Fig. 3Histograms of the insertions and deletions in medium-quality MAGs (left) transformed into high-quality MAGs, after frameshift correction (right). The number of CDS, completeness (Compl.), and contamination (Cont.) are also included to evaluate the quality. Y-axis scale is 500 for better visualization of the insertions and deletions
Summary of genome statistics for High-quality MAGs compared to representatives on the public datasets. Coverage (Cov.) and circularity (Circ.) retrieved from Flye; completeness (% Compl.), from CheckM; tRNAs and rRNA values, from PROKKA. tRNAs count refers to unique canonical tRNAs. GTDB species representative are used as the references for comparison. The two exceptions are Succinivibrio and Sutterella since they were potential novel species regarding GTDB, but we found a MAG > 95% ANI on the animal gut metagenome and UHGG catalog, respectively. apartial gene call by PROKKA. bnot detected by PROKKA, but the GTDB/NCBI reference for Blautia argii (GCF_003287895.1) is described to contain five 5S rRNA genes
| Length (Mbp) | Cov. | Circ. | % Compl. | CDS | tRNAs | 16S rRNA | 23S rRNA | 5S rRNA | % pseudo genes | Contiguity level | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2.04 | 101X | N | 98.68 | 1683 | 20 | 7 | 7 | 8 | 0.83% | 1 contig | |
| Animal Gut Mg: Freddie_038 MAG | 1.74 | – | N | 97.50 | 1490 | 14 | 0 | 0 | 0 | 0.20% | 185 contigs |
| 2.70 | 123X | N | 95.49 | 2646 | 19 | 9 | 9 | 0 | 3.86% | 1 contig | |
| UHHG: MGYG_HGUT_01574 MAG | 1.14 | – | N | 78.72 | 1012 | 14 | 0 | 0 | 0 | 0.69% | 24 contigs |
| 2.72 | 577X | Y | 97.65 | 2037 | 20 | 7 | 7 | 7 | 0.15% | 1 contig | |
| GTDB Rep: GCA_900541335.1 | 2.42 | 13X | N | 96.13 | 1842 | 16 | 0 | 0 | 0 | 0.11% | 95 contigs |
| 2.09 | 205X | N | 99.85 | 1985 | 20 | 5 | 5 | 5 | 1.61% | 1 contig | |
| GTDB Rep: GCA_900544885.1 | 1.75 | 20X | N | 98.65 | 1610 | 18 | 1a | 0 | 0 | 0.68% | 87 contigs |
| 2.53 | 17X | N | 98.50 | 2565 | 20 | 7 | 7 | 7a | 1.83% | 1 contig | |
| GTDB Rep: GCF_004168205.1 | 2.54 | 45X | N | 100 | 2003 | 20 | 1a | 0 | 0 | 0.90% | 212 contigs |
| 2.44 | 44X | N | 93.86 | 2313 | 19 | 6 | 6 | 6 | 1.30% | 1 contig | |
| GTDB Rep: GCA_900541345.1 | 2.69 | 8X | N | 95.85 | 2499 | 16 | 0 | 0 | 0 | 0.72% | 160 contigs |
| 2.78 | 42X | Y | 99.13 | 2511 | 20 | 6 | 6 | 6 | 1.27% | 1 contig | |
| GTDB Rep: GCF_000271405.2 | 2.83 | – | Y | 99.63 | 2670 | 20 | 6 | 6 | 6 | 0.97% | Complete |
| 2.96 | 31X | N | 92.78 | 2698 | 20 | 5 | 5 | 0b | 0.59% | 1 contig | |
| GTDB Rep: GCF_003287895.1 | 3.30 | 217X | Y | 97.64 | 3203 | 20 | 5 | 5 | 0b | 2.06% | Complete |
Prevalence of the bacterial species identified in public microbiome surveys. For human-derived MAGs, the Unified Human Gut Genome database was used [36]. For animal-derived MAGs, the animal gut metagenome catalog [10] was used. If no MAG belonged to that bacterial species, we further screened GTDB [29]. For further detail on 16S rRNA gene phylogenies, see Additional File 6.
| HQ MAG | Dog | Human | Other animals | Closest 16S | Main host |
|---|---|---|---|---|---|
| 35 | 1 | 0 | Human gut | Dog | |
| 12 | 1 | 0 | Dog gut | Dog | |
| 7 | 1 | 0 | Wolves’ gut | Canids | |
| 1 | 0 | 0 | Wolves’ gut | Canids | |
| 27 | 691 | 2 | Human gut | Human, animal | |
| 1 | 35 | 3 | Multiple | Human, animal | |
| 1 | 6 | 1 | Dog gut | Human, animal | |
| 0 | 1 | 0 | Multiple carnivora | Human, animal |
Fig. 4Similarity of 16S rRNA gene from Sutterella HQ MAGs to public datasets. The 16S rRNA gene comparison from Sutterella HQ MAGs suggested it is the genome assembly for Sutterella stercoricanis. a Phylogenetic 16S rRNA gene tree of Sutterella HQ MAGs. It presents high similarity to uncultured bacterium clone with codes UUF from Panthera uncia (wild feline); uncultured bacterium clone CA_68 from Cuon alpinus (wild canid) (JN559525.1), and S. stercoricanis from dog feces [42]. b Identity matrix of 16S rRNA genes of Sutterella HQ MAG against S. stercoricanis (NR_025600.1). Sutterella HQ MAG contained nine 16S rRNA genes that were more than 98% identical to NR_025600.1 (reference). Specifically, 16S_6 presented more than 99% of identity
Fig. 5Functional analysis and comparison of HQ MAGs and published bacterial species using the COG database. a Stacked bar plots representing the 18 more abundant COG categories for Catenibacterium, Phascolarctobacterium, Succinivibrio, and Sutterella representatives (HQ MAGs with more Mobilome functions). Y-axis is escalated to 100% for visualization proposes. b Boxplots representing the actual percentage of mobilome COG category in each of the HQ MAGs and the bacterial species present in the databases grouped by origin. Y-axis represents different scales