| Literature DB >> 28095919 |
Gabriele Andrea Lugli1, Christian Milani1, Leonardo Mancabelli1, Francesca Turroni1, Chiara Ferrario1, Sabrina Duranti1, Douwe van Sinderen2, Marco Ventura3.
Abstract
BACKGROUND: Ancient microbiota information represents an important resource to evaluate bacterial evolution and to explore the biological spread of infectious diseases in history. The soft tissue of frozen mummified humans, such as the Tyrolean Iceman, has been shown to contain bacterial DNA that is suitable for population profiling of the prehistoric bacteria that colonized such ancient human hosts.Entities:
Keywords: Genomic evolution; Genomics; Gut bacteria; Metagenomics
Mesh:
Substances:
Year: 2017 PMID: 28095919 PMCID: PMC5240250 DOI: 10.1186/s40168-016-0221-y
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Ancient genome features
| Species | Strain | Genome size | ORFs number | GC content | Contigs | COGsa | Core genesa | Unique genesa | ANIb | Related strain |
|---|---|---|---|---|---|---|---|---|---|---|
|
| CADE | 2,713,669 | 2605 | 27.06 | 333 | 3,119 | 1,993 | 215 | 90.04 |
|
|
| CALG | 2,793,371 | 2588 | 29.98 | 95 | 2,902 | 2,295 | 216 | 98.38 |
|
|
| CPER | 3,860,191 | 3739 | 27.72 | 1,018 | 7,720 | 1,737 | 493 | 98.46 |
|
|
| PFLU | 5,806,496 | 5839 | 59.05 | 1,486 | 33,938 | 1,627 | 785 | 87.4 |
|
|
| PSYR | 2,744,354 | 2771 | 57.44 | 642 | – | – | – | – | – |
|
| PVER | 6,299,241 | 5805 | 60.49 | 289 | 8,700 | 4,200 | 615 | 93.89 |
|
aData obtained through pan-genome analyses with the public available sequenced genomes of the same species
bAverage nucleotide identity of the whole sequenced strains
Fig. 1Bacterial abundance in the Ötzi’s gut. Panel a displays a bar plot with the abundance of the major species identified in the Tyrolean Iceman gut using CoCla script. The x axis represents the identified bacterial species, while the y axis represents the number of nucleotides assembled in contigs. Each color reflects a specific sample, i.e., B0625 (lower part of the large intestine), C1824 and C1825 (upper part of the large intestine), and B0621 (small intestine). Panel b shows the visual abundance of the identified species
Fig. 2Phylogenetic diversity of the reconstructed ancient genomes. Panel a depicts a phylogenetic supertree based on the sequences of identified core proteins shared by the analyzed C. perfringens genomes. Panel b shows the same supertree based on the analyzed P. fluorescent genomes, while panel c displays the P. veronii supertee
Fig. 3Comparative genomic analysis of P. veronii PVER with other fully sequenced P. veronii strains. Circular genome atlas of P. veronii PVER (red circle) with mapped orthologues (defined as reciprocal best BLASTp hits with more than 50% identity over at least 50% of both protein lengths) in four publicly available P. veronii genomes (orange through green circle). Internal circles illustrate P. veronii PVER GC% deviation and GC skew (G − C/G + C), while the external maps exhibit the sequence identity between the unique loci of P. veronii PVER compared to other bacteria retrieved from the database. Each arrow indicates an ORF, whereas the length of the arrow is proportional to the length of the predicted ORF. Red arrows correspond to the P. veronii PVER genes, while orange arrows display orthologous genes
Clostridium perfringens toxins
| CPER ORFs | VFDB subject | Identity | Aln-len | Mismatch | Gap openings | Q.start | Q.end | S.start | S.end | log ( | Bit score | Predicted annotation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CPER_0621 | VFG002284 (gi:18309535) (nanJ) exo-alpha-sialidase [sialidase (VF0391)] [ | 94.4 | 1173 | 66 | 0 | 1 | 1173 | 1 | 1173 | −10000 | 2225.7 | Exo-alpha-sialidase |
| CPER_0726 | VFG002274 (gi:18309018) (plc) phospholipase C [alpha-toxin (CpPLC) (VF0378)] [ | 98.2 | 398 | 7 | 0 | 1 | 398 | 1 | 398 | −240.2 | 826.2 | Phospholipasae C |
| CPER_0764 | VFG002279 (gi:18310216) (nagJ) hyaluronidase [mu-toxin (VF0389)] [ | 98.1 | 879 | 17 | 0 | 1 | 879 | 1 | 879 | −10000 | 1726.1 | Hyaluronidase |
| CPER_1090 | VFG002276 (gi:18309155) (colA) collagenase [kappa-toxin (VF0388)] [ | 98.3 | 1043 | 18 | 0 | 1 | 1043 | 62 | 1104 | −10000 | 2096.2 | Peptidase M9 |
| CPER_1851 | VFG002278 (gi:18309863) (nagI) hyaluronidase [mu-toxin (VF0389)] [ | 98.6 | 1297 | 18 | 0 | 1 | 1297 | 1 | 1297 | −10000 | 2558.1 | Hyaluronidase |
| CPER_2203 | VFG002282 (gi:18309828) (cloSI) alpha-clostripain [alpha-clostripain (VF0390)] [ | 97.3 | 522 | 14 | 0 | 1 | 522 | 3 | 524 | −304.5 | 1040 | Alpha-clostripain precursor |
| CPER_2247 | VFG002280 (gi:18310261) (nagK) hyaluronidase [mu-toxin (VF0389)] [ | 99.3 | 1157 | 8 | 0 | 1 | 1157 | 1 | 1157 | −10000 | 2315.8 | Hyaluronidase |
| CPER_2490 | VFG002285 (gi:110800384) (nanH) sialidase [sialidase (VF0391)] [ | 95.9 | 294 | 12 | 0 | 1 | 294 | 89 | 382 | −168.5 | 587.4 | Exo-alpha-sialidase |
| CPER_2569 | VFG002277 (gi:18309173) (nagH) hyaluronidase [mu-toxin (VF0389)] [ | 98 | 1058 | 21 | 0 | 1 | 1058 | 21 | 1078 | −10000 | 2120.1 | Hyaluronoglucosaminidase |
| CPER_2658 | VFG002283 (gi:18309707) (nanI) exo-alpha-sialidase [sialidase (VF0391)] [ | 99.7 | 694 | 2 | 0 | 1 | 694 | 1 | 694 | −10000 | 1404.4 | Exo-alpha-sialidase |
| CPER_2774 | VFG002281 (gi:18310505) (nagL) hyaluronidase [mu-toxin (VF0389)] [ | 97.9 | 900 | 19 | 0 | 9 | 908 | 1 | 900 | −10000 | 1787.3 | Hyaluronidase |
| CPER_2934 | VFG002275 (gi:18309145) (pfoA) perfringolysin O [theta-toxin/PFO (VF0382)] [ | 99.8 | 500 | 1 | 0 | 1 | 500 | 1 | 500 | −289.6 | 990.7 | Perfringolysin O |
Predicted toxin-antitoxin systems
| Ancient ORFs | TADB code | TADB gene | Subject | Identity | Aln-len | Mismatch | Gap openings | Q.start | Q.end | S.start | S.end | Log ( | Bit score | Predicted annotation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CADE_0165 | TADB|8353403 | CLL_A3362 |
| 96.6 | 116 | 4 | 0 | 1 | 116 | 1 | 116 | -57.5 | 216.9 | mRNA interferase PemK |
| CADE_0166 | – | – | – | – | – | – | – | – | – | – | – | – | – | Hypothetical protein |
| CALG_0512 | TADB|8436183 | CLH_2005 |
| 76.7 | 146 | 34 | 0 | 1 | 146 | 1 | 146 | −64.3 | 240 | Hypothetical protein |
| CALG_0513 | TADB|8436181 | CLH_2004 |
| 63.8 | 127 | 46 | 0 | 4 | 130 | 6 | 132 | −45.3 | 176.4 | Hypothetical protein |
| CALG_1265 | TADB|375739 | CAC0494 |
| 90.4 | 114 | 11 | 0 | 3 | 116 | 2 | 115 | −54.3 | 206.5 | mRNA interferase PemK |
| CALG_1266 | – | – | – | – | – | – | – | – | – | – | – | – | – | CopG family transcriptional regulator |
| CPER_1456 | TADB|4710268 | CPF_0812 |
| 95.8 | 71 | 3 | 0 | 1 | 71 | 1 | 71 | −35.6 | 144.4 | Hypothetical protein |
| CPER_1457 | TADB|607725 | CPE0814 |
| 97.5 | 162 | 4 | 0 | 1 | 162 | 1 | 162 | −87.9 | 318.2 | GNAT family acetyltransferase |
| CPER_2012 | TADB|4710831 | CPF_1033 |
| 99.4 | 168 | 1 | 0 | 1 | 168 | 1 | 168 | −95.5 | 343.6 | Hypothetical protein |
| CPER_2013 | TADB|4710829 | CPF_1032 |
| 100 | 69 | 0 | 0 | 1 | 69 | 1 | 69 | −34 | 139 | DNA-binding protein |
| CPER_2208 | TADB|4721113 | CPR_0896 |
| 97.2 | 143 | 4 | 0 | 1 | 143 | 1 | 143 | −77.7 | 284.3 | Hypothetical protein |
| CPER_3173 | TADB|4710797 | CPF_1020 |
| 97.9 | 140 | 3 | 0 | 7 | 146 | 1 | 140 | −75.2 | 275.8 | Hypothetical protein |
| CPER_3174 | TADB|4710799 | CPF_1021 |
| 95.6 | 180 | 8 | 0 | 1 | 180 | 1 | 180 | −93.5 | 337 | XRE family transcriptional regulator |
| PFLU_1561 | TADB|9079447 | PFLU2030 |
| 98.4 | 123 | 2 | 0 | 1 | 123 | 1 | 123 | −64.8 | 241.1 | Hypothetical protein |
| PFLU_1562 | TADB|9079443 | PFLU2029 |
| 88.5 | 113 | 12 | 1 | 1 | 112 | 1 | 113 | −50.8 | 194.9 | Cro/Cl family transcriptional regulator |
| PFLU_2731 | – | – | – | – | – | – | – | – | – | – | – | – | – | Hexulose-6-phosphate isomerase |
| PFLU_2732 | TADB|348443 | hicB-2 |
| 71.3 | 108 | 31 | 0 | 5 | 112 | 2 | 109 | −42.4 | 166.8 | Antitoxin HicB |
| PFLU_4077 | – | – | – | – | – | – | – | – | – | – | – | – | – | Protein RnfH |
| PFLU_4078 | – | – | – | – | – | – | – | – | – | – | – | – | – | Ribosome association toxin RatA |
| PVER_0551 | TADB|12012688 | BCAS0580 |
| 55.8 | 129 | 56 | 1 | 1 | 128 | 1 | 129 | −35.9 | 145.2 | Twitching motility protein PilT |
| PVER_0552 | – | – | – | – | – | – | – | – | – | – | – | – | – | AbrB family transcriptional regulator |
| PVER_1351 | TADB|9070051 | PFLU0443 |
| 69.9 | 133 | 40 | 0 | 1 | 133 | 1 | 133 | −50.8 | 194.9 | Transcriptional regulator |
| PVER_1352 | TADB|9070048 | PFLU0442 |
| 64.2 | 95 | 34 | 0 | 1 | 95 | 1 | 95 | −33.8 | 138.3 | Hypothetical protein |
| PVER_1942 | – | – | – | – | – | – | – | – | – | – | – | – | – | Motility quorum-sensing regulator MqsR |
| PVER_1943 | TADB|3146921 | PFL_1052 |
| 58.1 | 136 | 55 | 1 | 1 | 136 | 1 | 134 | −38.8 | 155.2 | Transcriptional regulator |
| PVER_2079 | TADB|9097595 | PFLU5131 |
| 94 | 84 | 5 | 0 | 1 | 84 | 19 | 102 | −39.7 | 157.9 | Prevent-host-death protein |
| PVER_2080 | TADB|9097599 | PFLU5132 |
| 95.2 | 84 | 4 | 0 | 1 | 84 | 1 | 84 | −44.9 | 175.3 | Toxin YoeB |
| PVER_2187 | – | – | – | – | – | – | – | – | – | – | – | – | – | Addiction module toxin RelE |
| PVER_2188 | TADB|9082037 | PFLU2450 |
| 92 | 100 | 8 | 0 | 1 | 100 | 1 | 100 | −47.6 | 184.1 | Transcriptional regulator |
| PVER_2192 | TADB|9082090 | PFLU2458 |
| 92.8 | 125 | 9 | 0 | 1 | 125 | 1 | 125 | −62.2 | 232.6 | Twitching motility protein PilT |
| PVER_2193 | TADB|9082094 | PFLU2459 |
| 76.9 | 117 | 27 | 0 | 1 | 117 | 1 | 117 | −47.4 | 183.3 | Prevent-host-death protein |
| PVER_2252 | TADB|348443 | hicB-2 |
| 71.3 | 108 | 31 | 0 | 5 | 112 | 2 | 109 | −42.4 | 166.8 | Antitoxin HicB |
| PVER_2253 | – | – | – | – | – | – | – | – | – | – | – | – | – | Hexulose-6-phosphate isomerase |
| PVER_2785 | TADB|9079443 | PFLU2029 |
| 88.5 | 113 | 12 | 1 | 1 | 112 | 1 | 113 | −50.8 | 194.9 | Cro/Cl family transcriptional regulator |
| PVER_2786 | TADB|9079447 | PFLU2030 |
| 98.4 | 123 | 2 | 0 | 1 | 123 | 1 | 123 | −64.8 | 241.1 | Hypothetical protein |
| PVER_3149 | TADB|3145803 | PFL_0652 |
| 78.7 | 94 | 20 | 0 | 1 | 94 | 1 | 94 | −37.3 | 149.8 | Addiction module antitoxin |
| PVER_3150 | TADB|3145805 | PFL_0653 |
| 59.6 | 99 | 40 | 0 | 1 | 99 | 12 | 110 | −30.3 | 126.7 | Addiction module protein |
| PVER_3532 | TADB|9094658 | PFLU4639 |
| 91.5 | 117 | 10 | 0 | 1 | 117 | 2 | 118 | −57 | 215.3 | Transcriptional regulator |
| PVER_3533 | TADB|9094655 | PFLU4638 |
| 88 | 100 | 12 | 0 | 1 | 100 | 1 | 100 | −46.4 | 180.3 | Toxin RelE |
| PVER_4905 | – | – | – | – | – | – | – | – | – | – | – | – | – | Protein RnfH |
| PVER_4906 | – | – | – | – | – | – | – | – | – | – | – | – | – | Ribosome association toxin RatA |
Ancient metabolic pathways
| Species | Pathways | Status | Enzymes | Genes | Starting molecule | Final product |
|---|---|---|---|---|---|---|
|
| PreQ0 biosynthesis | Complete | GTP cyclohydrolase I |
| guanosine 5′-triphosphate (GTP) | 7-cyano-7-deazaguanine (preQ0) |
| 6-carboxy-5,6,7,8-tetrahydropterin synthase |
| |||||
| 7-carboxy-7-deazaguanine synthase |
| |||||
| 7-cyano-7-deazaguanine synthase |
| |||||
| β- | Complete | β- |
| β- |
| |
|
|
| |||||
|
|
| |||||
|
|
| |||||
| 2-keto-3-deoxygluconokinase |
| |||||
| 2-keto-3-deoxygluconate 6-phosphate aldolase |
| |||||
| Flavin biosynthesis I | Partial | bifunctional riboflavin kinase/FMN adenylyltransferase |
| Guanosine 5′-triphosphate (GTP) | Flavin adenine dinucleotide (FAD) | |
| 5-amino-6-(5-phospho- |
| |||||
| 3,4-dihydroxy-2-butanone 4-phosphate synthase |
| |||||
| Biotin biosynthesis from 8-amino-7-oxononanoate I | Partial | Biotin synthase |
| 8-amino-7-oxononanoate (KAPA) | Vitamin H (biotin) | |
| Biotin biosynthesis from 8-amino-7-oxononanoate II | Partial | Biotin synthase |
| 8-amino-7-oxononanoate (KAPA) | Vitamin H (biotin) | |
| Sucrose degradation I (sucrose phosphotransferase) | Absent | – | – | Sucrose | β- | |
|
| Myo-, chiro- and scillo-inositol degradation | Partial | (methyl) malonate-semialdehyde dehydrogenase gene |
| Inositol | Glycerone phosphate (DHAP) + acetyl-CoA |
| Myo-inositol degradation | Partial | (methyl) malonate-semialdehyde dehydrogenase gene |
| Inositol | Glycerone phosphate (DHAP) + acetyl-CoA | |
|
| KDO transfer to lipid IVA I | Absent | – | – | CMP-3-deoxy-β- | α-Kdo-(2- > 4)-α-Kdo-(2- > 6)-lipid IVA |
| KDO transfer to lipid IVA II | Absent | – | – | CMP-3-deoxy-β- | 4-O-phospho-α-Kdo-(2 → 6)-lipid IVA | |
|
| Pyochelin biosynthesis | Absent | – | – |
| Pyochelin |
| 4-hydroxyphenylacetate degradation | Complete | 4-coumarate 3-monooxygenase |
| 4-hydroxyphenylacetate | Succinate |
Fig. 4Carbohydrate-active enzymes of the ancient genomes. Panel a exhibits a bar plot with the abundance in Glycosyl hydrolase (GH) or Glycosyltransferase (GT) families encoded by the genomes of Clostridium sp. CADE and Clostridium sp. Ade.TY. Panel b shows a similar bar plot for C. algidicarnis CALG and C. algidicarnis B3, while panels c, d, and e display the GH and GT average between the publicly available genomes of C. perfringens, P. fluorescens and P. veronii, respectively