| Literature DB >> 35263215 |
Ji-Sun Kim1, Se Won Kang1, Ju Huck Lee1, Seung-Hwan Park1, Jung-Sook Lee1,2.
Abstract
Akkermansia muciniphila is a commensal bacterium using mucin as its sole carbon and nitrogen source. A. muciniphila is a promising candidate for next-generation probiotics to prevent inflammatory and metabolic disorders, including diabetes and obesity, and to increase the response to cancer immunotherapy. In this study, a comparative pan-genome analysis was conducted to investigate the genomic diversity and evolutionary relationships between complete genomes of 27 A. muciniphila strains, including KGMB strains isolated from healthy Koreans. The analysis showed that A. muciniphila strains formed two clades of group A and B in a phylogenetic tree constructed using 1,219 orthologous single-copy core genes. Interestingly, group A comprised of strains from human feces in Korea, whereas most of group B comprised strains from human feces in Europe and China, and from mouse feces. As group A and B branched, mucin hydrolysis played an important role in the stability of the core genome and drove evolution in the direction of defense against invading pathogens, survival in, and colonization in the mucus layer. In addition, WapA and anSME, which function in competition and post-translational modification of sulfatase, respectively, have been a particularly important selective pressure in the evolution of group A. KGMB strains in group A with anSME gene showed sulfatase activity, but KCTC 15667T in group B without anSME did not. Our findings revealed that KGMB strains evolved to gain an edge in the competition with other gut bacteria by increasing the utilization of sulfated mucin, which will allow it to become highly colonized in the gut environment.Entities:
Keywords: Akkermansia muciniphila; comparative genomic analysis; environmental adaptation; evolution
Mesh:
Substances:
Year: 2022 PMID: 35263215 PMCID: PMC8920140 DOI: 10.1080/19490976.2021.2025017
Source DB: PubMed Journal: Gut Microbes ISSN: 1949-0976
Strain information sequenced in this study
| Strains | No. of contigs | Genome size (bp) | GC ratio | Topology | Country | Host |
|---|---|---|---|---|---|---|
| KGMB01988 | 1 | 2,844,056 | 55.23 | Circular | Republic of Korea | |
| KGMB01989 | 1 | 2,844,036 | 55.23 | Circular | Republic of Korea | |
| KGMB01990 | 1 | 2,844,062 | 55.23 | Circular | Republic of Korea | |
| KGMB02009 | 1 | 2,844,059 | 55.23 | Circular | Republic of Korea | |
| KCTC 15667T | 1 | 2,664,051 | 55.76 | Circular | Netherlands |
ANI values between strains sequenced in this study
| Strains | KGMB01989 | KGMB01990 | KGMB02009 | KGMB01988 | KCTC 15667T |
|---|---|---|---|---|---|
| KGMB01989 | - | 99.99 | 99.99 | 99.99 | 97.53 |
| KGMB01990 | 99.99 | - | 100 | 100 | 97.53 |
| KGMB02009 | 99.99 | 100 | - | 100 | 97.53 |
| KGMB01988 | 99.99 | 100 | - | 97.53 | |
| KCTC 15667T | 97.54 | 97.54 | 97.54 | 97.53 | - |
Complete genome sequences of A. muciniphila analyzed in this study
| Species | Strains | Tag | Genome assembly | Country | Host | Genome size | G + C ratio | No. of CDS | |
|---|---|---|---|---|---|---|---|---|---|
| 1 | EB-AMDK-3 | MYE3 | GCA_003716935.1 | Republic of Korea | 2,663,833 | 55.76 | 2,217 | ||
| 2 | EB-AMDK-4 | MYE4 | GCA_003716955.1 | Republic of Korea | 2,664,010 | 55.76 | 2,166 | ||
| 3 | KCTC 15667 T | KCTC 15667 | in this study | Netherlands | 2,664,051 | 55.76 | 2,149 | ||
| 4 | ATCC BAA-835 T | ATCC BAA-835 | GCA_000020225.1 | Netherlands | 2,664,102 | 55.76 | 2,150 | ||
| 5 | YL44_sDMDMm2 | MYL2 | GCA_002201495.1 | Switzerland | 2,737,357 | 55.66 | 2,242 | ||
| 6 | YL44 | MYL4 | GCA_001688765.2 | Switzerland | 2,745,278 | 55.66 | 2,254 | ||
| 7 | EB-AMDK-7 | MYE7 | GCA_004015245.1 | Republic of Korea | 2,799,431 | 55.30 | 2,305 | ||
| 8 | 139 | M139 | GCA_004319565.1 | China | 2,801,917 | 55.74 | 2,315 | ||
| 9 | EB-AMDK-16 | MY16 | GCA_004015205.1 | Republic of Korea | 2,770,073 | 55.30 | 2,270 | ||
| 10 | EB-AMDK-15 | MY15 | GCA_004015305.1 | Republic of Korea | 2,770,098 | 55.30 | 2,274 | ||
| 11 | EB-AMDK-18 | MY18 | GCA_004015085.1 | Republic of Korea | 2,770,124 | 55.30 | 2,287 | ||
| 12 | EB-AMDK-17 | MY17 | GCA_004015225.1 | Republic of Korea | 2,770,146 | 55.30 | 2,269 | ||
| 13 | EB-AMDK-1 | MYE1 | GCA_003716915.1 | Republic of Korea | 2,772,237 | 55.39 | 2,261 | ||
| 14 | H2 | MHA2 | GCA_004101765.1 | Belgium | 2,819,944 | 55.32 | 2,293 | ||
| 15 | CBA5201 | MC01 | GCA_004104435.1 | Republic of Korea | 2,860,407 | 55.32 | 2,348 | ||
| 16 | EB-AMDK-21 | MY21 | GCA_004015345.1 | Republic of Korea | 2,724,154 | 55.32 | 2,243 | ||
| 17 | EB-AMDK-22 | MY22 | GCA_004015125.1 | Republic of Korea | 2,724,161 | 55.32 | 2,252 | ||
| 18 | EB-AMDK-20 | MY20 | GCA_004015325.1 | Republic of Korea | 2,724,186 | 55.32 | 2,220 | ||
| 19 | EB-AMDK-19 | MY19 | GCA_004015105.1 | Republic of Korea | 2,724,248 | 55.32 | 2,211 | ||
| 20 | EB-AMDK-10 | MYE0 | GCA_004015005.1 | Republic of Korea | 2,763,834 | 55.25 | 2,363 | ||
| 21 | EB-AMDK-13 | MY13 | GCA_004015285.1 | Republic of Korea | 2,763,965 | 55.25 | 2,329 | ||
| 22 | EB-AMDK-14 | MY14 | GCA_004015065.1 | Republic of Korea | 2,764,188 | 55.25 | 2,267 | ||
| 23 | EB-AMDK-2 | MYE2 | GCA_003716975.1 | Republic of Korea | 2,764,211 | 55.25 | 2,251 | ||
| 24 | EB-AMDK-12 | MY12 | GCA_004015045.1 | Republic of Korea | 2,764,297 | 55.26 | 2,238 | ||
| 25 | EB-AMDK-11 | MY11 | GCA_004015025.1 | Republic of Korea | 2,764,311 | 55.26 | 2,243 | ||
| 26 | EB-AMDK-8 | MYE8 | GCA_004015265.1 | Republic of Korea | 2,824,041 | 55.39 | 2,330 | ||
| 27 | KGMB01988 | KGMB01988 | in this study | Republic of Korea | 2,844,056 | 55.23 | 2,315 | ||
| Outgroup | APytT | GAPY | GCA_900097105.1 | Netherlands | 3,074,078 | 57.65 | 2,497 | ||
Genome to genome distance with strain KGMB01988
| Query genome | Strain | ANI value | dDDH value | Distance | Probability that dDDH > 70% | G + C difference |
|---|---|---|---|---|---|---|
| KGMB01988 | EB-AMDK-3 | 97.64 | 78.90 | 0.0246 | 89.71 | 0.53 |
| EB-AMDK-4 | 97.64 | 78.90 | 0.0246 | 89.73 | 0.53 | |
| ATCC BAA-835 T | 97.66 | 78.90 | 0.0246 | 89.73 | 0.53 | |
| KCTC 15667 T | 97.63 | 78.90 | 0.0246 | 89.73 | 0.53 | |
| YL44_sDMDMm2 | 97.55 | 78.40 | 0.0252 | 89.30 | 0.43 | |
| YL44 | 97.63 | 78.40 | 0.0253 | 89.24 | 0.43 | |
| EB-AMDK-7 | 97.46 | 77.70 | 0.0262 | 88.57 | 0.07 | |
| 139 | 97.44 | 78.20 | 0.0255 | 89.10 | 0.51 | |
| EB-AMDK-16 | 98.06 | 82.30 | 0.0207 | 92.19 | 0.07 | |
| EB-AMDK-15 | 98.04 | 82.30 | 0.0207 | 92.21 | 0.07 | |
| EB-AMDK-18 | 98.03 | 82.30 | 0.0207 | 92.21 | 0.07 | |
| EB-AMDK-17 | 98.06 | 82.30 | 0.0206 | 92.22 | 0.07 | |
| EB-AMDK-1 | 98.26 | 84.00 | 0.0187 | 93.21 | 0.16 | |
| H2 | 98.08 | 82.10 | 0.0209 | 92.07 | 0.09 | |
| CBA5201 | 98.12 | 83.00 | 0.0199 | 92.63 | 0.09 | |
| EB-AMDK-21 | 99.89 | 98.90 | 0.0019 | 98.04 | 0.09 | |
| EB-AMDK-22 | 99.89 | 98.80 | 0.0020 | 98.03 | 0.09 | |
| EB-AMDK-20 | 99.88 | 98.90 | 0.0019 | 98.04 | 0.09 | |
| EB-AMDK-19 | 99.90 | 98.90 | 0.0019 | 98.04 | 0.09 | |
| EB-AMDK-10 | 98.97 | 90.80 | 0.0112 | 96.07 | 0.02 | |
| EB-AMDK-13 | 99.00 | 90.80 | 0.0112 | 96.07 | 0.02 | |
| EB-AMDK-14 | 99.03 | 90.90 | 0.0111 | 96.10 | 0.02 | |
| EB-AMDK-2 | 99.00 | 91.00 | 0.0111 | 96.11 | 0.02 | |
| EB-AMDK-12 | 99.03 | 91.00 | 0.0111 | 96.11 | 0.03 | |
| EB-AMDK-11 | 99.03 | 90.90 | 0.0111 | 96.10 | 0.03 | |
| EB-AMDK-8 | 99.10 | 92.30 | 0.0096 | 96.52 | 0.16 |
Figure 1.Curves for size of pan-genome and core-gene sets from completely sequenced Akkermansia muciniphila strain genomes. The strains shared 1,749 gene families. Pan-genome of 27 A. muciniphila strains consists of 3,811 gene families. Estimation of openness based on heaps’ law model showed that the A. muciniphila pan genome is open with a parameter (γ) of 0.78.
Figure 2.Phylogenetic tree of Akkermansia muciniphila group based on the concatenation of the amino acid alignments deduced from 1,219 core genes with maximum likelihood approach. Numbers above branches show maximum-likelihood bootstrap supports from 100 non-parametric replicates. The tree was rooted by A. glycaniphila APytT as an outgroup. The scale represents the number of substitutions per site.
Figure 3.Comparison of functional categories of Akkermansia muciniphila. The ordinate axis represents the percentages of genes in each functional category. The pan-genome from the 27 A. muciniphila genomes was analyzed using COG functional category: ‘cellular processes and signaling’, ‘metabolism’, ‘information storage and processing’ and ‘poorly characterized’. A. The radar plots of totalContents is the ratio of Core/Accessory/Unique genes in each COG category. B. The radar plots of eachCategory is the ratio of orthologs corresponding to each COG category in Core/Accessory/Unique genes. The radar plots were drawn using the ‘fmsb’ R package.
Glycoside hydrolase in . Values that are significantly different in KGMB01988 compared to members of other groups are highlighted in bold
| | Group B | | Group A | | ||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Glycosylhydrolasefamily | ATCC BAA-835T | KCTC 15667T | 139 | EB-AMDK-3 | EB-AMDK-4 | EB-AMDK-7 | YL44_sDMDMm2) | YL44 | CBA5201 | EB-AMDK-1 | H2 | EB-AMDK-15 | EB-AMDK-16 | EB-AMDK-17 | EB-AMDK-18 | EB-AMDK-8 | KGMB01988 | EB-AMDK-19 | EB-AMDK-20 | EB-AMDK-21 | EB-AMDK-22 | EB-AMDK-2 | EB-AMDK-10 | EB-AMDK-11 | EB-AMDK-12 | EB-AMDK-13 | EB-AMDK-14 | UniProt ID |
| GH 18 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| GH 29 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| GH 33 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| GH 43_24 | 2 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| GH 84 | 1 | 1 | 2 | 0 | 1 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | ||
| GH 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | B2UQC2 |
| GH 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | A0A3G3PEN8 |
| GH 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A2N8ILY3 |
| GH 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A2N8HGL1 |
| GH 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UM41 |
| GH 2 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | A0A1H6L742 |
| GH 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UPP0 |
| GH 13_5 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A2N8I8L1 |
| GH 13_8 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UML3 |
| GH 13_38 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UM13 |
| GH 16 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UPN9 |
| GH 16 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | R6IZJ9 |
| GH 16 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A2N8HLJ8 |
| GH 20 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | B2UN02 |
| GH 20 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | B2UP57 |
| GH 20 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UN22 |
| GH 20 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | B2UPR7 |
| GH 20 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A354E9T1 |
| GH 20 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UNM1 |
| GH 20 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A2N8IJN6 |
| GH 20 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UNC4 |
| GH 20 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A2N8INZ9 |
| GH 27 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2URC7 |
| GH 29 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UQE4 |
| GH 29 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A2N8HG39 |
| GH 29 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | R6J8W4 |
| GH 31 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UN73 |
| GH 31 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UQU9 |
| GH 33 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UN42 |
| GH 33 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UPI5 |
| GH 33 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | B2ULI1 |
| GH 35 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | R7E2K5 |
| GH 35 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | A0A2N8IUG9 |
| GH 36 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UQF3 |
| GH 36 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UMB0 |
| GH 36 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UNY5 |
| GH 57 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A139TUR1 |
| GH 63 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A2N8ILQ4 |
| GH 77 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2ULZ7 |
| GH 89 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 2 | 1 | 1 | 2 | 1 | B2URG0 |
| GH 89 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2ULB7 |
| GH 95 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A2N8IVR0 |
| GH 95 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UM81 |
| GH 97 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | A0A410ERH5 |
| GH 105 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UQG1 |
| GH 109 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | B2UL75 |
| GH 110 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UNU8 |
| GH 110 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UL12 |
| GH 123 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | B2UQA1 |
Figure 4.Analysis of ancestral genes in evolutionary path. Numbers adjacent to internal nodes indicate the number of estimated ancestral genes (protein coding genes). Right panel indicates the number of CDSs of Akkermansia muciniphila strains.
Figure 5.Minimal gene gain and loss events under the best fit model (GD-FR-ML). Numbers on the branches denote the minimum number of gains and losses in that order.
Gene families gained at the ancestral branch of group A
| Before | After | No. of changes | Annotation |
|---|---|---|---|
| 8 | 13 | 5 | tRNA3(Ser)-specific nuclease WapA |
| 0 | 1 | 1 | Sulfite reductase (NADPH) flavoprotein alpha-component |
| 0 | 1 | 1 | Sulfate adenylyltransferase subunit 1 |
| 0 | 1 | 1 | Sulfate adenylyltransferase subunit 2 |
| 0 | 1 | 1 | Thioredoxin-dependent 5’-adenylylsulfate reductase |
| 0 | 1 | 1 | Modification methylase DpnIIB |
| 0 | 1 | 1 | Type I restriction enzyme EcoKI M protein |
| 0 | 1 | 1 | Type I restriction enzyme EcoR124II R protein |
| 0 | 1 | 1 | 5-Methylcytosine-specific restriction enzyme B |
| 0 | 1 | 1 | Putative glycosyltransferase |
| 0 | 1 | 1 | Putative glycosyltransferase EpsJ |
| 0 | 1 | 1 | Bicarbonate transport system permease protein CmpB |
| 0 | 1 | 1 | Bicarbonate transport ATP-binding protein CmpD |
| 0 | 1 | 1 | Putative aliphatic sulfonates-binding protein |
| 0 | 1 | 1 | O-acetylserine sulfhydrylase |
| 0 | 1 | 1 | Outer membrane protein beta-barrel domain |
| 0 | 1 | 1 | Maltose O-acetyltransferase |
| 0 | 1 | 1 | Teichuronic acid biosynthesis protein TuaB |
| 0 | 1 | 1 | Putative teichuronic acid biosynthesis glycosyltransferase TuaC |
| 0 | 1 | 1 | Tyrosine recombinase XerD |
| 0 | 1 | 1 | Serine/threonine-protein kinase PrkC |
| 0 | 1 | 1 | Regulatory protein SoxS |
| 0 | 1 | 1 | Ornithine cyclodeaminase |
Figure 6.Gene counts of sulfate adenylyltransferase involved in sulfur metabolism in phylogenetic tree.
Figure 7.Graphical circular map of the chromosome of KGMB01988. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), GC content (black), GC skew (light green/Orange), and gained gene families on forward strand and negative strand after speciation (color by branching times).
Genes that differ between groups A1, A2 and B
| | |||||||||||||||
| tRNA3(Ser)-specific nuclease WapA | 5 | 6 | 7 | 7 | 7 | 9 | 8 | 8 | 15 | 12 | 13 | 12 | 12 | 11 | 12 |
| Anaerobic sulfatase-maturating enzyme anSME | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| DNA methylase | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| putative deoxyribonuclease RhsB | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| putative type I restriction enzymeP M protein | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| XRE family transcriptional regulator | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| signal peptidase I | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| putative deoxyribonuclease RhsB | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| CRISPR pre-crRNA endoribonuclease Cas5d | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| type I-C CRISPR-associated protein Cas7/Csd2 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| sialidases | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| α-L-fucosidase | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| copper oxidase | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| DNA modification methylase | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| PEP-CTERM sorting domain-containing protein | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| glycosyltransferase | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |