| Literature DB >> 35778384 |
Sébastien Bridel1, Valérie Bouchez1,2, Bryan Brancotte3, Sofia Hauck4, Nathalie Armatys1,2, Annie Landier1,2, Estelle Mühle5, Sophie Guillot1,2, Julie Toubiana1,2,6, Martin C J Maiden4, Keith A Jolley4, Sylvain Brisse7,8.
Abstract
The genus Bordetella includes bacteria that are found in the environment and/or associated with humans and other animals. A few closely related species, including Bordetella pertussis, are human pathogens that cause diseases such as whooping cough. Here, we present a large database of Bordetella isolates and genomes and develop genotyping systems for the genus and for the B. pertussis clade. To generate the database, we merge previously existing databases from Oxford University and Institut Pasteur, import genomes from public repositories, and add 83 newly sequenced B. bronchiseptica genomes. The public database currently includes 2582 Bordetella isolates and their provenance data, and 2085 genomes ( https://bigsdb.pasteur.fr/bordetella/ ). We use core-genome multilocus sequence typing (cgMLST) to develop genotyping systems for the whole genus and for B. pertussis, as well as specific schemes to define antigenic, virulence and macrolide resistance profiles. Phylogenetic analyses allow us to redefine evolutionary relationships among known Bordetella species, and to propose potential new species. Our database provides an expandable resource for genotyping of environmental and clinical Bordetella isolates, thus facilitating evolutionary and epidemiological research on whooping cough and other Bordetella infections.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35778384 PMCID: PMC9249784 DOI: 10.1038/s41467-022-31517-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Bordetella genomic species defined in this work.
| Genomic speciesa | Former denomination | No. of isolates | No. of genomes | Reference strain (BIGSdb ID) | Accession number(s) | Genome status | Original description |
|---|---|---|---|---|---|---|---|
| 6 | 2 | AU9795 (266) | ASM226143v1; GCF_002261435.1; SAMN06767132 | 7 contigs | Ref. [ | ||
| 8 | 2 | AU8256 (271) | ASM226134v1; GCF_002261345.1; SAMN06767130 | 7 contigs | Ref. [ | ||
| 3 | 2 | AU14378 (287) | ASM226129v1; GCF_002261295.1; SAMN06767136 | 19 contigs | Ref. [ | ||
| 2 | 2 | AU14646 (290) | ASM226147v1; GCF_002261475.1; SAMN06767137 | 13 contigs | Ref. [ | ||
| 4 | 3 | AU18089 (299) | ASM226126v1; GCF_002261265.1; SAMN06767141 | 8 contigs | Ref. [ | ||
| 1 | 1 | AU19157 (302) | ASM211968v1; GCF_002119685.1; SAMN06767142 | Complete | Ref. [ | ||
| 2 | 2 | AU14267 (304) | ASM211974v1; GCF_002119745.1; SAMN06767135 | Complete | Ref. [ | ||
| 2 | 1 | AU16122 (306) | ASM226122v1; GCF_002261225.1; SAMN06767138 | 5 contigs | Ref. [ | ||
| 1 | 1 | AU8856 (308) | ASM226121v1; GCF_002261215.1; SAMN06767131 | 4 contigs | Ref. [ | ||
| 1 | 1 | AU6712 (309) | ASM226135v1; GCF_002261355.1; SAMN06767128 | 3 contigs | Ref. [ | ||
| 1 | 1 | AU7206 (310) | ASM211966v1; GCF_002119665.1; SAMN06767129 | Complete | Ref. [ | ||
| 1 | 1 | AU21707 (303) | ASM226142v1; GCF_002261425.1; SAMN06767143 | 3 contigs | Ref. [ | ||
| 1 | 1 | H050680373 (2171) | FKIF01; GCA_900078705.1; SAMEA3906486 | 10 contigs | This study | ||
| 1 | 1 | FB-8 (1021) | ARNH01; ASM38218v1; DSM24873; GCA_000382185.1; GCF_000382185.1; PRJNA187096; PRJNA200428; SAMN02440498 | 6 contigs | This study | ||
| 1 | 1 | H567 (2694) | ASM170429v1; GCF_001704295.1; SAMN03940794 | Complete | This study | ||
| 1 | 1 | N (2693) | ASM143339v1; GCF_001433395.1; SAMN04219331 | Complete | This study | ||
| 1 | 1 | BT-1-9.2 (2697) | ASM1774559v1; GCF_017745595.1; SAMN13166181 | 68 contigs | This study | ||
| 1 | 1 | DE0060 (2695) | ASM767996v1; GCF_007679965.1; SAMN11792220 | 24 contigs | This study | ||
| 1 | 1 | BOR01 (2696) | ASM1921888v1; GCF_019218885.1; SAMN19994144 | 29 contigs | This study | ||
| 1 | 1 | BMC_SI_3 (2698) | ASM1735624v1; GCF_017356245.1; SAMN18228559 | 14 contigs | This study | ||
| BbGS lineage IIb | 9 (20) | 2 (13) | AU22978 (297) | ASM211970v1; GCF_002119705.1; SAMN06767144 | Complete | This study | |
| n/ac | 1 | 0 | TRE152202 (402) | n/a | n/a | n/a | |
| n/ac | 1 | 0 | AU30427 (383) | n/a | n/a | n/a |
aGenomic species numbers were inherited from genogroup numbering (see ref. [20]) where possible. Genogroups 3, 14 and 15 belong to the recently defined B. bronchialis, B. sputigena and B. flabilis; therefore, the corresponding numbers were not used for genomic species.
bRepresents a separate B. bronchiseptica clade that may be considered as a genomic species per se.
cNumbers 16 and 17 were not used for genomic species, as genogroups 16 and 17 were defined previously based on single gene sequencing and might corresponds to new genomic species, but no genomic sequence is available yet.
Fig. 1Phylogenetic analysis of the Bordetella genus.
The phylogenetic tree was obtained based on the concatenated multiple sequence alignments of the 1415 core gene sequences from the cgMLST_genus scheme; recombination was accounted for using Gubbins. The tree was rooted on the branch leading to isolate FB-8, which was the earliest branching genome in a phylogenetic analysis that included the external group Ralstonia solanacearum isolate IBSBF1503 (Supplementary Fig. 2). Leaves are labelled with the identifier of the isolate in the BIGSdb database, followed by the isolate name. Genomic species are labelled as G.S. An interactive iTOL version of the tree can be accessed at: https://itol.embl.de/shared/1l7Fw0AvKOoCF. Source data are provided as a Source Data file.
Fig. 2Cladogram of the Bordetella bronchiseptica genomic species.
The analysis was performed with 186 B. bronchiseptica genomes, and representatives of the phylogenetic diversity of B. pertussis (9 genomes) and B. parapertussis (9 genomes). The recombination-purged concatenated multiple sequence alignment of 1415 core gene loci (cgMLST_genus scheme) was used. The tree is rooted on lineage II, which is the most divergent clade. Branch lengths were not used to ease readability of groups and isolates; see Supplementary Fig. 5 for the corresponding phylogram. For each isolate, the host is represented using a leaf symbol, where available (see key in Supplementary Fig. 5). The numbers corresponding to T3SS sequence types are indicated along the external circle around the tree; only the identifiers of main T3STs are indicated. An interactive iTOL version of the tree can be accessed at: https://itol.embl.de/shared/1l7Fw0AvKOoCF. Source data are provided as a Source Data file.
Fig. 3Phylogenetic analysis of Bordetella pertussis.
The phylogenetic tree was obtained based on the recombination-purged concatenated multiple sequence alignments of the 2038 core genome loci of the cgMLST_pertussis scheme. The distribution of macrolide resistance is indicated. The three outer circles represent (from innermost): 23S_rRNA alleles, fhaB alleles, prn alleles and the vaccine antigen sequence types (AgST). Lineages IIa and IIb were defined in Bart et al. 2014 and are labelled in blue. An interactive iTOL version of the tree can be accessed at: https://itol.embl.de/shared/1l7Fw0AvKOoCF. Source data are provided as a Source Data file.