| Literature DB >> 34131152 |
François Bonnardel1,2,3, Stuart M Haslam4,5, Anne Dell4,5, Ten Feizi5,6, Yan Liu5,6, Virginia Tajadura-Ortega5,6, Yukie Akune6, Lynne Sykes5,7,8, Phillip R Bennett5,7,8,9, David A MacIntyre10,11,12, Frédérique Lisacek13,14,15, Anne Imberty16.
Abstract
Bacteria use carbohydrate-binding proteins (CBPs), such as lectins and carbohydrate-binding modules (CBMs), to anchor to specific sugars on host surfaces. CBPs in the gut microbiome are well studied, but their roles in the vagina microbiome and involvement in sexually transmitted infections, cervical cancer and preterm birth are largely unknown. We established a classification system for lectins and designed Hidden Markov Model (HMM) profiles for data mining of bacterial genomes, resulting in identification of >100,000 predicted bacterial lectins available at unilectin.eu/bacteria. Genome screening of 90 isolates from 21 vaginal bacterial species shows that those associated with infection and inflammation produce a larger CBPs repertoire, thus enabling them to potentially bind a wider array of glycans in the vagina. Both the number of predicted bacterial CBPs and their specificities correlated with pathogenicity. This study provides new insights into potential mechanisms of colonisation by commensals and potential pathogens of the reproductive tract that underpin health and disease states.Entities:
Year: 2021 PMID: 34131152 PMCID: PMC8206207 DOI: 10.1038/s41522-021-00220-9
Source DB: PubMed Journal: NPJ Biofilms Microbiomes ISSN: 2055-5008 Impact factor: 7.290
Fig. 1Bacterial lectin folds.
a Distribution of bacterial lectin folds derived from the UniLectin3D database. From the analysis of fold distribution of bacterial lectin crystal structures, the six most frequent folds are detailed. b β-Sandwich/pili and adhesins fold representative: Escherichia coli PapG in complex with GalNAc(β1-3)Gal(α1-4)Gal(β1-4)Glc (PDB code 1J8R); c α/β OB fold: E. coli SLT-1 with Gal(α1-4)Gal(β1-4)Glc (1BOS); d β-Trefoil fold: Clostridium tetani TeNT with GT1b ganglioside NeuAc(α2-3)Gal(β1-3)GalNAc(β1-4)[NeuAc(α2-8)NeuAc(α2-3)]Gal(β1-4)Glc (1FV2); e β-Sandwich/2 calcium lectin fold: Pseudomonas aeruginosa LecB with Gal(β1-4)GlcNAc(β1-3)Gal(β1-4)[Fuc(α1-3)]Glc (1W8F); f β-Propeller fold: Ralstonia solanacearum RSL with Fuc(α1-2)Gal(β1-2)Xyl (2BS6); and g β-Sandwich with galactose-binding domain-like fold: P. aeruginosa LecA/Gal(α1-3)Gal(β1-4)Glc (2VXJ). 3D structures were generated using LiteMol[85] with terminal monosaccharides at binding sites represented using Symbol Nomenclature for Glycans (SNFG)[86].
Fig. 2Distribution of structural folds in predicted bacterial lectins based on UniLectin3D lectin classes.
The distribution of the predicted lectin classes is presented as horizontal boxes and whisker plots coloured on the basis of lectin class origin. The whisker plot represents the minimum, maximum, median, first quartile and third quartile in each class. Values approaching 1 are indicative of high sequence similarity to the reference motif. The predicted lectins in [0.25–0.5] and [0.5–1] score intervals are presented as bar graphs. The total number of predicted lectins in each class is listed in Supplementary Table 1.
Fig. 3Distribution of predicted lectomes classified by fold and class in different vaginal commensal, and confirmed and potentially pathogenic bacterial species.
In the margins, commensal species are indicated by green, and confirmed and potentially pathogenic species by red. Colours within each class of lectin reflect their predicted glycan-binding specificity, indicated as the monosaccharide with most contacts at the binding site of crystal structures available, and are represented using the Symbol Nomenclature of Glycans (SNFG) (https://www.ncbi.nlm.nih.gov/glycans/snfg.html). The lectin classes circled in red are further discussed in the ‘Results’ section to highlight their particular presence in L. iners. Accession numbers are listed in Supplementary Table 4.
Fig. 4Distribution of predicted lectins and CBMs in different vaginal commensal, and confirmed and potentially pathogenic bacterial species arranged by domain composition similarity.
Colours (following the SNFG nomenclature) within each class of lectins reflect its main sugar-binding specificities referred to in Fig. 3. The domains highlighted are further discussed in the results due to their presence in L. iners. The additional positive correlation of the number of CBMs distinguishes between commensal, and confirmed and potentially pathogenic bacteria. Accession numbers are available in Supplementary Table 4.
Fig. 5Hierarchical radial tree of predicted classes of carbohydrate-binding proteins in vaginal bacteria.
(a) Predicted lectin classes only or (b) lectin classes and predicted carbohydrate-binding modules in vaginal commensal (green), and confirmed and potentially pathogenic (red) bacteria. The ubiquitous LysM and CBM50 are excluded from the dataset to generate the hierarchical radial tree. Although the majority of Lactobacillus species clustered closely to each other, indicating similar putative lectomes, the lectome of L. iners isolates more closely resembled that of confirmed and potential pathogens.