Literature DB >> 34989956

Proteomic Characterization and Target Identification Against Streptococcus mutans Under Bacitracin Stress Conditions Using LC-MS and Subtractive Proteomics.

Sahar Zaidi1, Tulika Bhardwaj2, Pallavi Somvanshi2,3, Asad U Khan4.   

Abstract

The aim of the present study, is to identify potential targets against the highly pathogenic bacteria Streptococcus mutans that causes dental caries as well as the deadly infection of endocarditis. The powerful and highly sensitive technique of liquid chromatography-mass spectrometry (LC-MS/MS) identified 321 proteins of S. mutans when grown under stressful conditions induced by the antibiotic bacitracin. These 321 proteins were subjected to the insilico method of subtractive proteomics to screen out potential targets by utilizing different analyses like CD-HIT, non-homologous sequence screening, KEGG pathway, essentiality screening, gut-flora non-homology, and codon usage analysis. A database of essential proteins was employed to find sequence homology of non-paralogous proteins to determine proteins which are essential for bacterial survival. Cellular localization analysis of the selected proteins was done to localize them inside the cell along with physico-chemical characterization and druggability analysis. Using computational tools, 22 proteins out of 321, that are functionally distinguishable from their human counterparts and passed the criterion of a potential therapeutic candidate were identified. The selected proteins comprise central energy metabolic proteins, virulence factors, proteins of the sortase family, and essentiality factors. The presented analyses identified proteins of the sortase family, which appear as key therapeutic targets against caries infection. These proteins regulate a number of virulence factors, thus can be simultaneously inhibited to obstruct multiple virulence pathways.
© 2022. The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.

Entities:  

Keywords:  Bacitracin; Essentiality analysis; LC–MS; S. mutans; Subtractive proteomics

Mesh:

Substances:

Year:  2022        PMID: 34989956      PMCID: PMC8733428          DOI: 10.1007/s10930-021-10038-1

Source DB:  PubMed          Journal:  Protein J        ISSN: 1572-3887            Impact factor:   4.000


Introduction

Streptococcus mutans was first isolated by J. Clarke in 1924, from carious lesions. Nevertheless, it was around 1950s, when this bacterium grabbed attention of a number of scientific communities as well as clinical and animal-based laboratories, which revealed S. mutans as a significant etiologic agent in a number of dental pathologies. S. mutans generally resides inside the human mouth or more precisely the dental plaque, which is a biofilm constructed on tooth’s hard surface by diverse range of bacteria. Besides, this bacterium is also implicated in bacteremia and infective endocarditis, a deadly heart valves inflammation [1, 2]. Unique feature that makes S. mutans one of the most potent cariogenic organisms include the capability of S. mutans to metabolize a broad range of carbohydrates into organic acids (acidogenicity) and also it can flourish even under acidic conditions (aciduricity) [3]. While genetic as well as biochemical approaches have been employed for the past few decades to explore the biology of S. mutans, the whole sequence of genome of S. mutans strain UA159 was only published in 2001, which transformed the landscape dramatically. Currently, S. mutans is among the finest characterized Gram-positive bacteria. The genome of S. mutans that was first sequenced (serotype c UA159 strain) contained ~ 2.0 Mb of DNA, encoding around 2000 genes [4, 5]. However, the transcriptome and proteome of S. mutans has got less attention, though they are known to be significant to understand pathogenesis of any bacteria. Proteome is regarded as cell’s functional moiety that fluctuates in response to the stimuli imposed externally such as drug stress. Under stressful conditions, harmony inside the bacterial cell is disturbed and alternate cellular functions are adopted by bacteria simultaneously that proteome mediates in order to bypass the effect. Consequently, proteome obtained when bacteria is grown under stressful conditions, could unravel some novel pathways that bacteria adopt to avoid perturbations around. Besides, proteomics along with bioinformatics approach could further assist in resolving such biological problems [6, 7]. The current study intends to understand the pathogenesis of the cariogenic bacteria S. mutans. Literature mining was carried out to construct a novel and robust pipeline amalgamating the proteomics analysis employing LC–MS and subtractive proteomics approach to identify potential targets against caries infection. Protein samples of S. mutans grown under bacitracin stress condition and without stress, were subjected to LC–MS. Antibiotic bacitracin was selected, since the drug impacts the functioning of the cell by opting multiple routes and disturbs the total cell’s physiology. And thereby it elevates the chances of obtaining differential expression of maximum number of virulence proteins [8]. Bacitracin obstructs the process of bacterial cell wall biosynthesis as it hinders the critical step of dephosphorylation of C55-isoprenyl pyrophosphate (IPP) and certain other cellular processes like action of a number of hydrolytic enzymes, ubiquinone precursors formation, synthesis of membrane derived oligosaccharides as well as role of membranes in cell division process [9]. Proteins identified by LC–MS then underwent subtractive proteomics approach to identify non-homologs against human proteome, followed by essentiality as well as KEGG pathway analysis. Eventually proteins were screened for non-homologs against gut microflora proteome. Once shortlisted, the qualitative characterization of proteins was done. The putative target proteins that have been shortlisted can act as promising candidates to construct therapeutic to control S. mutans virulence. The aim of this study is to provide resourceful data to researchers, who are exploring targets or working on drug discovery, so as to decode biological queries in a computational tractable method by exploring, filtering and weighting the huge size of proteome -scale data sets.

Materials and Methods

Bacteria, Media, Culturing Conditions and Drug Susceptibility Testing

Institute of Microbial Technology, Chandigarh, India provided the Streptococcus mutans MTCC 497 strain employed in this study. Brain heart infusion broth (BHI) with 1% sucrose (Himedia Labs, Mumbai, India) was used to grow bacteria. Antibiotic bacitracin (Sigma Chemical Company) was utilized at sub-MIC concentrations. Sub-MIC concentration of bacitracin was determined by diluting the stock of 10 mg/ml of antibiotic serially in 96 well microtiter plate by means of microdilution method [10].

Culture and Drug Induction

S. mutans was grown in Brain heart fusion broth (BHI) for overnight at 37 °C and 220 rpm. Secondary culture was grown in two flasks using overnight culture (primary culture), and bacitracin at half of the MIC (39 μg/ml) value was added to one of the flasks labelled as treated while no drug was added to control. Cells in both flasks were grown up to mid log phase (0D600 = 0.5–0.6) and were harvested by the technique of centrifugation operated at 10,000 rpm for 10 min at 4 °C. The precipitated cell pellet obtained was subsequently kept at −80 °C till needed. All the experiments that have been performed in this study were replicated thrice biologically.

Protein Extraction

Washing of bacterial cells was done with normal saline, and then lysis buffer was used to re-suspend the washed cells. Composition of lysis buffer used was 50 mM Tris–HCl containing 0.1% sodium azide, 10 mM MgCl2, 1 mM ethylene glycol tetra acetic acid (EGTA; pH 7.4) and 1 mM phenylmethylsulfonylfluoride (PMSF) at the concentration of 5 ml per 1 g wet weight. Cells were broken down by sonicating, intermittently, at 40% amplitude (Sonics & Materials Inc., Newtown, CT, USA). Homogenate obtained was then subjected to centrifugation operated at the speed of 12,000×g, for the duration of 20 min, at 4 °C. Precipitation of supernatant was done by mixing cold acetone to supernatant in ratio1: 4 and storing at − 20 °C for overnight. After incubation proteins present in the supernatant were precipitated and obtained by centrifuging at 12,000 rpm, for 30 min approximately and were allowed to dry in the air. Finally, in the appropriate volume of dissolving buffer, proteins were solubilized and concentration of protein was calculated with the help of Bradford assay [11].

Separation as Well as Identification of Proteome bynanoLC-TripleTOF5600MS

Trypsin was used to digest protein samples taken in equal amounts that were further studied employing Triple TOF 5600 mass-spectrometer (AB-Sciex, USA) which is provided with Eskigent MicroLC 200 system (Eskigent, Dublin, CA) as well as possesses an Eskigent C18reverse phase column (150 × 0.3 mm, 3 μm, 120 Å). For each sample, data dependent analysis (DDA) was carried out with the purpose of generating spectral library that can be used for protein identification experiment, as well as to produce spectral ion libraries of superior quality to be used for SWATH analysis, with specific parameters to operate the mass spectrometer. Information dependent acquisition (IDA) mode was employed to generate spectral library after inserting tryptic digest of 2 g on column by means of Eksigentnano LC-Ultra™2D plus system that possesses SCIEX Triple TOF ®5600 system equipped with nano sprayIII source. Sample afterwards were loaded on trap (Eksigent Chrom XP 350 μm × 0.5 mm, 3 μm 120 Å) and washing was done for 30 min at the rate of 3μl/min. A gradient in multiple steps (range varying between 5 and 50% acetonitrile in water, with 0.1% formic acid) of 120 min was set up so that peptides from Chrom XP3-C18,0.075 × 150 mm, 3 μm, 120 Å analytical column can be eluted [11].

Information Dependent Acquisition (IDA) Parameters

IDA is the ion library generation procedure was employed in the study. In this up to 20 most strong and multiple charged ions per MS cycle were chosen to achieve MS/MS fragmentation. Each of the ions, were subjected to a dynamic exclusion criterion for 10 s. The accumulation time was adjusted to 70 ms for each MS/MS experiment.

SWATH Parameters for Label Free Quantification

In SWATH acquisition method, adjustment of Q1transmission window to 12 Da from the mass range for 350–1250 Da. With an accumulation duration of 62 ms, sum of 75 windows was attained independently, with each set having three technical replicates. Total cycle time i.e. < 5 s was kept constant. Spectral library was generated by employing Protein Pilot™ v.5.0. Spectral alignment and the peak extraction were carried out for label free quantification with the help of Peak View® 2.2 software with parameters that have been defined example, transitions number as 5, 95% as peptide confidence, peptide number as 2, 30 ppm as XIC width, 3 min as XIC extraction window. Data was then, processed by Marker View software V1.3 (AB Sciex) in order to obtain interpretation of statistical data. In Marker View, normalization of peak area underneath the curve for the peptides that were selected was done with internal standard protein (beta galactosidase) spike throughout the SWATH accumulation. Results displayed as AUC of the ions; ions total intensity for the peptide as well as peptide total intensity for protein in the form of three output files. SWATH Acquisition Micro App 2.0inPeakView® Software. 2.7 was used to process all SWATH Acquisition data.

Data Analysis

Protein Pilot Software v. 5.0 (AB SCIEX, Foster City, CA) was employed to process Data analysis by using Paragon and Pro group Algorithm. Analysis was also completed by means of integrated tools in Protein Pilot at the 1% false discovery rate (FDR).

Subtractive Proteome Screening

CD-Hit Analysis

Identified proteome dataset was subjected to CD-HIT analysis to remove duplicity in dataset based on clustering algorithm. An identity cut-off of 0.6 (60% identity) was selected to screen proteomes as proteins sharing 60% identity with relatively similar structure and functional domains. A bandwidth of 20 amino acids was selected to perform sequence based global alignment [12].

Non-homologous Sequence Analysis

Pathogen specific screening of drug targets was supported by performing non-homology search against human proteome. This prevents the cross reactivity and binding of therapeutic compounds with host proteome. Therefore, non-redundant protein sequence (nr) database of host Homo sapiens (taxID: 9606) was subjected to similarity search analysis against bacterial proteome dataset (321) employing BLASTp at threshold cut-off e-value of 0.5 with identity of 60% [13, 14].

Essentiality Analysis

Non-homologous dataset underwent similarity screening using Database of Essential Genes (DEG) version 15.2. The complete screening was performed using BLASTp against experimentally validated 53,885 essential proteins along with 786 essential non-coding sequences of pathogens. Proteins hit with expectation value ≤ 10–100, identity ≥ 25% and minimum bit score100 as parameters were considered as essential proteins of the pathogen [12-14].

KEGG Pathway Analysis

Kyoto Encyclopedia of Genes and Genomes (KEGG) organism code was utilized for the retrieval of metabolic pathways of disease-causing bacteria and host Homo sapiens. KAAS server at KEGG database was utilized to identify essential proteins participating in pathways specific for pathogen. KAAS offers genes functional annotation by means of BLAST comparison compared with the manually curated database of KEGG GENES. KO (KEGG Orthology) assignments define metabolic proteins. KAAS also produces KEGG pathways which display these metabolic proteins [12].

Non-homology Analysis Against Gut-Flora

This analysis was performed to filter out essential proteome content sharing structural similarity with gut microbe proteome as it prevents toxicity caused by gut microbe proteome interaction with drugs. In-house temporary database containing proteome dataset of gut microbes was developed using MySQL and PHP (Hypertext Preprocessor). Pathogen specific essential protein dataset then underwent non-homology sequence similarity screening against gut microbe proteome using BLASTp at bit score of 10 and threshold e-value of 10–10 and identity of 0.6 (60% similarity) [13, 14].

Codon Usage Analysis

Codon adaptation Index (CAI) defines the geometric mean of relative fitness values for each synonymous codon which ranges between 0 and 1. Higher the expression values of essential genes higher is the suitability for consideration as drug target [15].

Qualitative Characterization

Sub Cellular Localization Analysis

This analysis categorizes essential proteins significance of serving as either drug or vaccine target. CELLO 2.0 [16], a two-level support vector machine tool comprising 1444 and 7589 bacterial and eukaryotic proteins, respectively, were taken as training dataset was utilized to identify the protein subcellular localization of prioritized proteins after gut-flora non-homology analysis [13, 14, 16].

Physico-Chemical Characterization

ProtParam tool (https://web.expasy.org/protparam/) at Expasy webserver enables physico-chemical characterization of essential proteome datasets. It is used to determine isoelectric point, molecular weight, nature of charge (positive or negative), GRAVY, extinction coefficient of prioritized proteomes, aliphatic as well as instability index [13].

Druggability Analysis

DrugBank 3.0 [17], including therapeutics that have been validated experimentally and approved by FDA along with target details was employed and probable (E-value) cut off of 10−5 was included to characterize potential targets capable of binding to a number of therapeutics with high affinity. Additionally, Therapeutic Targets Database (TTD) was investigated and studied to explore targets for nucleic acid, infections or diseases that can be targeted, pathway details as well as the equivalent drugs molecules aiming specifically the targets possessing significant E-value i.e. < 1[12, 16].

Results

Protein Identification Using LC–MS/MS

S. mutans was cultured under bacitracin drug stress (half-MIC i.e., 39 μg/ml) [18]. The proteome of S. mutans was analyzed by LC–MS/MS, by means of SWATH workflow. Proteins were enumerated at 1% FDR. 321 proteins were identified and the scale of differential expression was characterized using log folds change vs. p-value ratio, tabulated in Supplementary file 1 (S1). These 321 proteins underwent subtractive proteome scrutiny to further cut down the list.

Non-homology Analysis

Proteins with human homologs were eliminated and only non-human homologs were screened out employing comparative non-homology proteome analysis. This analysis identified approximately 67% evolutionary relatedness among pathogen and host [15]. Among 321 pathogen proteome dataset, 151 protein sequences were identified non-homologous (S2; Sheet 1) to host Homo sapiens after BLASTp at NCBI server with 0.005 (E = 0.005) as E-value cutoff and also a match of greater than 50% of query length. The core proteome dataset constituting essential proteins, which are necessary for bacterial existence as well as persistence. Thus, essential proteins are regarded as potential targets for identification of novel and innovative chemical scaffolds for the purpose of drug development. Several studies have demonstrated the potential of essential proteins towards synthetic biology [19]. Functional codes were utilized to define such system of classification in accordance with COG (Cluster of Orthologous Groups of proteins) classification [13, 15]. The collection of data containing pathogen sequences that are non-homologous to humans were then employed to predict their essentiality. It utilized BLASTp, with the dataset of essential proteins of bacteria employing DEG version 15.2 with an E-value cutoff of 0.0001. In the list of essential proteins (S2 Sheet 2), that were identified, the greatest value of significant similarities with S. sanguinis (35 hits), Francisella novicida U122 (16 hits), M. tuberculosis H37Rv (8 hits), S. pneumoniae R6 (5 hits), Helicobacter pylori 26,695 (2 hits) along with Mycoplasma pulmonis UAB CTIP (1 hit). Pathways regulating metabolism of the host Homo sapiens as well as the pathogen were retrieved from the KEGG server. These pathways were grouped into different categories, namely: metabolism, genetic information processing, and environmental information processing. Manual comparative analysis among host and pathogen-related pathways indicates the presence of 21 pathogen-related unique pathways. The KAAS webserver helps in the identification of participation of the essential proteins identified (67) towards major metabolic pathways of the pathogen in terms of percentage. After obtaining the score and related metabolic pathways from KAAS for each essential protein, a percentage calculation of participation was performed using in-house pipeline and cross-validated manually [15] (S2 Sheet 3). The majority of essential proteins participate in biosynthesis of antibiotics (19%) [20, 21], biosynthesis of secondary metabolites (12%) [22], lysine metabolism(11%) [23], monobactam biosynthesis (9%) [24], peptidoglycan biosynthesis and degradation proteins (8%) [25, 26], methane metabolism (6%) [27], starch and sucrose metabolism (5%) [28], two component system (TCS) (6%) [29, 30], pantothenate and CoA biosynthesis (5%) [31, 32], 2-Oxocarboxylic acid metabolism (5%) [33], phenylalanine, tyrosine and tryptophan biosynthesis (2%) [34, 35], vancomycin resistance (2%) [36], cationic antimicrobial peptide (CAMP) resistance (4%) [37], Jacobs terpenoid backbone biosynthesis (3%) [38], metabolism of bacteria in diverse environments (1%) [39]and nicotinate and nicotinamide metabolism (1%) [40] as shown in Fig. 1.
Fig. 1

KEGG Pathway analysis identifies 21 pathways unique to bacteria only. KAAS webserver encodes majority of the selected proteins involved in the biosynthesis of antibiotics (19%) followed by secondary metabolites biosynthesis (12%) and lysine metabolism (11%)

KEGG Pathway analysis identifies 21 pathways unique to bacteria only. KAAS webserver encodes majority of the selected proteins involved in the biosynthesis of antibiotics (19%) followed by secondary metabolites biosynthesis (12%) and lysine metabolism (11%) Vitamin synthesis in human gut is assisted by microflora of the gut [41, 42] along with other processes that gut microflora performs like degradation of xenobiotics [43] and assimilation of incompletely digested components particularly dietary compounds [44]. Furthermore, they avert pathogenic as well as opportunistic bacteria present in the gut from establishing a foothold, hence are crucial for human wellbeing. For the development of a temporary database, FASTA sequences of the Human Microbiome Project database Reference proteome were mined [45]. The essential proteome dataset was subjected to sequence-based similarity screening at default parameters. Also, non-homology screening against gut-flora prioritizes sequences showing fewer than the identity of 30% and identified 22 proteins as potential drug targets (S2 Sheet 4), The computation of CAI values assists in scrutinizing non-homologous essential proteins, expressing their translational efficiency ranging from 0 to 1. The prioritized protein sequences support major criteria of essentiality and selectivity of proteins to serve as targets (Table 1). The higher the translational efficiency of the protein, the higher the potential to serve as a drug target [46]. Therefore, we further subjected the dataset to qualitative characterization, including sub-cellular localization, physico-chemical characterization, and druggability analysis. (An illustration of all proteome subtractive methods is given in Table 2.
Table 1

List of 22 potential target, their function, location and codon adaptation index

S.No.Potential targetsFunctionLocationCodon adaptation index
1Glucose-6-phosphate isomerase

Glycolysis

Biosynthesis of antibiotics

Biosynthesis of secondary metabolites

Cytoplasmic0.76
24-Hydroxy-tetrahydrodipicolinate reductase

Lysine biosynthesis

Monobactam biosynthesis

Cytoplasmic0.80
3Glycogen synthaseBiosynthesis of secondary metabolitesCytoplasmic0.71
4Glucan-binding protein GbpCBacterial adherence and extracellular polymeric substance (EPS) synthesisExtracellular0.75
5Cell surface antigen SpaPAdherenceExtracellular0.91
6Thiol peroxidaseH2O2 Receptor and Redox-TransducerCytoplasmic0.87
7Levansucrase

Two-component system

Starch and sucrose metabolism

Extracellular0.89
8Phospho-2-dehydro-3-deoxyheptonate aldolase

Phenylalanine, tyrosine and tryptophan biosynthesis

Biosynthesis of secondary metabolites

Cytoplasmic0.79
9ATP synthase F0F1 subunit alphaOxidative phosphorylationCytoplasmic0.92
10ATP synthase F0F1 subunit betaOxidative phosphorylationCytoplasmic0.87
11Glucosyltransferase-SI (GtfC)Two-component systemExtracellular0.88
12Glucosyltransferase-I (GtfB)Two-component systemExtracellular0.89
13secreted antigen GbpB/SagAPeptidoglycan biosynthesis and degradation proteinsExtracellular0.92
14Formate acetyltransferase/ Pyruvate formate–lyasePyruvate and butanoate metabolismCytoplasmic0.93
15Fructose-1,6-biphosphate aldolaseGlycolysisCytoplasmic0.80
16Endolytic murein transglycosylaseTerminates nascent peptidoglycan synthesisExtracellular0.88
17L-lactate dehydrogenase

Glycolysis / Gluconeogenesis

Cysteine and methionine metabolism

Cytoplasmic0.82
18Ketol-acid reductoisomerase

Pantothenate and CoA biosynthesis

2-Oxocarboxylic acid metabolism

Cytoplasmic0.92
19Aspartate-semialdehyde dehydrogenase

Lysine biosynthesis

Monobactam biosynthesis

Cytoplasmic0.97
20Enolase

Glycolysis

Methane metabolism

Cytoplasmic0.85
21UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1Peptidoglycan biosynthesisCytoplasmic0.95
22ATP-dependent protease ClpEGenetic information processingCytoplasmic0.79
Table 2

Details of the proteome subtractive and qualitative characterization methods

Streptococcus mutans UA159
ModuleProteome subtraction methodsSoftwaresTotal datasetSelectedExcluded
1Total proteome contentCD-Hit32129922
Non-homology analysisBLASTp299151148
2Essentiality analysisDEG1516784
Pathway analysisKEGG674522
Gut-flora non-homology analysisBLASTp452223
Codon usage analysisCAI22ALL
3Subcellular localization analysisCELLO2222
Physicochemical characterizationProtParam2222
Druggability analysisDrugBank2222
List of 22 potential target, their function, location and codon adaptation index Glycolysis Biosynthesis of antibiotics Biosynthesis of secondary metabolites Lysine biosynthesis Monobactam biosynthesis Two-component system Starch and sucrose metabolism Phenylalanine, tyrosine and tryptophan biosynthesis Biosynthesis of secondary metabolites Glycolysis / Gluconeogenesis Cysteine and methionine metabolism Pantothenate and CoA biosynthesis 2-Oxocarboxylic acid metabolism Lysine biosynthesis Monobactam biosynthesis Glycolysis Methane metabolism Details of the proteome subtractive and qualitative characterization methods CELLO version 2 assists in the selective distribution of prioritized drug targets within a bacterial cell. The sorting of protein sequences was mainly in the cell membrane, the extracellular matrix, and cytoplasmic domains (Table 1). The physico-chemical properties of essential protein sequences were computed using ProtParam based on amino acid residue composition. Instability index, aliphatic index, and extinction coefficient predict the in-vivo lifespan of protein sequences which are considered as targets. Such analysis supports quality control and assurance procedure during drug development [47, 48]. Instability index > 40 predicts stability of proteins in-vivo experiments. The space that aliphatic side chains occupy represents aliphatic index while GRAVY is an indicator of sum of hydropathy values all amino acids divided to the total number of residues of amino acids in a protein. These parameters of selected drug targets are listed in Table 3
Table 3

List of thephysico-chemical properties of selected potential targets

Potential drug targetsNumber of Amino acidsMolecular weightIsoelectric Pointnegatively charged residuespositively charged residuesaliphatic indexInstability indexExtinction coefficient (M−1 cm−1)GRAVYNature
Glucose-6-phosphate isomerase44949,421.664.84634588.6927.2654,780− 0.232Stable
4-Hydroxy-tetrahydrodipicolinate reductase25527,804.825.07392769.5120.48239,060− 0.063Stable
Glycogen synthase47654,311.805.69453687.0132.117572− 0.287Stable
Glucan-binding protein GbpC58363,349.906.752020104.9325.6745760.079Stable
Cell surface antigen SpaP1562169,971.935.33531104.5223.7216,008− 0.223Stable
Thiol peroxidase16117,552.715.083631122.8327.0324,535− 0.061Stable
Levansucrase79587,384.685.8272293.4421.9213,076− 0.114Stable
Phospho-2-dehydro-3-deoxyheptonate aldolase34338,718.655.53434277.3339.8422,256− 0.455Stable
ATP synthase F0F1 subunit alpha50154,372.845.422815745047.8719,878− 0.094Unstable
ATP synthase F0F1 subunit beta68975,218.114.884530101.0240.222,045− 0.473Unstable
Glucosyltransferase-SI1455162,966.245.284136100.133.514,900− 0.151Stable
Glucosyltransferase-I1476165,846.815.096853103.6830.3225,245− 0.16Stable
secreted antigen GbpB/SagA43144,620.429.46666494.6527.1817,880− 0.219Stable
Formate acetyltransferase77587,605.805.27493892.721.6226,860− 0.321Stable
Fructose-1,6-biphosphate aldolase29331,425.805.46322699.9631.7813,5350.007Stable
Endolytic murein transglycosylase61468,543.366.613736102.5920.1916,180− 0.226Stable
l-lactate dehydrogenase32835,245.095.85333196.2825.9714,440− 0.089Stable
Ketol-acid reducto isomerase34037,285.335.53685692.7927.137,650− 0.277Stable
Aspartate-semialdehyde dehydrogenase35838,903.316.21454189.5429.8523,452− 0.365Stable
Enolase43246,857.619.32569287.6238.947,655− 0.555Stable
UDP-N-acetylglucosamine 1-carboxyvinyl-transferase 142345,616.654.95453392.4634.5511,564− 0.51Stable
ATP-dependent protease ClpE75383,733.905.593155100.3422.8717,886− 0.195Stable
List of thephysico-chemical properties of selected potential targets Similarity search analysis against the therapeutics approved by the FDA, nutraceutical as well as experimental small-molecule compounds using BLASTp rendered 22 potential drug targets. Shortlisted and selected targets can be exploited for drug designing as they are harmless to the host (Table 4).
Table 4

Listing of FDA approved drugs (Drug names, DrugBank ID) for the 22 screened potential drug targets

S.No.Drug targetDrug nameDrugBank ID
1Glucose-6-phosphate isomerase

Vitafol-one

Artenimol

DB09130

DB11638

24-Hydroxy-tetrahydrodipicolinate reductaseDipicolinic acidDB04267
3Glycogen synthase

Valproate

Tideglusib

DB00313

DB12129

4Glucan-binding protein GbpC

Eraxis

Vfend

DB00362

DB00582

5cell surface antigen SpaPGleevecDB00619
6Thiol peroxidaseS-oxy-l-cysteineDB03382
7LevansucraseSucroseDB02772
8Phospho-2-dehydro-3-deoxyheptonate aldolase

Phosphoenolpyruvate

D-erythrose 4-phosphate

DB01819

DB03937

9ATP synthase F0F1 subunit alpha

Aurovertin B

Piceatannol

Quercetin

Artenimol

DB07394

DB08399

DB04216

DB11638

10ATP synthase F0F1 subunit beta

Aurovertin B

Piceatannol

Quercetin

Artenimol

DB07394

DB08399

DB04216

DB11638

11Glucosyltransferase-SIBezlotoxumabDB13140
12Glucosyltransferase-IBezlotoxumabDB13140
13Secreted antigen GbpB/SagA

Inebilizumab

Blinatumomab

Tisagenlecleucel

DB12530

DB09052

DB13881

14Formate acetyltransferase

OxamicAcid

D-Treitol

DB03940

DB03278

15Fructose-1,6-biphosphate aldolaseArtenimolDB11638
16Endolytic murein transglycosylaseBicineDB03709
17l-lactate dehydrogenase

Oxamic Acid

Artenimol

DB03940

DB11638

18Ketol-acid reductoisomeraseCocarboxylaseDB01987
19Aspartate-semialdehyde dehydrogenase

Nicotinamide adenine dinucleotide phosphate

Aspartate Semialdehyde

DB03461

DB04498

20EnolaseSodium fluorideDB09325
21UDP-N-acetylglucosamine 1-carboxyvinyl-transferase 1

Aminomethylcyclohexane

8-Anilinonaphthalene-1-sulfonic acid

DB02435

DB04474

22ATP-dependent protease ClpEBismuth subcitrate potassiumDB09275
Listing of FDA approved drugs (Drug names, DrugBank ID) for the 22 screened potential drug targets Vitafol-one Artenimol DB09130 DB11638 Valproate Tideglusib DB00313 DB12129 Eraxis Vfend DB00362 DB00582 Phosphoenolpyruvate D-erythrose 4-phosphate DB01819 DB03937 Aurovertin B Piceatannol Quercetin Artenimol DB07394 DB08399 DB04216 DB11638 Aurovertin B Piceatannol Quercetin Artenimol DB07394 DB08399 DB04216 DB11638 Inebilizumab Blinatumomab Tisagenlecleucel DB12530 DB09052 DB13881 OxamicAcid D-Treitol DB03940 DB03278 Oxamic Acid Artenimol DB03940 DB11638 Nicotinamide adenine dinucleotide phosphate Aspartate Semialdehyde DB03461 DB04498 Aminomethylcyclohexane 8-Anilinonaphthalene-1-sulfonic acid DB02435 DB04474

Discussion

Highly pathogenic Gram-positive bacteria Streptococcus mutans, is the chief causal organism in the pathogenesis of dental caries in humans as well as it may also be the source of fetal infection of endocarditis [49]. The objective of the present study was to find out new and more effective drug targets against this cariogenic bacteria S. mutans, so as to propose the alternative potential therapeutics to combat oral infections. In this study, we employed LC–MS based quantitative proteomics, which is a powerful analytical technique, that is being increasingly utilized for a varied range of biological applications owing to its increasing capabilities for broad proteome coverage, high sensitivity, specificity as well as precision in quantification [50]. LC–MS along with subtractive proteomics approach which has been documented to be a powerful method to identify unique yet uncharacterized sequences as possible therapeutic targets. Besides, the in silico subtractive proteomics is the most efficient, time saving and economical technique, on condition that both proteome of pathogen and host are accessible. This approach involves numerous analyses at multiple stages that generally employ BLAST [15, 51]. The S. mutans grown in the presence of bacitracin. Bacitracin is an antibiotic that blocks the biogenesis of cell-wall polysaccharides such as peptidoglycan. The process is facilitated through an efficient carrier for example undecaprenylphosphate (C55–P). Sugar linking to C55–P occurs at the membrane’s inner face then its translocation takes place towards periplasm, where glycan moiety is transferred to the growing polymer, thereby it anchors peptidoglycan units in the membrane. C55P is derived when the precursor undecaprenyl pyrophosphate (C55PP), is dephosphorylated. This dephosphorylation step is obstructed by bacitracin by sequestering C55PP, thereby reducing the pool of C55P available to transport glycan units to the cell wall thereby disturbing cell integrity and causing lysis [18]. When the whole proteome extract of bacteria grown under bacitracin stress subjected to LC–MS, a panel of 321 proteins was identified. At the outset, the duplicity in data set of 321 proteins was removed. For this, the protein panel was subjected to CD-Hit, which clusters those proteins that cross the similarity threshold. In our data, proteins that were found to have 60% sequence or functional similarity with other proteins of the data set were eliminated. The remaining proteins were then subjected to non-homologous analysis, which involves pathogen-specific screening of drug targets against human proteome [52]. In this way cross reactivity as well as binding of therapeutic compounds with the host proteome can be averted. The Non-homology analysis is particularly the foremost step in any in silico drug target identification method to avoid any off-target side-effects. Subsequently, those proteins that are necessary for bacterial survival as well as persistence were identified through essentiality analysis by employing Database of Essential Genes (DEG). These essential proteins are regarded as potential targets for identification of novel and innovative chemical scaffolds for the purpose of drug development. These proteins comprise sequences grouped according to their biological functions, that is, (a) information storage and processing (DNA and RNA metabolism), (b) protein processing, folding and secretion, (c) cellular processes (division of cell and transport), (d) energetic and intermediary metabolism (glycolysis, pentose phosphate pathway, lipid metabolism, cofactors as well as nucleotides biosynthesis, production of proton-motive force), (e) and poorly characterized [12, 19]. The results of DEG were subjected to KEGG Automated Annotation Server (KAAS), an online server, to investigate the involvement of proteins in different essential metabolic pathways in KEGG. Metabolic pathways shared among the pathogen and host are categorized as common pathways while unique pathways are those which are present in pathogen only. Human metabolic pathways are not included in the bacterial metabolism [53]. This analysis was performed on non-homologous essential genes of S. mutans. At first, metabolic pathway analysis is performed on the non-homologous essential proteins by KAAS to identify the metabolic pathway of the target. Further in this study, comparative pathway method is performed among human metabolic pathways and pathogen pathway to identify “unique pathways”. Non-homologous essential proteins among following unique metabolic pathways can be mapped, and these proteins can be key targets for the treatment of diseases and avoiding all possibilities of side effects [12, 52]. Out of the total sequences of proteins selected by the KAAS server, maximum sequences were found to be involved in the biosynthesis of antibiotics, followed by biosynthesis of secondary metabolites, lysine metabolism, monobactam and peptidoglycan biosynthesis etc. as shown in Fig. 1. In order to screen out essential proteins having structural similarity with the proteome of Gut microbiota and subsequently to prevent off-target effects, non-homology analysis against gut microflora was performed (Table 2). Suitability of essential proteins to be considered as potential targets depends on two criteria: (a) essentiality and (b) selectivity. Essential proteins constitute the foundation of organism whereas selectivity depends on expression levels of genes. The efficiency of translating mRNA to protein depends partially on the coding strategy of an mRNA and is reflected in codon usage bias which is often measured by the codon adaptation index. CAI gives the geometric mean of relative fitness values for each synonymous codon which lies between 0 and 1 [15, 54]. CAI of the proteins shortlisted was performed to find out their suitability (Fig. 2).
Fig. 2

Protein count for S. mutans MTCC 497 strain employing subtractive proteome analysis

Protein count for S. mutans MTCC 497 strain employing subtractive proteome analysis 22 proteins obtained after series of subtractive proteomics methods were considered as therapeutic targets against caries infection. These proteins were further subjected to qualitative characterization to localize them within the cell and to know their physio-chemical properties (Table 3). A significant aspect of possible drug target is its localization. Protein targets should be at appropriate cell’s compartment so they can optimally perform their functions. Besides, drug needs to bind with the target to act, therefore the protein targets whereabouts in cell should be known to design appropriate drug compound. Subcellular localization of non-homologous protein sequences was performed by CELLO 2.0 [55, 56]. The sorting of protein sequences was mainly found in cell membrane, extracellular matrix and cytoplasmic domains. Physicochemical characterization which seeks to define physical as well as chemical properties based on amino acid residue composition was computed using ProtParam [57]. Druggability analysis of 22 shorlisted proteins was also performed to predict if the protein can bind to drug-like molecules (Table 4). Identified 22 therapeutic candidates have been illustrated in file S3. (Function as well as the physiochemical properties of 22 shortlisted proteins is listed in Table 1 and 3). Out of total 22 possible targets identified by this study, sortase family of proteins, which represent highly conserved transpeptidase family have been documented to be critical for bacterial virulence. In S. mutans sortases help in anchoring a number of virulence surface proteins such as FruA, GbpC, Pac, WapA and Dex to the cell wall. These proteins mediate bacterial adherence with the tooth surface, that results in the formation of biofilm. Further, sortases are present at the center of a pathway that controls numerous virulence factors. Consequently, if they are targeted, it may result in hindering multiple virulence pathways simultaneously [58]

Conclusion

Despite enormous efforts, dental caries is still considered as one of the highly prevalent, though preventable disease around the globe. Efforts for an effective drug and vaccine development to combat infection have been made in many countries for decades but have not gained much success. In this regard, proteomics along with subtractive in silico methods is the most suitable approach to find out the proteins involved in virulence of S. mutans. With the help of hierarchical in silico novel approach, this study has presented a list of 22 potential anti-caries drug candidates along with their qualitative characterization. It has been explored that a category of essential non-human homologous, central metabolic proteins fit in the scope of potential drug targets due to two reasons: (1) Either the targets identified don’t have any human or gut microbiome counterparts or they appear to use some alternative catalytic mechanisms. Accordingly, an inhibitor can be designed that would not obstruct its human cousin. (2) secondly, targeting energy-making machinery of bacteria circumvents the usual pathogenesis and thereby paving the path to multi-faceted strategies to combat the infection of dental caries. Shortlisted proteins were qualitatively characterized to find if they can gain access into pipelines of drugs as well as vaccine development. It is being anticipated that results obtained in the present study can provide platform to design vaccine or targets against caries infection. Below is the link to the electronic supplementary material. Supplementary file: S1 represents 321 proteins identified by LC-MS and scale of differential expression was characterized using log fold change. Out of 321 proteins identified, 98 proteins were found overexpressed and 223 were suppressed. (XLSX 29 kb) Supplementary file: S2 Table showing meta-analysis of proteome dataset at each level employing subtractive proteomics analysis. (XLSX 16 kb) Supplementary file: S3 Table showing 22 potential targets searched upon UniProtKB database (https://www.uniprot.org/) mapped upon Streptococcus mutans (strain:UA159). (XLSX 13 kb)
  54 in total

1.  Inhibition of phenylalanine and tyrosine synthesis on Streptococcus faecalis and Lactobacillus arabinosus by alpha-keto acids.

Authors:  J T HOLDEN
Journal:  Arch Biochem Biophys       Date:  1956-03       Impact factor: 4.013

Review 2.  Microbes in gastrointestinal health and disease.

Authors:  Andrew S Neish
Journal:  Gastroenterology       Date:  2008-11-19       Impact factor: 22.682

3.  Structural and molecular basis of the role of starch and sucrose in Streptococcus mutans biofilm development.

Authors:  M I Klein; S Duarte; J Xiao; S Mitra; T H Foster; H Koo
Journal:  Appl Environ Microbiol       Date:  2008-11-21       Impact factor: 4.792

Review 4.  Host-microbial symbiosis in the mammalian intestine: exploring an internal ecosystem.

Authors:  L V Hooper; L Bry; P G Falk; J I Gordon
Journal:  Bioessays       Date:  1998-04       Impact factor: 4.345

Review 5.  Stress-induced remodeling of the bacterial proteome.

Authors:  Monica S Guo; Carol A Gross
Journal:  Curr Biol       Date:  2014-05-19       Impact factor: 10.834

6.  The two-component system VicRK regulates functions associated with Streptococcus mutans resistance to complement immunity.

Authors:  Livia A Alves; Erika N Harth-Chu; Thais H Palma; Rafael N Stipp; Flávia S Mariano; José F Höfling; Jacqueline Abranches; Renata O Mattos-Graner
Journal:  Mol Oral Microbiol       Date:  2017-05-25       Impact factor: 3.563

7.  Proteome profiling of carbapenem-resistant K. pneumoniae clinical isolate (NDM-4): Exploring the mechanism of resistance and potential drug targets.

Authors:  Divakar Sharma; Anjali Garg; Manish Kumar; Asad U Khan
Journal:  J Proteomics       Date:  2019-04-03       Impact factor: 4.044

Review 8.  Metal binding and structure-activity relationship of the metalloantibiotic peptide bacitracin.

Authors:  Li-June Ming; Jon D Epperson
Journal:  J Inorg Biochem       Date:  2002-07-25       Impact factor: 4.155

9.  PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes.

Authors:  Nancy Y Yu; James R Wagner; Matthew R Laird; Gabor Melli; Sébastien Rey; Raymond Lo; Phuong Dao; S Cenk Sahinalp; Martin Ester; Leonard J Foster; Fiona S L Brinkman
Journal:  Bioinformatics       Date:  2010-05-13       Impact factor: 6.937

10.  Subtractive proteomics to identify novel drug targets and reverse vaccinology for the development of chimeric vaccine against Acinetobacter baumannii.

Authors:  Vandana Solanki; Vishvanath Tiwari
Journal:  Sci Rep       Date:  2018-06-13       Impact factor: 4.379

View more
  1 in total

1.  Inhibitory Effect of Bacillus licheniformis Strains Isolated from Canine Oral Cavity.

Authors:  Natália Šurín Hudáková; Jana Kačírová; Miriam Sondorová; Svetlana Šelianová; Rastislav Mucha; Marián Maďar
Journal:  Life (Basel)       Date:  2022-08-15
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.