| Literature DB >> 27229861 |
Ashok K Sharma1, Sanjiv Kumar1,2, Harish K1, Darshan B Dhakan1, Vineet K Sharma3.
Abstract
BACKGROUND: The efficacy of antibiotics against bacterial infections is decreasing due to the development of resistance in bacteria, and thus, there is a need to search for potential alternatives to antibiotics. In this scenario, peptidoglycan hydrolases can be used as alternate antibacterial agents due to their unique property of cleaving peptidoglycan cell wall present in both gram-positive and gram-negative bacteria. Along with a role in maintaining overall peptidoglycan turnover in a cell and in daughter cell separation, peptidoglycan hydrolases also play crucial role in bacterial pathophysiology requiring development of a computational tool for the identification and classification of novel peptidoglycan hydrolases from genomic and metagenomic data.Entities:
Keywords: Carboxypeptidase; Cell wall hydrolases; Endopeptidase; Lytic transglycosylases; N-acetylglucosaminidase; N-acetylmuramidases; N-acetylmuramoyl-L-alanine; Peptidoglycan hydrolase; Random Forest; Support Vector Machine
Mesh:
Substances:
Year: 2016 PMID: 27229861 PMCID: PMC4882796 DOI: 10.1186/s12864-016-2753-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Sites of action of peptidoglycan hydrolases on bacterial cell wall
Fig. 2Performance of LibSVM modules using linear kernel at different cost parameters for AAC and DPC for binary classification
Comparative performance of LibSVM and RF using Amino Acid and Dipeptide as feature inputs for binary classification
| Sensitivity | Specificity | Accuracy | MCC | |||||
|---|---|---|---|---|---|---|---|---|
| AAC | DPC | AAC | DPC | AAC | DPC | AAC | DPC | |
| SVM | 53.81 | 78.76 | 93.63 | 94.67 | 82.94 | 90.40 | 0.53 | 0.75 |
| RF | 74.14 | 75.08 | 97.79 | 99.48 | 91.44 | 92.93 | 0.77 | 0.82 |
AAC = Amino acid composition and DPC = Dipeptide composition
SVM = Support Vector Machine at t = 0 and c = 1 for both AAC and DPC
RF = Random Forest; for AAC mtry = 4, ntree = 500 and for DPC mtry = 64, ntree = 500
Fig. 3a OOB error using AAC and DPC as feature inputs at different mtry for binary classification. b OOB error on increasing the number of trees at optimized mtry for AAC and DPC for binary classification
Fig. 4Performance of LibSVM modules using linear kernel at different cost parameters for AAC and DPC for multiclass classification
Performance of LibSVM using Amino acid and Dipeptide composition as feature inputs for multiclass classification
| Class | Sensitivity | Specificity | Accuracy | MCC | ||||
|---|---|---|---|---|---|---|---|---|
| AAC | DPC | AAC | DPC | AAC | DPC | AAC | DPC | |
| A | 7.53 | 76.57 | 97.95 | 97.16 | 86.67 | 94.67 | 0.07 | 0.74 |
| B | 0 | 52.45 | 100 | 99.6 | 96.91 | 98.31 | 0 | 0.62 |
| C | 14.2 | 55.88 | 98.36 | 97.55 | 88.94 | 93.47 | 0.19 | 0.6 |
| D | 0 | 32.77 | 100 | 99.81 | 99.37 | 99.4 | 0 | 0.4 |
| E | 98.23 | 96.17 | 15.49 | 75.08 | 76.01 | 90.74 | 0.26 | 0.74 |
SVM = Support Vector Machine at t = 0 and c = 0.7 for both AAC and DPC
Where, A = N-acetylmuramoyl-L-alanine amidases, B = Peptidases, C = Enzymes acting on Peptidoglycan chain, D = Unclassified, and E = Negative Dataset
Fig. 5a OOB error using AAC and DPC as feature inputs at different mtry for multiclass classification. b OOB error on increasing the number of trees at optimized mtry for AAC and DPC for multiclass classification
Performance of Random Forest (RF) final models using Amino acid and Dipeptide composition as feature inputs for multiclass classification
| Class | Sensitivity | Specificity | Accuracy | MCC | ||||
|---|---|---|---|---|---|---|---|---|
| AAC | DPC | AAC | DPC | AAC | DPC | AAC | DPC | |
| A | 62.52 | 71.12 | 98.55 | 99.47 | 93.90 | 95.77 | 0.70 | 0.80 |
| B | 61.10 | 54.91 | 99.95 | 99.98 | 98.62 | 98.45 | 0.77 | 0.73 |
| C | 59.73 | 53.49 | 98.72 | 99.66 | 94.19 | 94.34 | 0.68 | 0.69 |
| D | 40.41 | 38.19 | 99.97 | 99.98 | 99.55 | 99.55 | 0.60 | 0.60 |
| E | 98.99 | 99.93 | 64.50 | 62.92 | 90.14 | 90.18 | 0.73 | 0.74 |
Where, A = N-acetylmuramoyl-L-alanine amidases, B = Peptidases, C = Enzymes acting on Peptidoglycan chain, D = Unclassified, and E = Negative Dataset
Performance of HyPe on independent genomic dataset
| Genome | Sensitivity | MCC |
|---|---|---|
|
| 71.43 | 0.56 |
|
| 77.42 | 0.81 |
|
| 75.86 | 0.77 |
|
| 62.96 | 0.67 |
|
| 84.00 | 0.86 |
|
| 42.86 | 0.51 |
|
| 50.00 | 0.71 |
|
| 68.18 | 0.75 |
|
| 85.71 | 0.74 |
|
| 100.00 | 0.82 |
|
| 82.35 | 0.88 |
|
| 88.89 | 0.94 |
|
| 66.67 | 0.63 |
|
| 92.31 | 0.86 |
|
| 80.00 | 0.77 |
|
| 100.00 | 0.78 |
|
| 78.57 | 0.65 |
|
| 62.50 | 0.59 |
|
| 75.00 | 0.75 |
|
| 77.27 | 0.81 |
|
| 60.00 | 0.63 |
| Vibrio alginolyticus NBRC 15630 | 64.71 | 0.74 |
|
| 70.59 | 0.75 |
|
| 60.00 | 0.67 |
The Specificity and Accuracy was almost 1 for all the above genomes since the number of True Negatives (TN) was very large in number, which used in the denominator for the calculation to Specificity and Accuracy