| Literature DB >> 33293615 |
Tobias Hegelund Olsen1, Betül Yesiltas2, Frederikke Isa Marin1, Margarita Pertseva1, Pedro J García-Moreno2, Simon Gregersen3, Michael Toft Overgaard3, Charlotte Jacobsen2, Ole Lund2, Egon Bech Hansen2, Paolo Marcatili4.
Abstract
Dietary antioxidants are an important preservative in food and have been suggested to help in disease prevention. With consumer demands for less synthetic and safer additives in food products, the food industry is searching for antioxidants that can be marketed as natural. Peptides derived from natural proteins show promise, as they are generally regarded as safe and potentially contain other beneficial bioactivities. Antioxidative peptides are usually obtained by testing various peptides derived from hydrolysis of proteins by a selection of proteases. This slow and cumbersome trial-and-error approach to identify antioxidative peptides has increased interest in developing computational approaches for prediction of antioxidant activity and thereby reduce laboratory work. A few antioxidant predictors exist, however, no tool predicting the antioxidative properties of peptides is, to the best of our knowledge, currently available as a web-server. We here present the AnOxPePred tool and web-server ( http://services.bioinformatics.dtu.dk/service.php?AnOxPePred-1.0 ) that uses deep learning to predict the antioxidant properties of peptides. Our model was trained on a curated dataset consisting of experimentally-tested antioxidant and non-antioxidant peptides. For a variety of metrics our method displays a prediction performance better than a k-NN sequence identity-based approach. Furthermore, the developed tool will be a good benchmark for future predictors of antioxidant peptides.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33293615 PMCID: PMC7722737 DOI: 10.1038/s41598-020-78319-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Overview over the benchmark dataset.
| FRS | CheL | FRS/CheL | Non-AO | Random | Total | |
|---|---|---|---|---|---|---|
| AOdb | 615 | 11 | 70 | 218 | 500 | 1414 |
| aodb < 90% | 606 | 11 | 70 | 217 | 500 | 1404 |
FRS, CHEL, FRS/CHEL and NON-AO are all experimentally-validated peptides obtained from various papers. RANDOM consists of peptides derived from the UniProt[46] database, with lengths between 2–30 amino acids. AODB < 90% is the number of peptides after removal of sequences, so no pair has more than 90% identity. FRS free radical scavenger, CHEL chelator, FRS/CHEL both FRS and chelator, NON-AO non-antioxidant.
Figure 1Overview of AnOxPePred’s architecture. Input sequences (A) enters the Conv1 module (B) which extracts a set of features. The extracted features are then flattened before entering the Ff1 module (C). Here the features are used to predict the final output of FRS and chelating properties (D). FRS free radical scavenger.
Overview over the 10 new experimentally tested peptide sequences, ranked according to the FRS score predicted by our model (from largest to smallest) and their IC50.
| Peptide sequence | Predicted FRS score | IC50 (mg/ml) | |
|---|---|---|---|
| 0.64 | 16.32 ± 1.72 | ||
| 0.59 | 2.24 ± 0.11 | ||
| 0.55 | 7.03 ± 1.25 | ||
| 0.54 | 6.73 ± 2.42 | ||
| EHHNSPGYYDG | 0.53 | 90.83 ± 5.48 | |
| 0.53 | 14.13 ± 0.93 | ||
| ENNRPFAAANEIVPFYFEHGPHIFNS | 0.52 | 38.87 ± 6.53 | |
| 0.52 | 15.37 ± 5.15 | ||
| QSDSDYSSSGPLGVPDPSDLL | 0.51 | 37.00 ± 6.62 | |
| 0.50 | 5.47 ± 2.20 | ||
| 9 ± 2.30 |
A low IC50 is evidence for high scavenging activity. Rows are coloured according to their experimental FRS activity (obtained as described in methods) compared to Sodium Caseinate, a known antioxidant. Peptides in bold have a higher activity than Sodium Caseinate, and underlined peptides have a similar (less than twice IC50) activity.
Figure 2Overview of the properties and length of peptides in the benchmark dataset. FRS free radical scavenger.
Figure 3The difference in composition between the antioxidant dataset and a baseline (the average amino acid composition in the UniProtKB/Swiss-Prot data bank). Significant differences (P value < 0.05) was determined by applying a one sample test of proportions[59] for each amino acid and was marked with an asterisk (*). FRS Free Radical Scavenger.
Figure 4Overview of the effects of partitioning the data with different thresholds. Plotted is the performance of AnOxPePred and k-NN represented by the MCC of FRS, number of clusters (i.e. the number of groups of peptides based on the specified threshold) and the Gini coefficient based on the distribution of FRS peptides in each fold. MCC Matthew’s correlation coefficient, FRS free radical scavenger.
Figure 5Performance comparison of AnOxPePred and k-NN for both FRS and chelating properties using the metrics AUC, F1 score and MCC. AUC area under the curve, MCC Matthew’s correlation coefficient, FRS free radical scavenger.