| Literature DB >> 35888183 |
Zhidong Li1,2, Wei Tang3, Xiong You3, Xilin Hou1,2.
Abstract
Plant leaves, which convert light energy into chemical energy, serve as a major food source on Earth. The decrease in crop yield and quality is caused by plant leaf premature senescence. It is important to detect senescence-associated genes. In this study, we collected 5853 genes from a leaf senescence database and developed a leaf-senescence-associated genes (SAGs) prediction model using the support vector machine (SVM) and XGBoost algorithms. This is the first computational approach for predicting SAGs with the sequence dataset. The SVM-PCA-Kmer-PC-PseAAC model achieved the best performance (F1score = 0.866, accuracy = 0.862 and receiver operating characteristic = 0.922), and based on this model, we developed a SAGs prediction tool called "SAGs_Anno". We identified a total of 1,398,277 SAGs from 3,165,746 gene sequences from 83 species, including 12 lower plants and 71 higher plants. Interestingly, leafy species showed a higher percentage of SAGs, while leafless species showed a lower percentage of SAGs. Finally, we constructed the Leaf SAGs Annotation Platform using these available datasets and the SAGs_Anno tool, which helps users to easily predict, download, and search for plant leaf SAGs of all species. Our study will provide rich resources for plant leaf-senescence-associated genes research.Entities:
Keywords: artificial intelligence; classification; database; leaf senescence; machine learning
Year: 2022 PMID: 35888183 PMCID: PMC9316258 DOI: 10.3390/life12071095
Source DB: PubMed Journal: Life (Basel) ISSN: 2075-1729
The performance of SVM prediction model.
| Methods | Number of Feature | F1score | ACC | AUC |
|---|---|---|---|---|
| SVM-ACC | 27 | 0.811 | 0.767 | 0.721 |
| SVM-Kmer | 400 | 0.858 | 0.857 | 0.912 |
| SVM-PC-PseAAC | 22 | 0.838 | 0.834 | 0.900 |
| SVM-Kmer-ACC | 427 | 0.781 | 0.787 | 0.863 |
| SVM-Kmer-PC-PseAAC | 422 | 0.852 | 0.854 | 0.925 |
| SVM-ACC-PC-PseAAC | 49 | 0.782 | 0.789 | 0.852 |
| SVM-ACC-Kmer-PC-PseAAC | 449 | 0.802 | 0.807 | 0.883 |
The performance of SVM predictive model using PCA method.
| Methods | Number of Feature | F1score | ACC | AUC |
|---|---|---|---|---|
| SVM-PCA-Kmer-ACC | 401 | 0.816 | 0.810 | 0.857 |
| SVM-PCA-Kmer-PC-PseAAC | 410 | 0.866 | 0.862 | 0.922 |
| SVM-PCA-ACC-PC-PseAAC | 46 | 0.799 | 0.797 | 0.847 |
| SVM-PCA-ACC-Kmer-PC-PseAAC | 161 | 0.822 | 0.822 | 0.869 |
The performance of XGBoost predictive model.
| Methods | Number of Feature | F1score | ACC | AUC |
|---|---|---|---|---|
| XGBoost-ACC | 27 | 0.790 | 0.754 | 0.728 |
| XGBoost-Kmer | 400 | 0.860 | 0.852 | 0.916 |
| XGBoost-PC-PseAAC | 22 | 0.840 | 0.835 | 0.901 |
| XGBoost-Kmer-ACC | 427 | 0.863 | 0.854 | 0.923 |
| XGBoost-Kmer-PC-PseAAC | 422 | 0.860 | 0.853 | 0.928 |
| XGBoost-ACC-PC-PseAAC | 49 | 0.850 | 0.844 | 0.909 |
| XGBoost-ACC-Kmer-PC-PseAAC | 449 | 0.865 | 0.860 | 0.925 |
The performance of XGBoost predictive model using PCA method.
| Methods | Number of Feature | F1score | ACC | AUC |
|---|---|---|---|---|
| XGBoost-PCA-Kmer-ACC | 411 | 0. 842 | 0.832 | 0.900 |
| XGBoost-PCA-Kmer-PC-PseAAC | 212 | 0.855 | 0.846 | 0.919 |
| XGBoost-PCA-ACC-PC-PseAAC | 46 | 0.839 | 0.829 | 0.894 |
| XGBoost-PCA-ACC-Kmer-PC-PseAAC | 425 | 0.844 | 0.832 | 0.900 |
Figure 1The homepage of the LSAP database.
Figure 2The function of plant leaf SAGs prediction.