| Literature DB >> 35846135 |
Xiaoye Jin1, Zheng Ren1, Hongling Zhang1, Qiyan Wang1, Yubo Liu1, Jingyan Ji1, Jiang Huang1.
Abstract
Aging is usually accompanied by the decline of physiological function and dysfunction of cellular processes. Genetic markers related to aging not only reveal the biological mechanism of aging but also provide age information in forensic research. In this study, we aimed to screen age-associated mRNAs based on the previously reported genome-wide expression data. In addition, predicted models for age estimations were built by three machine learning methods. We identified 283 differentially expressed mRNAs between two groups with different age ranges. Nine mRNAs out of 283 mRNAs showed different expression patterns between smokers and non-smokers and were eliminated from the following analysis. Age-associated mRNAs were further screened from the remaining mRNAs by the cross-validation error analysis of random forest. Finally, 14 mRNAs were chosen to build the model for age predictions. These 14 mRNAs showed relatively high correlations with age. Furthermore, we found that random forest showed the optimal performance for age prediction in comparison to the generalized linear model and support vector machine. To sum up, the 14 age-associated mRNAs identified in this study could be viewed as valuable markers for age estimations and studying the aging process.Entities:
Keywords: aging; forensic age estimation; genetic markers; mRNA; machine learning
Year: 2022 PMID: 35846135 PMCID: PMC9283997 DOI: 10.3389/fgene.2022.924408
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Spearman correlation coefficient of 14 mRNAs with age. Detailed information of 14 mRNAs is given in Supplementary Table S5.
FIGURE 2Scatter plot of the predicted age and actual age by random forest based on 274 (A) and 14 mRNAs (B).
FIGURE 3Scatter plot of the predicted age and actual age by the generalized linear model (A) and support vector machine (B) based on 14 mRNAs.
FIGURE 4Gene ontology analysis for cellular components (A) and biological processes (B) of 14 genes associated with age.