| Literature DB >> 35202404 |
Zhongzhong Guo1,2,3, Shangqi Yu1,2,3, Jiazhi Fu2,3,4, Kai Ma5, Rui Zhang2,3,4.
Abstract
The deep neural network is used to establish a neural network model to solve the problems of low accuracy and poor accuracy of traditional algorithms in screening differentially expressed genes and function prediction during the walnut endocarp hardening stage. The paper walnut is used as the research object to analyze the biological information of paper walnut. The changes of lignin deposition during endocarp hardening from 50 days to 90 days are observed by microscope. Then, the Convolutional Neural Network (CNN) and Long and Short-term Memory (LSTM) network model are adopted to construct an expression gene screening and function prediction model. Then, the transcriptome and proteome sequencing and biological information of walnut endocarp samples at 50, 57, 78, and 90 days after flowering are analyzed and taken as the training data set of the CNN + LSTM model. The experimental results demonstrate that the endocarp of paper walnut began to harden at 57 days, and the endocarp tissue on the hardened inner side also began to stain. This indicates that the endocarp hardened laterally from outside to inside. The screening and prediction results show that the CNN + LSTM model's highest accuracy can reach 0.9264. The Accuracy, Precision, Recall, and F1-score of the CNN + LSTM model are better than the traditional machine learning algorithm. Moreover, the Receiver Operating Curve (ROC) area enclosed by the CNN + LSTM model and coordinate axis is the largest, and the Area Under Curve (AUC) value is 0.9796. The comparison of ROC and AUC proves that the CNN + LSTM model is better than the traditional algorithm for screening differentially expressed genes and function prediction in the walnut endocarp hardening stage. Using deep learning to predict expressed genes' function accurately can reduce the breeding cost and significantly improve the yield and quality of crops. This research provides scientific guidance for the scientific breeding of paper walnut.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35202404 PMCID: PMC8870417 DOI: 10.1371/journal.pone.0263755
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Comparison between deep learning and shallow learning.
Fig 2Definition to deep learning.
Fig 3Structure of CNN.
Fig 4Composition of CNN.
Fig 5Schematic diagram of LSTM network structure.
Fig 6Curves of three activation functions.
Fig 7Four-bit one-hot encoding of the RNA sequence.
Fig 8Algorithm flow of the CNN + LSTM model.
Fig 9Data sets of the model (a) mRNA; (b) protein.
Fig 10ROC curve of the model.
Fig 11Changes of lignin deposition in the thin-shell walnut during endocarp hardening.
Fig 12Confusion matrix of the model.
Fig 13Results of the five-fold cross-validation of DEG screening and prediction (a: Accuracy; b: Loss).
Fig 14Comparison of results of different algorithms.
Fig 15ROC curves of CNN + LSTM model and traditional algorithms.