| Literature DB >> 36185709 |
Abdullah Alqahtani1, Habib Ullah Khan2, Shtwai Alsubai1, Mohemmed Sha1, Ahmad Almadhor3, Tayyab Iqbal4, Sidra Abbas5.
Abstract
Text categorization is an effective activity that can be accomplished using a variety of classification algorithms. In machine learning, the classifier is built by learning the features of categories from a set of preset training data. Similarly, deep learning offers enormous benefits for text classification since they execute highly accurately with lower-level engineering and processing. This paper employs machine and deep learning techniques to classify textual data. Textual data contains much useless information that must be pre-processed. We clean the data, impute missing values, and eliminate the repeated columns. Next, we employ machine learning algorithms: logistic regression, random forest, K-nearest neighbors (KNN), and deep learning algorithms: long short-term memory (LSTM), artificial neural network (ANN), and gated recurrent unit (GRU) for classification. Results reveal that LSTM achieves 92% accuracy outperforming all other model and baseline studies.Entities:
Keywords: deep learning; machine learning; text categorization; text classification; text data
Year: 2022 PMID: 36185709 PMCID: PMC9521674 DOI: 10.3389/fncom.2022.992296
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 3.387
Figure 1Survived in groups.
Textual data analysis using LSTM.
| |
| |
| 1: |
| 2: |
| 3: |
| 4: |
| 5: |
| 6: |
| 7: |
| 8: |
| 9: |
| 10: |
| 11: |
| 12: |
| 13: |
| 14: |
| 15: |
| 16: |
| 17: |
| 18: |
| 19: |
| 20: |
| 21: |
Figure 2Methodology of the proposed approach.
Results of machine learning and deep learning techniques.
|
|
|
|---|---|
| LSTM | 92 |
| GRU | 80 |
| Logistic regression | 86 |
| Random forest | 82 |
| ANN | 81 |
| KNN | 77 |
Figure 3Accuracy and F1-score of machine learning models.
Figure 4Accuracy and F1-score of deep learning models.
Figure 5Comparison of machine learning and deep learning models.
Figure 6Confusion matrix of LSTM.
Figure 7Accuracy curve of LSTM.
Figure 8Accuracy curve of LSTM with early stopping.
Figure 9Loss curve of LSTM.
Figure 10Loss curve of LSTM with early stopping.