| Literature DB >> 35494853 |
Hemn Barzan Abdalla1, Awder M Ahmed2, Subhi R M Zeebaree3, Ahmed Alkhayyat4, Baha Ihnaini1.
Abstract
Increasing demands for information and the rapid growth of big data have dramatically increased the amount of textual data. In order to obtain useful text information, the classification of texts is considered an imperative task. Accordingly, this article will describe the development of a hybrid optimization algorithm for classifying text. Here, pre-processing was done using the stemming process and stop word removal. Additionally, we performed the extraction of imperative features and the selection of optimal features using the Tanimoto similarity, which estimates the similarity between features and selects the relevant features with higher feature selection accuracy. Following that, a deep residual network trained by the Adam algorithm was utilized for dynamic text classification. Dynamic learning was performed using the proposed Rider invasive weed optimization (RIWO)-based deep residual network along with fuzzy theory. The proposed RIWO algorithm combines invasive weed optimization (IWO) and the Rider optimization algorithm (ROA). These processes are carried out under the MapReduce framework. Our analysis revealed that the proposed RIWO-based deep residual network outperformed other techniques with the highest true positive rate (TPR) of 85%, true negative rate (TNR) of 94%, and accuracy of 88.7%. ©2022 Abdalla et al.Entities:
Keywords: Deep Residual network; Dynamic learning; Fuzzy theory; MapReduce Model; Text classification
Year: 2022 PMID: 35494853 PMCID: PMC9044237 DOI: 10.7717/peerj-cs.937
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
Figure 1Schematic view of text classification from the input big data using proposed RIWO-based Deep Residual Network.
Figure 2Structural design of deep residual network with residual blocks, convolutional (Conv) layers, linear classifier, and average pooling layers for text classification.
Pseudocode of the Adam algorithm.
| Input: |
|---|
| Output: Resulting parameters |
| Require |
| Require |
| Initialize first-moment vector |
| Initialize second-moment vector |
| Initialize time step |
| While |
| Get gradients |
| Update first biased moment estimate using |
| Update second biased moment estimate using |
| Evaluate corrected bias first-moment estimate |
| Evaluate corrected bias second-moment estimate |
| Update parameters |
| End while |
| Return |
Pseudocode of developed RIWO.
| Input: |
|---|
| Output: Leader |
| Begin |
| Initialize solutions set |
| Initialize algorithmic parameter |
| Discover error using |
| While |
| For |
| Update bypass position with |
| Update follower position with |
| Update overtaker position with |
| Update attacker position with |
| Rank riders using error with |
| Choose the rider with minimal error |
| Update steering angle, gear, accelerator, and brake |
| Return |
| End for |
| End while |
| End |
Figure 3Assessment of different techniques comparing with the proposed method by considering Reuter dataset with mapper = 3. (A) TPR. (B) TNR. (C) Accuracy.
Figure 4Assessment of different techniques comparing with the proposed method by considering Reuter dataset with mapper = 4. (A) TPR. (B) TNR. (C) Accuracy.
Figure 5Assessment of different techniques comparing with the proposed method by considering 20 Newsgroup dataset with mapper = 3. (A) TPR. (B) TNR. (C) Accuracy.
Figure 6Assessment of different techniques comparing with the proposed method by considering 20 Newsgroup dataset with mapper = 4. (A) TPR. (B) TNR. (C) Accuracy.
Comparative discussion.
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| 0.800 | 0.812 | 0.818 | 0.818 | 0.818 | 0.819 | 0.819 |
|
|
| 0.846 | 0.850 | 0.865 | 0.869 | 0.878 | 0.885 | 0.896 |
| ||
|
| 0.824 | 0.839 | 0.849 | 0.852 | 0.856 | 0.859 | 0.863 |
| ||
|
|
| 0.810 | 0.820 | 0.824 | 0.824 | 0.825 | 0.826 | 0.826 |
| |
|
| 0.855 | 0.856 | 0.876 | 0.878 | 0.885 | 0.893 | 0.900 |
| ||
|
| 0.833 | 0.849 | 0.852 | 0.857 | 0.859 | 0.863 | 0.868 |
| ||
|
|
|
| 0.796 | 0.815 | 0.818 | 0.822 | 0.825 | 0.828 | 0.829 |
|
|
| 0.833 | 0.840 | 0.849 | 0.851 | 0.855 | 0.852 | 0.854 |
| ||
|
| 0.824 | 0.830 | 0.839 | 0.843 | 0.849 | 0.852 | 0.856 |
| ||
|
|
| 0.810 | 0.827 | 0.836 | 0.840 | 0.843 | 0.846 | 0.849 |
| |
|
| 0.836 | 0.839 | 0.851 | 0.857 | 0.863 | 0.866 | 0.871 |
| ||
|
| 0.824 | 0.838 | 0.849 | 0.852 | 0.858 | 0.863 | 0.868 |
|