| Literature DB >> 31284398 |
Ashima Kukkar1, Rajni Mohana1, Anand Nayyar2, Jeamin Kim3, Byeong-Gwon Kang4, Naveen Chilamkurti5.
Abstract
The accurate severity classification of a bug report is an important aspect of bug fixing. The bug reports are submitted into the bug tracking system with high speed, and owing to this, bug repository size has been increasing at an enormous rate. This increased bug repository size introduces biases in the bug triage process. Therefore, it is necessary to classify the severity of a bug report to balance the bug triaging process. Previously, many machine learning models were proposed for automation of bug severity classification. The accuracy of these models is not up to the mark because they do not extract the important feature patterns for learning the classifier. This paper proposes a novel deep learning model for multiclass severity classification called Bug Severity classification to address these challenges by using a Convolutional Neural Network and Random forest with Boosting (BCR). This model directly learns the latent and highly representative features. Initially, the natural language techniques preprocess the bug report text, and then n-gram is used to extract the features. Further, the Convolutional Neural Network extracts the important feature patterns of respective severity classes. Lastly, the random forest with boosting classifies the multiple bug severity classes. The average accuracy of the proposed model is 96.34% on multiclass severity of five open source projects. The average F-measures of the proposed BCR and the existing approach were 96.43% and 84.24%, respectively, on binary class severity classification. The results prove that the proposed BCR approach enhances the performance of bug severity classification over the state-of-the-art techniques.Entities:
Keywords: convolutional neural network; deep learning; n-gram; natural language processing; random forest; severity classification; software reliability
Year: 2019 PMID: 31284398 PMCID: PMC6651582 DOI: 10.3390/s19132964
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The basic classes of bug severity.
Figure 2Example of bug reports.
Figure 3Bug Severity classification using a Convolutional Neural Network and Random forest with Boosting (BCR) model: an n-gram as the n-gram feature extractor, CNN as the feature extractor, and random forest with boosting as the classifier.
Figure 4The steps of the BCR model for bug severity classification.
Bug report dataset.
| Projects | Number of Bugs | BTS | Number of Classes | Name of Severity Classes |
|---|---|---|---|---|
| Mozilla | 539 | Bugzilla | 7 | Blocker, Critical, Enhancement, Major, Normal, Minor, Trivial |
| Eclipse | 693 | Bugzilla | 5 | Blocker, Critical, Enhancement, Major, Normal |
| JBoss | 573 | RedhatBugzilla | 5 | High, Low, Medium, Unspecified, Urgent |
| OpenFOAM | 795 | Manits | 8 | Blocker, Crash, Feature, Major, Minor, Text, Trivial, Tweak |
| Firefox | 620 | Bugzilla | 7 | Blocker, Critical, Enhancement, Major, Normal, Minor, Trivial |
Analysis of the proposed BCR model on five projects.
| Projects | Classes | Precision (%) | Recall (%) | F-Measure (%) | Accuracy (%) |
|---|---|---|---|---|---|
| Mozilla | Blocker | 82.90 | 80.23 | 81.54 | 85.45 |
| Critical | 96.43 | 95.34 | 94.34 | 96.33 | |
| Enhancement | 95.33 | 80.22 | 95.45 | 97.43 | |
| Major | 98.45 | 97.45 | 92.34 | 96.30 | |
| Normal | 97.44 | 95.34 | 96.45 | 95.24 | |
| Minor | 94.35 | 96.34 | 97.44 | 96.34 | |
| Trivial | 96.35 | 95.34 | 95.33 | 97.33 | |
| Firefox | Blocker | 86.16 | 83.35 | 84.73 | 83.00 |
| Critical | 97.97 | 92.23 | 95.27 | 97.92 | |
| Enhancement | 98.30 | 92.23 | 95.98 | 97.55 | |
| Major | 97.98 | 97.61 | 96.64 | 97.19 | |
| Normal | 97.28 | 96.90 | 97.64 | 97.53 | |
| Minor | 97.23 | 97.04 | 97.24 | 98.33 | |
| Trivial | 98.43 | 95.68 | 96.52 | 98.85 | |
| Eclipse | Blocker | 80.34 | 81.23 | 83.45 | 83.45 |
| Critical | 97.33 | 89.23 | 96.35 | 98.33 | |
| Enhancement | 98.34 | 90.29 | 95.35 | 98.32 | |
| Major | 99.40 | 97.85 | 95.85 | 97.22 | |
| Normal | 97.35 | 97.29 | 98.40 | 97.24 | |
| Jboss | High | 96.80 | 97.29 | 97.84 | 98.29 |
| Low | 98.27 | 97.00 | 96.75 | 98.93 | |
| Medium | 99.08 | 95.45 | 96.72 | 99.22 | |
| Unspecified | 99.59 | 93.68 | 97.08 | 99.19 | |
| Urgent | 99.59 | 96.37 | 97.76 | 98.82 | |
| OpenFOAM | Blocker | 83.45 | 82.34 | 83.45 | 83.40 |
| Crash | 98.70 | 98.42 | 98.89 | 99.38 | |
| Feature | 99.28 | 97.81 | 98.33 | 98.56 | |
| Major | 99.61 | 98.51 | 97.11 | 99.54 | |
| Minor | 99.06 | 95.29 | 97.02 | 99.28 | |
| Text | 99.29 | 91.21 | 97.30 | 99.77 | |
| Trivial | 99.45 | 95.52 | 97.05 | 99.22 | |
| Tweak | 99.82 | 99.02 | 98.57 | 98.68 |
The average result of all datasets.
| Approach | Result Analysis | ||||
|---|---|---|---|---|---|
| Projects | Precision (%) | Recall (%) | F-Measure (%) | Accuracy (%) | |
| Proposed | Mozilla | 94.47 | 91.46 | 93.27 | 94.92 |
| Eclipse | 96.19 | 93.57 | 94.86 | 95.76 | |
| JBoss | 94.55 | 91.76 | 93.88 | 94.91 | |
| OpenFOAM | 98.67 | 95.96 | 97.23 | 98.89 | |
| Firefox | 97.33 | 94.76 | 95.96 | 97.22 | |
Figure 5Results analysis of the proposed BCR on seven classes of the Mozilla Project.
Figure 6Results analysis of the proposed BCR on seven classes of the Firefox Project.
Figure 7Results analysis of the proposed BCR on seven classes of the Eclipse Project.
Figure 8Results analysis of the proposed BCR on seven classes of the Jboss Project.
Figure 9Result analysis of the proposed BCR on seven classes of the OpenFOAM Project.
Figure 10Comparison of the proposed BCR on all classes and datasets.
The configuration of the BCR model.
| Layer | Operator | Output Height | Output Width |
|---|---|---|---|
| Input | 1000 × 200 | 1000 | 200 |
| Dropout | Rate = 0.2 | ||
| Convolutional | Stride = 1, padding = 0, depth = 128, filter size = 20; activation = sigmoid for first four layers, Tanh for last three layers | 128 | 128 |
| Pooling | Max pooling | ||
| Fully connected layer | Output depth = Random forest |
Comparison of results of five projects based on binary classification.
| Projects | Zhou et al. | Proposed BCR Model | ||||
|---|---|---|---|---|---|---|
| Precision (%) | Recall (%) | F-Measure (%) | Precision (%) | Recall (%) | F-Measure (%) | |
| Mozilla | 82.60 | 82.40 | 81.70 | 98.48 | 98.52 | 98.12 |
| Eclipse | 81.80 | 82.10 | 81.60 | 99.38 | 99.48 | 99.12 |
| JBoss | 93.70 | 93.70 | 93.70 | 98.88 | 98.95 | 98.42 |
| OpenFOAM | 85.30 | 85.30 | 84.70 | 95.25 | 95.35 | 94.55 |
| Firefox | 80.30 | 80.50 | 79.50 | 92.75 | 92.95 | 91.95 |
Figure 11Comparison of the proposed and existing bug severity classifier on binary classes.