
Computer-assisted Medical Decision-making System for Diagnosis of Urticaria.

Jabez J Christopher1,2,3, Harichandran Khanna Nehemiah1,2,3, Kannan Arputharaj1,2,3, George L Moses1,2,3.   

Abstract

Background: Urticaria is a common allergic disease that affects all age groups. Allergic disorders are diagnosed at allergy testing centers using skin tests. Though skin tests are the gold standard tests for allergy diagnosis, specialists are required to interpret the observations and test results. Hence, a computer-assisted medical decision-making (CMD) system can be used as an aid for decision support, by junior clinicians, in order to diagnose the presence of urticaria.
Methods: The data from intradermal skin test results of 778 patients, who exhibited allergic symptoms, are considered for this study. Based on food habits and the history of a patient, 40 relevant allergens are tested. Allergen extracts are used for skin test. Ten independent runs of 10-fold cross-validation are used to train the system. The performance of the CMD system is evaluated using a set of test samples. The test samples were also presented to the junior clinicians at the allergy testing center to diagnose the presence or absence of urticaria.
Results: From a set of 91 features, a subset of 41 relevant features is chosen based on the relevance score of the feature selection algorithm. The Bayes classification approach achieves a classification accuracy of 96.92% over the test samples. The junior clinicians were able to classify the test samples with an average accuracy of 75.68%.
Conclusion: A probabilistic classification approach is used for identifying the presence or absence of urticaria based on intradermal skin test results. In the absence of an allergy specialist, the CMD system assists junior clinicians in clinical decision making.


Keywords:  Bayesian statistical methods; allergy and immunology; computer-assisted diagnosis; decision aids

Year:  2016        PMID: 30288410      PMCID: PMC6125052          DOI: 10.1177/2381468316677752

Source DB:  PubMed          Journal:  MDM Policy Pract        ISSN: 2381-4683


Introduction

Allergy is considered to be an abnormal reaction of the body to a previously encountered allergen or trigger introduced by inhalation, ingestion, injection, or skin contact among atopic people. The symptoms of an allergic disorder are often manifested by itchy eyes, running nose, nasal discharge, coughing, shortness of breath, wheezing, itching, and rashes.[1] Around 300 million people worldwide have allergic disorders, and approximately 50% of them live in developing countries.[2] According to the World Allergy Organization, in India, more than 30% of the population is known to suffer from an allergic ailment. In a study by Kumar and others, out of 1860 patients screened, 1097 (58.9%) gave history of food allergy.[3] Allergy symptoms and their manifestations have a profound impact on the quality of life. Allergic diseases with explicit symptoms often hold back the daily activities of people and affect their personal and professional tasks.[4] Allergic diseases are rising all over the world, and some commonly known allergies include asthma, rhinitis, anaphylaxis, nasobronchial allergy, eczema, urticaria, and angioedema. Urticaria is a heterogeneous group of diseases. It is characterized by the appearance of a wheal, which may consist of the following three features: a central swelling that varies in size; an associated itching or burning sensation; and discomforts over a fleeting duration of usually 1 to 24 hours.[5] Urticaria is the fourth most prevalent allergic disease after rhinitis, asthma, and drug allergy.[6] Urticaria has a strong impact on school performance and is also the cause of the highest number of absence from work. Though urticaria seems to be an allergic reaction, the disease is autoimmune and idiopathic. Around 15% to 20% of people have urticaria at least once in their lifetime.[4] Due to the heterogeneity of the disease, the etiology remains unexplained. 
Hence, the diagnosis and treatment of urticaria is still a challenge to physicians and allergists. Urticaria is classified based on the duration of its physical manifestations. Broadly, spontaneous urticaria and physical urticaria are the two well-defined classes of urticaria. A short description about classification of urticaria is presented in Table 1.[7]
Table 1

Urticaria Classifications by Group and Subgroup

| Urticaria Group | Subgroup | Characteristics and Eliciting Factors |
|---|---|---|
| Spontaneous | Acute | Spontaneous wheals, <6 weeks |
| Spontaneous | Chronic | Spontaneous wheals, >6 weeks |
| Physical | Acquired cold | Cold air, wind, food, objects |
| Physical | Delayed pressure | Vertical pressure |
| Physical | Heat | Localized heat |
| Physical | Solar | Ultraviolet and/or visible light |
| Physical | Dermographic | Mechanical shearing force |
| Physical | Vibratory | Vibratory forces (e.g., pneumatic hammer) |
| Other disorders | Aquagenic | Water |
| Other disorders | Cholinergic | Increase of body temperature |
| Other disorders | Contact | Contact with urticariogenic substance |
| Other disorders | Exercise induced | Physical exercise |
Acute urticaria resolves within 6 weeks, whereas chronic urticaria lasts longer. Acute urticaria is more common in young adults and children. Acute allergic symptoms may be due to release of mediators from mast cells, whereas chronic symptoms may be due to eosinophil-mediated tissue damage.[1,8] Generally, in patients suffering from urticaria, a trigger causes the skin cells to release chemicals such as histamine. These chemicals cause fluids to leak from tiny blood vessels under the skin surface. The fluid accumulates and manifests in the form of wheals. The chemicals also cause the blood vessels to dilate, which produces the flare around the wheals. When a trigger induces allergic symptoms, an allergy evaluation may be sought to identify the potential trigger. Other causes of acute urticaria include sea food, allergy to insects, environment, and transfusion reaction. If the symptoms persist for more than 6 weeks, the condition is classified as chronic urticaria; 20% to 30% of acute urticaria turns out to be chronic urticaria.[9]

Intradermal skin tests are used to find the allergens that trigger allergic symptoms. Skin tests specifically help diagnose immunoglobulin E–mediated hypersensitivity. In patients who have symptoms of urticaria and suspect an allergy to particular food items or aero allergens, skin prick tests can be used to identify the potential allergens. Several factors and issues have to be considered while conducting a skin test. While measuring and recording the response of a skin test, the following issues need to be considered: time to measure response, making a permanent record of the response, and measurement and grading of the response. For interpretation of the response, the following issues need to be considered: proficiency of the test, analytical performance, reactivity versus sensitivity, and criteria for a positive response.
Furthermore, there are internal and external variables that influence the skin test results: site of injection, distance between injection sites, time (season) of testing, age, race, gender, socioeconomic status, tobacco smoke exposure, and medication.[8] Hence, skin testing of a specific immunoglobulin E is not easily accessible and cannot be interpreted precisely by junior physicians and immunologists. But quality control measures and proper performance of skin testing are very important to produce correct results. Timely identification of allergens is important as it may reduce the impact and manifestation of symptoms. Though intradermal skin test is an effective and efficient way to identify allergic triggers, allergy specialists or experienced immunologists are required to suggest remedial measures based on the observations of the test. Clinicians and immunologists at allergy clinics and specialty centers have to make decisions on whether a diagnosed disease is due to allergic triggers or other factors. Furthermore, they have to provide recommendations on what kind of ingestants, inhalants, and contactants to avoid and other treatment options if necessary. These clinical decisions depend on the patient’s history, food habits, environment, and the results of the skin tests. The general patterns and knowledge models devised by clinical experts should be made available to medical trainees, immunologists, and junior clinicians through computer-aided clinical decision support systems. This raises the performance and confidence of physicians in dealing with more difficult and ambiguous cases. Over the past decade, due to the availability of vast medical data, computer-assisted medical decision-making (CMD) systems are widely used at clinics and health centers to provide decisions and solutions. 
In most situations, a CMD system cannot be considered to be a gold standard, but it can be used by junior clinicians in the absence of experts to verify and assert their decisions. Computer-assisted systems are used for diagnosis, decision making, and decision support in various medical applications such as cancer care,[10,11] heart disease diagnosis,[12] thrombosis diagnosis,[13] diagnosis and treatment of lung disorders,[14,15] drug reaction analysis,[16] and allergy diagnosis.[17] A CMD system gets medical data (e.g., patient description) as input, processes the data, extracts useful knowledge from the data, and finally makes decisions or predictions.[18] The core tasks of CMD systems are often based on typical data mining tasks such as data cleaning, normalization, data reduction, association analysis, classification, and clustering. The CMD system presented in this work assists junior clinicians at allergy centers in diagnosing patients with urticaria. The diagnostic result obtained from the system indicates whether a patient shows positive or negative symptoms of acute urticaria. The system helps the clinician decide whether the reported disease is acute (triggered by allergens) or not, based on the results of the intradermal skin tests, in the absence of an expert immunologist.

Methods

The proposed CMD system framework consists of a feature selector, a classifier evaluator, a Bayes classifier, and a performance evaluator (see Figure 1).
Figure 1

CMD system framework (IDST = intradermal skin test).


Feature Selector

Feature selection (attribute reduction) is a data preprocessing technique whereby the dimensionality of the data is reduced. Removal of irrelevant and redundant features enhances the efficiency of the classifier. In this work, the feature selector uses an instance-based learning approach[19] for selecting relevant features. For a given dataset S, let S_train be the training set and S_test the testing set. Let |S_train| denote the sample size of S_train and τ be the relevance threshold ranging from 0 to 1. Consider X and Y to be two instances (samples), whose corresponding nominal values for the kth attribute are x and y. Then, the difference between the nominal values of x and y is given by

$$\mathrm{diff}(x, y) = \begin{cases} 0 & \text{if } x = y \\ 1 & \text{otherwise} \end{cases}$$

The CMD system uses the RELIEF algorithm,[20] presented in Figure 2, to select a set of relevant features for training the classifier.
Figure 2

RELIEF algorithm.

The algorithm chooses an instance X, a near-hit instance of X, and a near-miss instance of X. A near-hit instance is an instance that is in the neighborhood of X and belongs to the same class as X. A near-miss instance also belongs to the same neighborhood but belongs to a different class. The feature weight vector W is updated for each feature. The algorithm chooses the features whose weight (relevance) satisfies the relevance threshold (τ).
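The near-hit/near-miss update described above can be sketched in a few lines. This is a minimal illustration for nominal data, not the authors' implementation: the function names `diff` and `relief`, the number of sampled instances `m`, the mismatch-count distance, and the rule "keep a feature when its weight is at least τ" are all assumptions made for the example.

```python
import random


def diff(x, y):
    """Difference between two nominal values: 0 if equal, 1 otherwise."""
    return 0 if x == y else 1


def relief(samples, labels, m, tau, seed=0):
    """Return (weights, selected feature indices) for nominal data.

    Assumes every class has at least two samples so a near hit exists.
    """
    rng = random.Random(seed)
    n_features = len(samples[0])
    weights = [0.0] * n_features

    def distance(a, b):
        # Number of attributes on which two instances disagree (assumed metric).
        return sum(diff(u, v) for u, v in zip(a, b))

    for _ in range(m):
        i = rng.randrange(len(samples))
        x, cls = samples[i], labels[i]
        hits = [s for j, s in enumerate(samples) if j != i and labels[j] == cls]
        misses = [s for j, s in enumerate(samples) if labels[j] != cls]
        near_hit = min(hits, key=lambda s: distance(x, s))
        near_miss = min(misses, key=lambda s: distance(x, s))
        for k in range(n_features):
            # Reward features that differ across classes and penalise
            # features that differ within the same class.
            weights[k] += (diff(x[k], near_miss[k]) - diff(x[k], near_hit[k])) / m

    selected = [k for k, w in enumerate(weights) if w >= tau]
    return weights, selected


# Toy data (hypothetical): feature 0 follows the class, feature 1 does not.
samples = [("R", "NR"), ("R", "R"), ("NR", "NR"), ("NR", "R")]
labels = ["positive", "positive", "negative", "negative"]
weights, selected = relief(samples, labels, m=4, tau=0.01)
```

On this toy set the class-aligned feature receives a positive weight and the other a negative one, so only feature 0 passes the threshold.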

Classifier Evaluator

Classification is a typical data mining task and also the core of a decision-making or decision-support system. The inducer (learning algorithm) constructs a classifier model (knowledge model) from a set of class-labelled training samples. The classifier assigns a class label to an unknown instance (test sample) based on the classifier model. Classification approaches differ by the algorithm used for induction and also the knowledge representation model. For example, an associative classifier uses the Apriori approach for rule induction and an IF-THEN rule format for representing the classifier model. The multi-layer perceptron classifier uses the gradient descent–based backpropagation algorithm for induction (training), and the trained network constitutes the knowledge model. Each classifier has its own pros and cons; hence, no classifier can be considered as the “universal best” for all applications and domains. The classifier evaluator is used to choose a suitable classifier for this CMD system for the diagnosis of allergic disorders. The evaluator uses k-fold cross-validation, which is an appropriate method for an unbiased evaluation of classifiers.[21] Cross-validation with k folds is a technique whereby the preprocessed S_train data are randomly split into k folds of approximately equal size. The classifier (model) is trained and tested k times. Each time, (k − 1) folds are used for training and the remaining fold is used for testing.
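The k-fold procedure can be illustrated with a short sketch. The helper names `k_fold_indices` and `cross_validate`, the seed handling, and the callback `train_and_score` are hypothetical; the paper does not describe its tooling, so this only shows the splitting logic.

```python
import random


def k_fold_indices(n_samples, k, seed):
    """Randomly split sample indices into k folds of near-equal size."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cross_validate(n_samples, k, train_and_score, seed=0):
    """Train on k-1 folds and test on the held-out fold, k times."""
    folds = k_fold_indices(n_samples, k, seed)
    scores = []
    for i, test_idx in enumerate(folds):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        scores.append(train_and_score(train_idx, test_idx))
    return scores


# 518 training samples and k = 10, as in the study; the scoring callback
# here is a placeholder that only reports the size of the test fold.
folds = k_fold_indices(518, 10, seed=1)
fold_sizes = cross_validate(518, 10, lambda train, test: len(test), seed=1)
```

Passing a different `seed` for each run gives independent runs with different samples in each fold, while a fixed seed keeps the partitions identical across the classifiers being compared.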

Naïve Bayes Classifier

The CMD system developed in this work uses a probabilistic approach for classification.[22] Consider an instance $X = (x_1, x_2, x_3, \ldots, x_n)$, where $x_1, x_2, x_3, \ldots, x_n$ are the values for features $f_1, f_2, f_3, \ldots, f_n$, respectively, and $c$ is the class label that can either be positive or negative. The probability of an instance X being in class c is

$$P(c \mid X) = \frac{P(X \mid c)\,P(c)}{P(X)}$$

X is classified as positive class if

$$P(c = \text{positive} \mid X) > P(c = \text{negative} \mid X)$$

The features (allergens) are independent of each other for a given class. Hence,

$$P(X \mid c) = \prod_{i=1}^{n} P(x_i \mid c)$$

where

$$c_{NB} = \arg\max_{c} \; P(c) \prod_{i=1}^{n} P(x_i \mid c)$$

is called the naïve Bayesian classifier.[21]

Performance Evaluator

The performance of the CMD system primarily depends on the classification efficiency of the classifier. The performance evaluator assesses the classification efficiency using four evaluation measures presented in Equations (2) to (5). The four measures, namely, Precision, Sensitivity, Specificity, and Accuracy, differ in their criterion of evaluation. Precision evaluates the agreement of the class label with the positive labels predicted by the classifier. Sensitivity is used to evaluate the effectiveness of a classifier to identify positive labels, whereas Specificity evaluates how effectively a classifier identifies negative labels. Accuracy evaluates the overall classification efficiency of the classifier. Table 2 presents the confusion matrix. True positives (tp) refer to those samples that are positive and correctly diagnosed as positive (patient has urticaria and is allergic). Likewise, true negatives (tn) refer to those samples that are negative and correctly diagnosed as negative (patient does not have urticaria). False positives (fp) are those samples that are diagnosed as positive by the system/clinician but are actually negative as per the expert’s diagnosis (gold standard). False negatives (fn) are those samples that are affected with urticaria but diagnosed as negative by the system/clinician.
Table 2

Confusion Matrix

| | Condition positive | Condition negative |
|---|---|---|
| Test positive | tp | fp |
| Test negative | fn | tn |
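The four measures follow directly from the confusion matrix. The sketch below uses the standard definitions, which match the verbal descriptions above; the function name `evaluate` is hypothetical, and the counts are illustrative assumptions because the paper does not report raw counts (these particular values happen to agree, to rounding, with the Clinician 1 column of Table 6).

```python
def evaluate(tp, fp, fn, tn):
    """Compute the four evaluation measures from confusion-matrix counts."""
    return {
        "precision": tp / (tp + fp),      # correct among predicted positives
        "sensitivity": tp / (tp + fn),    # correct among actual positives
        "specificity": tn / (tn + fp),    # correct among actual negatives
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }


# Illustrative counts only (assumed, not taken from the paper).
measures = evaluate(tp=23, fp=11, fn=23, tn=47)
```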

Data Set Description

Intradermal skin test data were collected from 778 patients who visited the Good Samaritan Lab and Allergy Centre, Chennai, Tamilnadu, India, between 1 March and 20 June 2013. The patients were referred by ENT surgeons and general physicians because of skin diseases, itching, or other plausible allergic symptoms. A total of 365 males and 413 females, of all age groups, were included in the study.

Intradermal Test Method

After analyzing the medical history of a patient, the allergist determines whether skin testing is appropriate for the patient. The allergist also determines the list of selected allergens to be tested. Allergen extracts, negative controls (saline), and positive controls (histamine) were used for performing the skin tests. The upper half of the volar surface of the forearm was selected for the test. It was cleansed with alcohol, and a pen was used to label the area in a grid-like pattern to mark where each extract (allergen) was to be applied. About 0.01 mL of the allergen was injected into the epidermis using a sterile, disposable, plastic 1-mL tuberculin syringe. Patients were asked to stop taking antihistamines and anti-allergic drugs and medications before testing. Table 3 lists the medication to be avoided before allergy testing.
Table 3

Medication to Be Avoided Before Allergy Testing

| Medication | Duration (Days) |
|---|---|
| First-generation antihistamines | 2–3 |
| Nonsedating antihistamines | 7 |
| Tricyclic antidepressants | 7–14 |
| Benzodiazepines | 7–14 |
| Topical corticosteroids | 14–21 |

Consecutive observations, on an hourly basis, were taken. A positive reaction is depicted by a wheal and a flare reaction.[8,23] A negative response to a skin test usually indicates that the patient is not sensitive to that allergen. For patients who reported a delayed response to the test, the reactions were incorporated in the results.

Results

The raw data obtained from the intradermal skin test results were split into training data (S_train) and testing data (S_test) using a holdout approach.[24] Out of 778 samples, 518 samples were used for training (|S_train| = 518) and the remaining 260 were used for testing. S_train has 92 attributes (features), including the class attribute. The next section presents a worked-out example, and then the experimental results are presented. The example uses the table titled Set of Sample Instances (R = reactive; NR = not reactive; 0 = not tested/not associated).

Worked-Out Example

Let us consider the last two instances as test samples and the rest as training samples. The number of samples in each class, corresponding to each attribute value, is presented below. The prior probabilities for each class are computed from the training samples as follows:

P(CLASS = POSITIVE) = 4/10 = 0.4
P(CLASS = NEGATIVE) = 6/10 = 0.6

Consider the test instance (X1), which is instance 11: Cotton dust = R, Wheat = NR, Chicken = R, Prawn = R, Brinjal = R, Carrot = NR, Dhal = NR, Sneezing = NO, Itching = YES, Swelling = YES. The conditional probabilities are computed as follows:

P(Cotton dust = R | CLASS = POSITIVE) = 4/4 = 1.00
P(Cotton dust = R | CLASS = NEGATIVE) = 4/6 = 0.66
P(Wheat = NR | CLASS = POSITIVE) = 2/4 = 0.50
P(Wheat = NR | CLASS = NEGATIVE) = 3/6 = 0.50
P(Chicken = R | CLASS = POSITIVE) = 2/4 = 0.50
P(Chicken = R | CLASS = NEGATIVE) = 1/6 = 0.16
P(Prawn = R | CLASS = POSITIVE) = 2/4 = 0.50
P(Prawn = R | CLASS = NEGATIVE) = 1/6 = 0.16
P(Brinjal = R | CLASS = POSITIVE) = 3/4 = 0.75
P(Brinjal = R | CLASS = NEGATIVE) = 1/6 = 0.16
P(Carrot = NR | CLASS = POSITIVE) = 4/4 = 1.00
P(Carrot = NR | CLASS = NEGATIVE) = 4/6 = 0.66
P(Dhal = NR | CLASS = POSITIVE) = 2/4 = 0.50
P(Dhal = NR | CLASS = NEGATIVE) = 3/6 = 0.50
P(Sneezing = NO | CLASS = POSITIVE) = 3/4 = 0.75
P(Sneezing = NO | CLASS = NEGATIVE) = 3/6 = 0.50
P(Itching = YES | CLASS = POSITIVE) = 4/4 = 1.00
P(Itching = YES | CLASS = NEGATIVE) = 1/6 = 0.16
P(Swelling = YES | CLASS = POSITIVE) = 4/4 = 1.00
P(Swelling = YES | CLASS = NEGATIVE) = 1/6 = 0.16

P(X1 | CLASS = POSITIVE) = P(Cotton dust = R | CLASS = POSITIVE) × P(Wheat = NR | CLASS = POSITIVE) × P(Chicken = R | CLASS = POSITIVE) × P(Prawn = R | CLASS = POSITIVE) × P(Brinjal = R | CLASS = POSITIVE) × P(Carrot = NR | CLASS = POSITIVE) × P(Dhal = NR | CLASS = POSITIVE) × P(Sneezing = NO | CLASS = POSITIVE) × P(Itching = YES | CLASS = POSITIVE) × P(Swelling = YES | CLASS = POSITIVE)
= 1.0 × 0.5 × 0.5 × 0.5 × 0.75 × 1.0 × 0.5 × 0.75 × 1.0 × 1.0 = 0.0351

P(X1 | CLASS = NEGATIVE) = P(Cotton dust = R | CLASS = NEGATIVE) × P(Wheat = NR | CLASS = NEGATIVE) × P(Chicken = R | CLASS = NEGATIVE) × P(Prawn = R | CLASS = NEGATIVE) × P(Brinjal = R | CLASS = NEGATIVE) × P(Carrot = NR | CLASS = NEGATIVE) × P(Dhal = NR | CLASS = NEGATIVE) × P(Sneezing = NO | CLASS = NEGATIVE) × P(Itching = YES | CLASS = NEGATIVE) × P(Swelling = YES | CLASS = NEGATIVE)
= 0.66 × 0.5 × 0.16 × 0.16 × 0.16 × 0.66 × 0.50 × 0.50 × 0.16 × 0.16 = 0.0000057

To find the class that maximizes P(X1 | CLASS) × P(CLASS), the following is computed:

P(X1 | CLASS = POSITIVE) × P(CLASS = POSITIVE) = 0.0351 × 0.4 = 0.0140
P(X1 | CLASS = NEGATIVE) × P(CLASS = NEGATIVE) = 0.0000057 × 0.6 = 0.0000034

Therefore, the naïve Bayesian classifier classifies instance X1 as CLASS = POSITIVE. Hence, the test instance X1 is diagnosed as positive for acute/allergic urticaria.
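The hand computation can be checked with a small script. This is a sketch under stated assumptions: the ten training rows are one reading of the flattened Set of Sample Instances table (instances 1 to 10), the names `TRAIN`, `X1`, and `score` are hypothetical, and exact fractions are used instead of two-decimal values, so the negative-class score differs slightly from the rounded figure in the text.

```python
from fractions import Fraction

# Columns: Cotton dust, Wheat, Chicken, Prawn, Brinjal, Carrot, Dhal,
# Sneezing, Itching, Swelling. Rows are instances 1-10 (assumed parse).
TRAIN = [
    (("R", "NR", "NR", "NR", "R", "NR", "NR", "No", "Yes", "Yes"), "Positive"),
    (("R", "0", "0", "0", "NR", "NR", "NR", "No", "Yes", "Yes"), "Negative"),
    (("NR", "R", "R", "R", "NR", "NR", "NR", "Yes", "No", "No"), "Negative"),
    (("R", "R", "R", "NR", "NR", "NR", "R", "No", "Yes", "Yes"), "Positive"),
    (("R", "0", "NR", "R", "R", "NR", "NR", "No", "Yes", "Yes"), "Positive"),
    (("R", "NR", "R", "R", "R", "NR", "0", "Yes", "Yes", "Yes"), "Positive"),
    (("R", "NR", "NR", "0", "NR", "R", "0", "No", "No", "No"), "Negative"),
    (("R", "NR", "0", "0", "R", "NR", "NR", "Yes", "No", "No"), "Negative"),
    (("R", "R", "0", "0", "NR", "NR", "R", "Yes", "No", "No"), "Negative"),
    (("NR", "NR", "NR", "NR", "NR", "0", "R", "No", "No", "No"), "Negative"),
]
# Test instance X1 (instance 11).
X1 = ("R", "NR", "R", "R", "R", "NR", "NR", "No", "Yes", "Yes")


def score(x, cls, rows):
    """Return P(cls) * product over features of P(x_k | cls), no smoothing."""
    members = [values for values, c in rows if c == cls]
    result = Fraction(len(members), len(rows))  # prior P(cls)
    for k, value in enumerate(x):
        count = sum(1 for values in members if values[k] == value)
        result *= Fraction(count, len(members))  # conditional P(x_k | cls)
    return result


scores = {c: score(X1, c, TRAIN) for c in ("Positive", "Negative")}
prediction = max(scores, key=scores.get)
```

With exact fractions the positive score is 9/640 (about 0.0141) and the negative score is 1/233280 (about 0.0000043), so the decision is the same as in the hand computation.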

Experimental Results

Table 4 shows the complete list of features that consists of a list of attributes that include the allergens, allergic symptoms, physical attributes, and the class label.
Table 4

List of Allergens, Allergic Symptoms, and Patient Details

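Inhalants, Contactants, and Ingestants (Allergens)

| No. | Feature | No. | Feature | No. | Feature | No. | Feature | No. | Feature |
|---|---|---|---|---|---|---|---|---|---|
| 1 | House dust | 21 | Fish 1[a] | 41 | Avaraikai (Broad beans) | 61 | Gram[a] | 81 | Running nose |
| 2 | Cotton dust | 22 | Fish 2[a] | 42 | Kovaikai (Coccinia grandis) | 62 | Channa | 82 | Sneeze |
| 3 | Aspergillus | 23 | Crab | 43 | Kothavarai (Cluster beans) | 63 | Dhal | 83 | Cough |
| 4 | Pollen | 24 | Prawns | 44 | Lady’s finger | 64 | Maida | 84 | Wheezing |
| 5 | Parthenium | 25 | Shark | 45 | Malli (Coriander) | 65 | Oats | 85 | Nasal blocks |
| 6 | Cockroach | 26 | Gourds[a] | 46 | Mango | 66 | Ragi | 86 | Headache |
| 7 | Cat dander | 27 | Banana[a] | 47 | Mushroom | 67 | Rice | 87 | Itching |
| 8 | Dog fur | 28 | Beans | 48 | Nuckol (Brassica oleracea) | 68 | Wheat | 88 | Rashes |
| 9 | Road dust | 29 | Beet root | 49 | Onion | 69 | Coconut | 89 | Age |
| 10 | Old paper Dust | 30 | Brinjal | 50 | Peas | 70 | Oil[a] | 90 | Gender |
| 11 | PS dust | 31 | Cabbage | 51 | Potroot | 71 | Garlic | 91 | Family history |
| 12 | Milk (P) | 32 | Capsicum | 52 | Paneer (“Farmer’s cheese”) | 72 | Ginger | 92 | Class |
| 13 | Milk (B) | 33 | Chillie | 53 | Potato | 73 | Pepper | | |
| 14 | Curd | 34 | Cauliflower | 54 | Pumpkin | 74 | Tamarind | | |
| 15 | Coffee | 35 | Carrot | 55 | Pudina (Mentha spicata) | 75 | Aginomoto | | |
| 16 | Tea | 36 | Radish | 56 | Chow chow (Chayota edulis) | 76 | Spices[a] | | |
| 17 | Beef | 37 | Corn | 57 | Tomato | 77 | Coco | | |
| 18 | Chicken | 38 | Cucumber | 58 | Tondaikai (Trichosanthes dioica) | 78 | Horlicks | | |
| 19 | Mutton | 39 | Drumstick | 59 | Plantain stem | 79 | Boost | | |
| 20 | Egg | 40 | Greens[a] | 60 | Yams | 80 | Nuts[a] | | |

[a] Customized based on patient history.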

The feature evaluator ranks the features of S_train based on their relevance value. The relevance threshold (τ) was set to 0.01. From among the 91 features (excluding class), 41 features were selected. The selected features with the same 518 samples constitute the preprocessed data. The complete list of features, ranked by their relevance value, is presented in Table 5.
Table 5

Relief Relevance Values

| Allergen (Feature) | Relevance Value | Allergen (Feature) | Relevance Value | Allergen (Feature) | Relevance Value |
|---|---|---|---|---|---|
| Red rashes | 0.63514 | Channa | 0.02317 | Malli | 0 |
| Swelling | 0.62355 | Coffee | 0.02124 | Road dust | −0.00193 |
| Itching | 0.60425 | Pumpkin | 0.02124 | Fish 1 | −0.00193 |
| Cough | 0.22201 | Chicken | 0.02124 | Cat dander | −0.00193 |
| Running nose | 0.17954 | Headache | 0.01931 | Cockroach | −0.00193 |
| Wheeze/blocks | 0.14865 | Garlic | 0.01931 | Nuckol | −0.00386 |
| Sneeze | 0.14093 | F_history | 0.01737 | Chillie | −0.00386 |
| Coconut | 0.09653 | Wheat | 0.01544 | PS dust | −0.00386 |
| Lady’s finger | 0.07915 | Pepper | 0.01351 | Cucumber | −0.00386 |
| Carrot | 0.07915 | Peas | 0.01351 | Spices | −0.00386 |
| Tamarind | 0.07336 | Prawns | 0.01158 | Pudina | −0.00579 |
| Greens | 0.05985 | Beef | 0.00965 | Mutton | −0.00579 |
| Curd | 0.05598 | Mushroom | 0.00772 | Milk (P) | −0.00579 |
| Tea | 0.04826 | Capsicum | 0.00772 | House dust | −0.00772 |
| Egg | 0.04826 | Kovaikai | 0.00579 | Parthenium | −0.00965 |
| Brinjal | 0.04633 | Chow chow | 0.00386 | Pollen | −0.00965 |
| Oats | 0.03861 | Paneer | 0.00386 | Onion | −0.01351 |
| Radish | 0.03861 | Oil | 0.00386 | Beans | −0.01351 |
| Dhal | 0.03668 | Cotton dust | 0.00386 | Maida | −0.01544 |
| Yams | 0.03282 | Cabbage | 0.00193 | Potato | −0.01544 |
| Drumstick | 0.03282 | Gram | 0.00193 | Dog fur | −0.01931 |
| Aginomoto | 0.03282 | Corn | 0.00193 | Kothavarai | −0.01931 |
| Banana | 0.03282 | Tondaikai | 0 | Potroot | −0.01931 |
| Aspergillus | 0.03089 | Shark | 0 | Vazpoo/thandu | −0.0251 |
| Ragi | 0.02703 | Nuts | 0 | Fish 2 | −0.0251 |
| Avaraikai | 0.02703 | Horlicks | 0 | Age | −0.02736 |
| Crab | 0.02703 | Boost | 0 | Milk (B) | −0.03282 |
| Ginger | 0.02703 | Coco | 0 | Gourds | −0.03475 |
| Tomato | 0.02703 | Rice | 0 | Beet root | −0.03475 |
| Cauliflower | 0.0251 | Paper dust | 0 | Mango | −0.03861 |
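Selecting features against the relevance threshold amounts to a simple filter over the ranked values. The sketch below uses a handful of entries from Table 5; the variable names and the "greater than or equal to τ" comparison are assumptions for illustration.

```python
TAU = 0.01  # relevance threshold used in the study

# A few relevance values copied from Table 5 (not the full list).
relevance = {
    "Red rashes": 0.63514,
    "Swelling": 0.62355,
    "Prawns": 0.01158,
    "Beef": 0.00965,
    "Malli": 0.0,
    "Mango": -0.03861,
}

# Rank by relevance, then keep the features that reach the threshold.
ranked = sorted(relevance.items(), key=lambda item: item[1], reverse=True)
selected = [name for name, value in ranked if value >= TAU]
```

In the full table, Prawns (0.01158) is the last feature above the threshold and Beef (0.00965) the first below it, which gives the 41 selected features.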
The classifier evaluator assesses the performance of the class-based associative classifier (CBA), decision tree classifier (C4.5), support vector machine (SVM), multi-layer perceptron (MLP), naïve Bayes classifier (NB), and k-nearest neighbor classifier (kNN).[24] In order to make the evaluation unbiased, cross-validation is applied over the same features and same partitions of the preprocessed data. The samples in each partition remain the same when each fold is iteratively tested. However, different runs had different samples in the folds in order to avoid the variations and perturbations that may exist due to cross-validation. The evaluator carries out 10 independent runs of 10-fold cross-validation. The complete results of the cross-validations are presented in Online Appendix 1. Figure 3 presents the classification accuracy of the six classifiers.
Figure 3

Classification accuracy of six classifiers (C4.5 = decision tree classifier; CBA = class-based associative classifier; kNN = k-nearest neighbor classifier; MLP = multilayer perceptron; NB = naïve Bayes classifier; SVM = support vector machine).

The naïve Bayes classifier was tested with the test data (S_test). A set of sample test instances was also presented to three junior clinicians working at the Good Samaritan Lab and Allergy Centre, Chennai. The clinicians diagnosed the test instances in the absence of the expert. The performance of the clinicians was evaluated using the same performance evaluation measures used by the performance evaluator. The classification performance of the clinicians and the CMD system over the test instances is presented in Table 6.
Table 6

Performance Evaluation on IDST Test Data

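| Measure | Clinician 1 | Clinician 2 | Clinician 3 | CMD System |
|---|---|---|---|---|
| Sensitivity | 0.5000 | 0.9782 | 0.4782 | 0.969 |
| Specificity | 0.8103 | 0.8275 | 0.6206 | 0.969 |
| Precision | 0.6764 | 0.8181 | 0.5000 | 0.964 |
| Accuracy, % | 67.30 | 89.42 | 70.33 | 96.9231 |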

Note: IDST = intradermal skin test.

The significance of the classifier evaluation results was evaluated using Student’s two-tailed paired t test.[25] The significance level of the test was set to 0.05 (5%). From the observations, it was inferred that there is a significant improvement in the classification accuracy of the NB Classifier. The run numbers of the 10-fold cross-validation, accuracies obtained, and the corresponding P values for the classifiers are shown in Table 7.
Table 7

Statistical Significance of Classifier Evaluator

| Run Number | NB | CBA | SVM | C4.5 | MLP | kNN |
|---|---|---|---|---|---|---|
| 1 | 94.59276 | 94.2081446 | 91.1161386 | 94.7813 | 91.8816 | 93.23906 |
| 2 | 94.80015 | 94.2232276 | 92.4773753 | 94.80015 | 91.90045 | 93.82353 |
| 3 | 94.9736 | 93.8159877 | 92.85822 | 94.79638 | 93.05053 | 94.20814 |
| 4 | 94.77376 | 93.9969833 | 92.0701355 | 94.0083 | 92.07768 | 94.01584 |
| 5 | 94.78884 | 94.0233783 | 92.6621416 | 94.21569 | 93.43514 | 93.43891 |
| 6 | 94.79261 | 94.2081446 | 92.6621415 | 94.21569 | 93.24661 | 93.43891 |
| 7 | 94.79638 | 94.2043739 | 92.4698339 | 94.78884 | 92.26998 | 93.43514 |
| 8 | 94.78507 | 94.2006031 | 92.0814477 | 93.23529 | 91.87783 | 93.81599 |
| 9 | 94.58899 | 94.0196077 | 93.8235292 | 96.13876 | 93.43891 | 94.20814 |
| 10 | 94.98115 | 94.2081446 | 92.0927599 | 94.97738 | 92.47738 | 94.21192 |
| P | | 1.4164e−06 | 2.2574e−06 | 0.4679 | 4.9900e−05 | 0.025153 |

Note: NB = naïve Bayes classifier; CBA = class-based associative classifier; SVM = support vector machine; C4.5 = decision tree classifier; MLP = multilayer perceptron; kNN = k-nearest neighbor classifier.
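A paired t test on two columns of Table 7 can be sketched with the standard library alone. The example compares NB with C4.5 over the ten runs; the function name `paired_t` is hypothetical, and instead of computing a P value the statistic is compared with 2.262, the two-tailed 5% critical value of Student's t distribution with 9 degrees of freedom.

```python
import math
import statistics

# Accuracies of the ten runs, read from the NB and C4.5 columns of Table 7.
NB = [94.59276, 94.80015, 94.9736, 94.77376, 94.78884,
      94.79261, 94.79638, 94.78507, 94.58899, 94.98115]
C45 = [94.7813, 94.80015, 94.79638, 94.0083, 94.21569,
       94.21569, 94.78884, 93.23529, 96.13876, 94.97738]


def paired_t(a, b):
    """Paired t statistic: mean difference over its standard error."""
    d = [x - y for x, y in zip(a, b)]
    return statistics.mean(d) / (statistics.stdev(d) / math.sqrt(len(d)))


T_CRIT_DF9 = 2.262  # two-tailed critical value, alpha = 0.05, df = 9
t_stat = paired_t(NB, C45)
significant = abs(t_stat) > T_CRIT_DF9
```

For this pair the statistic stays well below the critical value, which is in line with the P value of 0.4679 listed in the C4.5 column.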

Discussion

The inhalants, contactants, and ingestants of an individual are influenced by food habits, biocoenosis, elements of the biosphere, and social environment of an individual. The interactions and adaptations of an individual are prone to be based on socioeconomic status, cultures, traditions, religious beliefs, people groups, and physical environmental factors such as seasons, weather conditions, heat, and humidity. The list of allergens enumerated in Table 4 is neither exhaustive nor generic. In Chennai, a place of diverse people groups, it is not feasible to either capture or generalize all the characteristics of food, behavior, and lifestyle of a population. However, even in the midst of all these limitations, an attempt has been made by a panel of experienced immunologists and medical experts at the allergy center for framing a list of allergens (inhalants, contactants, and ingestants) that is used for analyzing the history of a patient. It can be observed from the list (Table 4) that some of the allergens are customized after a thorough analysis of the history and background of the patient. Tests are continually performed at the center for more than four decades and the list of allergens are annually revised based on the people’s present food habits and environmental conditions. A patient may be associated with many inhalants, contactants, and ingestants; however, it is not possible to test all possible allergic triggers. According to international allergic testing standards, it is suggested that an upper limit on the number of pricks is up to 40 for intradermal skin tests.[26] Though this is a strict limitation for skin tests, it is followed in most allergy testing centers. The feature evaluator with a relevance threshold (τ) of 0.01 selects 41 features that include allergens and allergic symptoms. This ensures that the number of features (allergens) selected by the feature evaluator is in accordance to the allergic testing standards. 
All the features selected by the feature evaluator are nominal and are independent of the values of the other attributes. Based on the results of the classifier evaluator the Bayes classifier is used to validate the CMD system using the test data. Bayesian classification approach is well suited for data that are nominal and satisfy the class conditional independence assumption.[27] Laplacian correction is used for probability estimation when zero probability values are encountered.[24] The evaluation measures used by the performance evaluator to assess the performance of the CMD system are used to evaluate the performance of the clinicians too. From the classification performance of the clinicians, it can be observed that there is a high deviation from one clinician to another. Hence, in the absence of the allergist, clinicians may use the CMD system as a secondary consideration for decision making in the diagnosis of urticaria based on skin test results. The framework presented in this work is purely developed and validated by using trivial mathematical and statistical models. The data mining process is fully automated and does not require the intervention of an expert to tune or adjust the system. However, the CMD system is intended to replicate expert judgement. Therefore, its purpose in clinical utility is to prompt the clinician to reconsider and confirm his or her decision in the absence of an expert. Clinical judgement is far more comprehensive than pure mathematics.[28] There may exist additional subconscious factors that are overlooked by the model.
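The Laplacian correction mentioned above can be illustrated with the common add-one form. This is an assumption: the paper does not state the exact variant it uses, and the function name `laplace_estimate` and the parameter `n_values` (number of distinct values the attribute can take) are hypothetical.

```python
from fractions import Fraction


def laplace_estimate(count, class_total, n_values):
    """Add-one estimate of P(value | class) that is never exactly zero."""
    return Fraction(count + 1, class_total + n_values)


# Worked-out example: Cotton dust = NR never occurs among the 4 positive
# training samples, so the uncorrected estimate would be 0/4 = 0.
# Assuming the attribute takes the 2 values R and NR:
p_nr_given_positive = laplace_estimate(count=0, class_total=4, n_values=2)
p_r_given_positive = laplace_estimate(count=4, class_total=4, n_values=2)
```

The corrected estimates still sum to 1 over the attribute's values, while a single unseen value no longer forces the whole product of conditional probabilities to zero.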

Conclusion

Medical decision-making systems are widely used for diagnosis. They are also used by junior clinicians and medical students to confirm their decisions. In the diagnosis of allergic disorders, it is not desirable to use a CMD system for complete analysis and diagnosis but as an aid for decision making. The framework of the CMD system used in this work is generic and can be used for a different location. However, the efficiency and efficacy of the system depend on the data distribution, skin test results, and other biological and clinical factors. The medical data that are given as input to the system are important. The testing methods at the allergy center are well established, and the list of allergens is chosen and revised based on the changing food habits of the people and environmental conditions. Hence, the decisions suggested by the system are meaningful and reliable. A focused study on the population and the environmental factors would enable system designers to develop more customized CMD systems. There are allergic disorders and triggers whose causes are ill-defined or unknown. A better biological insight of these disorders may allure the interest of knowledge engineers to develop appropriate CMD systems to enhance and support medical decision making.

Set of Sample Instances

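| No. | Cotton Dust | Wheat | Chicken | Prawn | Brinjal | Carrot | Dhal | Sneezing | Itching | Swelling | Class |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | R | NR | NR | NR | R | NR | NR | No | Yes | Yes | Positive |
| 2 | R | 0 | 0 | 0 | NR | NR | NR | No | Yes | Yes | Negative |
| 3 | NR | R | R | R | NR | NR | NR | Yes | No | No | Negative |
| 4 | R | R | R | NR | NR | NR | R | No | Yes | Yes | Positive |
| 5 | R | 0 | NR | R | R | NR | NR | No | Yes | Yes | Positive |
| 6 | R | NR | R | R | R | NR | 0 | Yes | Yes | Yes | Positive |
| 7 | R | NR | NR | 0 | NR | R | 0 | No | No | No | Negative |
| 8 | R | NR | 0 | 0 | R | NR | NR | Yes | No | No | Negative |
| 9 | R | R | 0 | 0 | NR | NR | R | Yes | No | No | Negative |
| 10 | NR | NR | NR | NR | NR | 0 | R | No | No | No | Negative |
| 11 | R | NR | R | R | R | NR | NR | No | Yes | Yes | Positive |
| 12 | R | N | NR | 0 | NR | NR | R | Yes | No | No | Negative |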

Note: R = reactive; NR = not reactive; 0 = not tested/not associated.

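| Attribute | Value | Positive (4) | Negative (6) |
| --- | --- | --- | --- |
| Cotton Dust | R | 4.0 | 4.0 |
| | NR | 0.0 | 2.0 |
| | [total] | 4.0 | 6.0 |
| Wheat | NR | 2.0 | 3.0 |
| | 0 | 1.0 | 1.0 |
| | R | 1.0 | 2.0 |
| | [total] | 4.0 | 6.0 |
| Chicken | NR | 2.0 | 2.0 |
| | 0 | 0.0 | 3.0 |
| | R | 2.0 | 1.0 |
| | [total] | 4.0 | 6.0 |
| Prawn | NR | 2.0 | 1.0 |
| | 0 | 0.0 | 4.0 |
| | R | 2.0 | 1.0 |
| | [total] | 4.0 | 6.0 |
| Brinjal | R | 3.0 | 1.0 |
| | NR | 0.0 | 2.0 |
| | [total] | 5.0 | 6.0 |
| Dhal | NR | 2.0 | 3.0 |
| | 0 | 1.0 | 2.0 |
| | R | 1.0 | 1.0 |
| | [total] | 4.0 | 6.0 |
| Swelling | YES | 4.0 | 1.0 |
| | NO | 0.0 | 5.0 |
| | [total] | 4.0 | 6.0 |
| Sneezing | NO | 3.0 | 3.0 |
| | YES | 1.0 | 3.0 |
| | [total] | 4.0 | 6.0 |
| Itching | YES | 4.0 | 1.0 |
| | NO | 0.0 | 5.0 |
| | [total] | 4.0 | 6.0 |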
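
| No. | Cotton Dust | Wheat | Chicken | Prawn | Brinjal | Carrot | Dhal | Sneezing | Itching | Swelling |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 11 | R | NR | R | R | R | NR | NR | NO | YES | YES |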
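
How a Bayes classifier with Laplacian correction would label instance 11 from the first ten sample instances can be sketched as follows. This is an illustrative reconstruction, not the authors' code: the training rows are the ten sample instances as read from the table above, and the assumptions are a smoothing constant of 1, unsmoothed class priors, three possible values (R, NR, 0) for each allergen, and two (Yes, No) for each symptom. The counts are recomputed from the rows rather than copied from the frequency table.

```python
from collections import Counter

ATTRS = ["Cotton Dust", "Wheat", "Chicken", "Prawn", "Brinjal",
         "Carrot", "Dhal", "Sneezing", "Itching", "Swelling"]
N_ALLERGENS = 7  # first seven attributes are allergens, the rest are symptoms

# Sample instances 1-10: attribute values in ATTRS order, then the class label.
RAW_TRAIN = [
    ("R NR NR NR R NR NR No Yes Yes", "Positive"),
    ("R 0 0 0 NR NR NR No Yes Yes", "Negative"),
    ("NR R R R NR NR NR Yes No No", "Negative"),
    ("R R R NR NR NR R No Yes Yes", "Positive"),
    ("R 0 NR R R NR NR No Yes Yes", "Positive"),
    ("R NR R R R NR 0 Yes Yes Yes", "Positive"),
    ("R NR NR 0 NR R 0 No No No", "Negative"),
    ("R NR 0 0 R NR NR Yes No No", "Negative"),
    ("R R 0 0 NR NR R Yes No No", "Negative"),
    ("NR NR NR NR NR 0 R No No No", "Negative"),
]
TRAIN = [(values.split(), label) for values, label in RAW_TRAIN]


def posterior(instance, train=TRAIN, alpha=1.0):
    """Return normalized class posteriors under the naive Bayes assumption."""
    class_counts = Counter(label for _, label in train)
    scores = {}
    for label, n_label in class_counts.items():
        score = n_label / len(train)  # class prior (left unsmoothed here)
        for i, value in enumerate(instance):
            matches = sum(1 for values, lab in train
                          if lab == label and values[i] == value)
            n_values = 3 if i < N_ALLERGENS else 2
            # Laplace-corrected estimate of P(value | class)
            score *= (matches + alpha) / (n_label + alpha * n_values)
        scores[label] = score
    total = sum(scores.values())
    return {label: s / total for label, s in scores.items()}


instance_11 = "R NR R R R NR NR No Yes Yes".split()
post = posterior(instance_11)
predicted = max(post, key=post.get)
print(predicted, post)
```

Under these assumptions the sketch assigns instance 11 to the Positive class, which agrees with the class recorded for that instance in the sample table.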