| Literature DB >> 35400761 |
Asmaa H Rabie1, Nehal A Mansour2, Ahmed I Saleh1, Ali E Takieldeen3.
Abstract
Covid-19, what a strange, unpredictable mutated virus. It has baffled many scientists, as no firm rule has yet been reached to predict the effect that the virus can inflict on people if they are infected with it. Recently, many researches have been introduced for diagnosing Covid-19; however, none of them pay attention to predict the effect of the virus on the person's body if the infection occurs but before the infection really takes place. Predicting the extent to which people will be affected if they are infected with the virus allows for some drastic precautions to be taken for those who will suffer from serious complications, while allowing some freedom for those who expect not to be affected badly. This paper introduces Covid-19 Prudential Expectation Strategy (CPES) as a new strategy for predicting the behavior of the person's body if he has been infected with Covid-19. The CPES composes of three phases called Outlier Rejection Phase (ORP), Feature Selection Phase (FSP), and Classification Phase (CP). For enhancing the classification accuracy in CP, CPES employs two proposed techniques for outlier rejection in ORP and feature selection in FSP, which are called Hybrid Outlier Rejection (HOR) method and Improved Binary Genetic Algorithm (IBGA) method respectively. In ORP, HOR rejects outliers in the training data using a hybrid method that combines standard division and Binary Gray Wolf Optimization (BGWO) method. On the other hand, in FSP, IBGA as a hybrid method selects the most useful features for the prediction process. IBGA includes Fisher Score (FScore) as a filter method to quickly select the features and BGA as a wrapper method to accurately select the features based on the average accuracy value from several classification models as a fitness function to guarantee the efficiency of the selected subset of features with any classifier. In CP, CPES has the ability to classify people based on their bodies' reaction to Covid-19 infection, which is built upon a proposed Statistical Naïve Bayes (SNB) classifier after performing the previous two phases. CPES has been compared against recent related strategies in terms of accuracy, error, recall, precision, and run-time using Covid-19 dataset [1]. This dataset contains routine blood tests collected from people before and after their infection with covid-19 through a Web-based form created by us. CPES outperforms the competing methods in experimental results because it provides the best results with values of 0.87, 0.13, 0.84, and 0.79 for accuracy, error, precision, and recall.Entities:
Keywords: Covid-19; Naïve Bayes; Prediction; Prudential Expectation
Year: 2022 PMID: 35400761 PMCID: PMC8983097 DOI: 10.1016/j.patcog.2022.108693
Source DB: PubMed Journal: Pattern Recognit ISSN: 0031-3203 Impact factor: 8.518
Fig. 1Symptomatic, asymptomatic, and pre-symptomatic transmission.
Fig. 2Symptomatic versus asymptomatic cases on board the Diamond Princess Cruise ship, Yokohama, Japan, 2020.
Fig. 3People classification based on their vulnerability level to Covid-19.
People classification based on their vulnerability level to Covid-19.
| Type | Description | Risk level | Treatment | Case |
| Type A | No Symptoms (NS) | Low | To eliminate virus spread, persons of Type A need continuous follow-up and periodic examination, where he /she may be infected with Corona, despite the absence of symptoms. By making sure of constant observation, a person of type A can be allowed to be in crowded places. It is preferred to receive the vaccine if it is available. | Asymptomatic |
| Type B | Simple Symptoms, No Deteriorate (SSND) | No need for continuous follow-up, but Home isolation is necessary as soon as symptoms appear. A person of type B can be allowed to be in crowded places. Simple patient treatments can be followed whenever the symptoms appear. It is preferred to receive the vaccine if it is available. | Symptomatic | |
| Type C | Simple Symptoms, But Deteriorate (SSBD) | Medium | The same treatments as SSND, but more serious patient treatments can be followed whenever the symptoms appear. | |
| Type D | Medium Symptoms (MS) | Precautionary measures must be applied, such as staying at home. A person of type D is not allowed to be in crowded places. Serious patient treatments can be followed whenever the symptoms appear. For persons of type D, Vaccination is recommended. | ||
| Type E | High Symptoms (HS) | High | Persons of Type E (or F) must receive the vaccine as soon as possible. Strict precautionary measures must be applied, he/she must staying at home. | |
Fig. 4The proposed Covid-19 Prudential Expectation Strategy (CPES).
Fig. 5The sequential steps of HOR method.
Fig. 6Different types of features.
Fig. 7The sequential steps of IBGA.
Determine the best chromosome based on both every classifier and average accuracy.
| Classifier # | Accuracy of every chromosome | The best chromosome | |
| Ch1 | Ch2 | ||
| C1 = NB | 0.75 | 0.7 | Ch1 |
| C2 = KNN | 0.9 | 0.7 | Ch1 |
| C3 = SVM | 0.8 | 0.9 | Ch2 |
| Average accuracy | 0.816 | 0.767 | Ch1 |
Algorithm 1Feature Convergence within Classes Algorithm.
Algorithm 2Feature Divergence among Classes Algorithm.
Algorithm 3(a) Estimating δδ and ββ graphically (b) Estimating the optimal values of δδ and ββ algorithm.
The corresponding used values of the applied tunable parameters.
| Parameter | Description | Applied value |
| Spro | Probability of selection | Random (0 ≤ Spro ≤ 1) |
| Cpro | Probability of Crossover | Random (0 ≤ Cpro ≤ 1) |
| Mpro | Probability of Mutation | Random (0 ≤ Mpro ≤ 1) |
| r1 and r2 | Two independent random numbers | Random (0 ≤ r1,r2 ≤ 1) |
| a | Linearly decrease | [2,0] |
| Max_iter_BGA | The maximum number of iterations for GA | 100 |
| Max_iter_BGWO | The maximum number of iterations for GWO | 100 |
Descriptions about the features of Covid-19.
| (a) | ||
| Feature | Description | Selected Feature |
| Age | Age of the patient. | Yes |
| Gender | Male / Female. | No |
| Glucose | Glucose represents the main type of sugar found in the blood. | Yes |
| Blood type | Determine the type of the blood. | Yes |
| Blood Pressure | It is the pressure of the blood on the walls of the arteries. | Yes |
| Body Mass Index (BMI) | BMI is a measure that indicates to total body fat. It is used to measure whether a person is at a healthy weight. | Yes |
| Diabetes Pedigree Function | A function that scores the likelihood of diabetes based on family histor | No |
| Total_Bilirubin | It is a measure of liver function in which it measures the amount of a substance called bilirubin in the blood. | Yes |
| Direct_Bilirubin | It is a measure of the amount of conjugated bilirubin in which it looks for bilirubin in the urine or blood. | Yes |
| Alkaline_ Phosphotase | It is a measure of the amount of alkaline phosphotase in your blood in which Alkaline_ Phosphotase is an enzyme found throughout the body. Actually, it is mostly found in the liver, digestive system, kidneys, and bones. | Yes |
| Alamine_ Aminotransferase (ALT) | ALT is an enzyme that is normally found in the cells of the liver and kidney. When ALT levels in blood are high that means a liver is damaged. | Yes |
| Aspartate_ Aminotransferase (AST) | AST is an enzyme that is normally found in liver and heart. When AST levels in blood are high that indicates liver diseases and heart problems or pancreatitis. | Yes |
| Total_ Protiens | It is a measure of the amount of protein in your blood. When total protein level is high that indicates dehydration or a certain type of cancer. | No |
| Albumin | It is protein that is made by liver in which it is a test that measures of the amount of albumin in the blood. | Yes |
| Globulin_Ratio | It is a ratio of albumin to globulin in blood plasma. | Yes |
| Red blood count | It is a blood test that is used to measure how many red blood cells that contain haemoglobin, which carries oxygen throughout the body. | Yes |
| Pus Cell | It is a white blood cell (as a neutrophil) that is found in pus. | No |
| Bacteria | Bacteria are used to help diagnose certain types of infections in which their organisms not visible with the naked eye. | Yes |
| Blood urea test | It measures the amount of nitrogen in the blood. When your blood urea level rises, this means that your kidneys cannot remove urea from the blood normally. | Yes |
| (b) | ||
| Feature | Description | Selected Feature |
| Serum creatinine | It indicates kidney health in which it is an easily measured by product of muscle metabolism. | Yes |
| Sodium | A sodium is a part of an electrolyte panel that is used as a blood test to measure the amount of sodium in the blood. | Yes |
| Potassium | A | Yes |
| Haemoglobin | It is a blood test that is used to measure the amount of haemoglobin in the blood in which it carries oxygen to organs and tissues in the body and also transfer carbon dioxide from organs and tissues to lungs. | Yes |
| Packed cell volume | It's a test used to determine whether a patient has polycythaemia, dehydration, or anaemia. It's usually part of a whole blood count test. | No |
| White blood | The immune system is made up of white blood cells. They aid in the battle against infections and other disorders. | Yes |
| Hypertension | It is a disorder in which the blood arteries have a consistently elevated pressure. Hypertension is a significant medical condition that can put your heart, brain, kidneys, and other organs at risk. | Yes |
| Pedal edema | The medical term for swelling is edoema. Injuries and inflammation cause body parts to swell. edoema might affect a small portion of the body or the full body. | No |
| Resting electrocardiographic results | It's a medical test that measures the electrical activity generated by the heart while it contracts to diagnose heart abnormalities. | Yes |
| Anemia | Anemia is a condition in which the number of red blood cells or haemoglobin is lower than normal. | Yes |
| Diabetes mellitus | Diabetes mellitus is a disorder in which the body's ability to form of sugar is impaired. | Yes |
| Coronary artery disease | The coronary arteries supply your heart with blood, oxygen, and nourishment. Coronary artery disease develops when your heart's primary blood arteries become damaged or diseased. | Yes |
| Appetite | The appetite test is used to determine the appetite of a person. | No |
| Maximum | The number of contractions (beats) of the heart per minute is used to determine the heart rate (bpm). | Yes |
| Exercise induced angina | Angina is chest pain that occurs as a result of exercise, stress, or other factors that cause the heart to pump harder. It's a symptom of coronary artery disease that's very frequent. | Yes |
| Atherosclerosis | Arteriosclerosis is a condition in which the arteries that carry oxygen and nutrients from your heart to the rest of your body thicken and stiffen, reducing blood flow to your organs and tissues. | Yes |
| D-Dimer | A D-dimer test examines the presence of D-dimer in the blood. When a blood clot dissolves in your body, a protein fragment called a D-dimer is formed. | Yes |
| C-Reactive Protein (CRP) | CRP is a protein made by the liver, and the CRP test is used to discover or monitor inflammatory diseases. | Yes |
| Lactate Dehydrogenase (LDH) | The LDH test is performed to detect any tissue damage. | Yes |
| Troponin | Troponins are a family of proteins that govern muscular contraction in skeletal and cardiac muscle fibres. Troponin tests detect heart damage by measuring the quantity of cardiac-specific troponin in the blood. | Yes |
| (c) | ||
| Feature | Description | Selected Feature |
| Platelets Count (PC) | The platelet count (PC) is a blood test which measures the average amount of platelets in a person's blood. Platelets aid in the healing of wounds and the prevention of excessive bleeding in the bloodstream. | Yes |
| Neutrophils Count (NC) | Neutrophils are a type of WBC that form (50-75%) of the total. NC gives critical information regarding the patient's health status. | Yes |
| Lymphocytes Count (LC) | The lymphocyte count, which is a component of WBC, is determined by LC test. | Yes |
| Monocytes count | The quantity of monocytes circulating in the blood is measured by the monocytes count test. | No |
| Eosinophil | Eosinophil is a type of white blood cell that helps the immune system combat disease by preventing infections and increasing inflammation. | No |
| Basophils | Basophils are bone marrow-derived white blood cells that aid in the proper functioning of the immune system. | No |
| Gamma-Glutamyl Transpeptidas | GGT is an ubiquitous enzyme found throughout the body. GGT levels in the blood can indicate bile duct damage or liver disease; a GGT test can determine the quantity of GGT in the blood. | No |
| Chest pain type | The discomfort in the chest or presence of abnormal pain , between the diaphragm and the base of the neck, is defined as chest pain. | Yes |
| Fasting blood sugar | After an overnight fast, this test measures how much sugar is in a blood sample. | No |
| Ferritin | Ferritin is the most important protein for iron storage, and it can become raised in the context of circumstances that cause severe inflammation. | No |
| Creatine phosphokinase (CPK) | CPK is a protein located in your heart, brain, and skeletal muscles that helps to induce chemical changes in your body. | Yes |
Distribution of people in dataset according to their type.
| Criteria | Value / Description | ||
|---|---|---|---|
| Total number of cases | Covid-19 Patients | Un-Covid-19 People | Un-confirmed cases |
| 1389 | 430 | 396 | |
| Type of Covid-19 Patients | Type A | Type B | Type C |
| 173 | 239 | 129 | |
| Type D | Type E | Type F | |
| 228 | 471 | 149 | |
Fig. 8A snapshot from the Covid-19 dataset.
Fig. 12Recall of several feature selection methods.
Fig. 13Run time of several feature selection methods.
Fig. 14Accuracy of several Covid-19 prudential expectation strategies.
Fig. 17Recall of several Covid-19 prudential expectation strategies.
Fig. 18Run time of several Covid-19 prudential expectation strategies.