| Literature DB >> 34035886 |
Naresh Kumar1, Nripendra Narayan Das2, Deepali Gupta3, Kamali Gupta3, Jatin Bindra1.
Abstract
Recently, many researchers have designed various automated diagnosis models using various supervised learning models. An early diagnosis of disease may control the death rate due to these diseases. In this paper, an efficient automated disease diagnosis model is designed using the machine learning models. In this paper, we have selected three critical diseases such as coronavirus, heart disease, and diabetes. In the proposed model, the data are entered into an android app, the analysis is then performed in a real-time database using a pretrained machine learning model which was trained on the same dataset and deployed in firebase, and finally, the disease detection result is shown in the android app. Logistic regression is used to carry out computation for prediction. Early detection can help in identifying the risk of coronavirus, heart disease, and diabetes. Comparative analysis indicates that the proposed model can help doctors to give timely medications for treatment.Entities:
Year: 2021 PMID: 34035886 PMCID: PMC8101482 DOI: 10.1155/2021/9983652
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Comparative analysis of the existing techniques.
| Ref. | Year | Model | Features | Application |
|---|---|---|---|---|
| [ | 2019 | Machine learning models | Used general linear model (GLM) regression, support vector machines (SVMs) with a radial basis function kernel, and single-layer artificial neural networks | Medicine |
| [ | 2019 | Artificial intelligence in healthcare | AI can perform healthcare tasks as well or better than humans, implementation factors will prevent large-scale automation of healthcare professional jobs for a considerable period | Healthcare |
| [ | 2016 | — | In addition to individual CVD risk factors, Framingham and systematic coronary risk evaluation (SCORE) algorithms were used to assess the absolute risk of a CVD | Open heart |
| [ | 2020 | DenseNet201 | A DenseNet201-based deep transfer learning (DTL) is proposed to classify the patients as COVID infected or not, i.e., COVID-19 (+) or COVID (−) | COVID-19 |
| [ | 2019 | Deep learning | A Comprehensive analysis was presented on the use of machine and deep learning for IDS systems in wireless sensor networks (WSNs) | Wireless networks |
| [ | 2016 | Data mining technique | Decision tree shows better results as compared with J48, logistic model tree algorithm, and random forest | Heart disease |
| [ | 2016 | Heart disease | The features reduction has an impact on classifiers performance in terms of accuracy and execution time of classifiers | Medical |
| [ | 2019 | Machine learning | Artificial neural network optimized by particle swarm optimization (PSO) combined with ant colony optimization (ACO) approaches | Heart disease |
| [ | 2019 | Ensemble classification techniques | The ensemble technique can be applied for improving prediction accuracy in heart disease | Medicine |
| [ | 2020 | Deep transfer learning | Used to detect and diagnose COVID-19. Chest X-rays is preferred over CT scan | COVID-19 |
| [ | 2020 | Deep convolutional neural networks | Computed tomography (CT) scans to diagnose pneumonia, lung inflammation, abscesses, and enlarged lymph nodes. Since COVID-19 attacks the epithelial cells that line our respiratory tract, therefore, X-ray images are utilized | COVID-19 |
Figure 1Diagrammatic flow of the proposed methdology.
Features of the dataset.
| Age | Sex | Symptoms | Country | Travel _history location | Outcome | |
|---|---|---|---|---|---|---|
| 0 | 42 | Female | Fever | China | Wuhan | 1 |
| 1 | 59 | Female | Fever | China | Wuhan | 1 |
| 2 | 38 | Female | Cough | China | Wuhan | 1 |
| 3 | 45 | Male | Fever | China | Wuhan | 1 |
| 4 | 33 | Female | Fever | China | Wuhan | 1 |
Features in COVID-19 dataset.
| ID | date_admission_hospital | Sequence available |
|---|---|---|
| Age | Date confirmation | Outcome |
| Sex | Symptoms | date_death_or_discharge |
| City | lives_in_Wuhan | notes_for_discussion |
| Province | travel_history_dates | Location |
| Country | travel_history_location | admin3 |
| Wuhan(0)_not Wuhan(1) | reported_market_exposure | admin2 |
| Latitude | Additional information | admin1 |
| Longitude | chronic_disease_binary | Country new |
| geo_resolution | Chronic disease | admin_id |
| date_onset_symptoms | Source | data_moderator_initials |
Figure 2Heatmap of COVID-19 dataset.
Heart disease dataset.
| Id | Age | Gender | Height | Weight | Ap_lo | Cholesterol | Gluc | Smoke | Alco | Active | Cardio | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 18393 | 2 | 168 | 62.0 | 110 | 80 | 1 | 1 | 0 | 1 | 0 |
| 1 | 1 | 20228 | 1 | 156 | 85.0 | 140 | 90 | 3 | 1 | 0 | 1 | 1 |
| 2 | 2 | 18857 | 1 | 165 | 64.0 | 130 | 70 | 3 | 1 | 0 | 0 | 1 |
| 3 | 3 | 17623 | 2 | 169 | 82.0 | 150 | 100 | 1 | 1 | 0 | 1 | 1 |
| 4 | 4 | 17474 | 1 | 158 | 58.0 | 100 | 60 | 1 | 1 | 0 | 0 | 0 |
Figure 3Heat map of heart disease dataset.
Features in heart disease dataset.
| Feature | Description |
|---|---|
| Age | Days-integer |
| Height | Height in cm-integer |
| Weight | Weight in kg-float |
| Gender | Categorical code (1-women, 2-men) |
| Systolic blood pressure | Integer |
| Diastolic blood pressure | Integer |
| Cholesterol | 1: normal, 2: above normal, 3: well above normal |
| Glucose | 1: normal, 2: above normal, 3: well above normal |
| Smoking | Binary |
| Alcohol intake | Binary |
| Physical activity | Binary |
| Presence or absence of cardiovascular disease | Binary |
Features in diabetes dataset.
| Features | Description |
|---|---|
| Pregnancies | Number of times pregnant |
| Glucose | Plasma glucose concentration 2 hours in an oral glucose tolerance test |
| Blood pressure | Diastolic blood pressure (mm Hg) |
| Skin thickness | Triceps skin fold thickness (mm) |
| Insulin | 2-hour serum insulin (mu U/ml) |
| BMI | Body mass index (weight in kg/(height in m)^2) |
| Diabetes pedigree function | Diabetes pedigree function |
| Age | Age (years) |
| Outcome | Class variable (0 or 1) |
Diabetes dataset.
| Pregnancies | Glucose | Blood pressure | Skin thickness | Insulin | BMI | Diabetes pedigree function | Age | Outcome | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 6 | 148 | 72 | 35 | 0 | 33.6 | 0.63 | 50 | 1 |
| 1 | 1 | 85 | 66 | 29 | 0 | 26.6 | 0.35 | 31 | 0 |
| 2 | 8 | 183 | 64 | 0 | 0 | 23.3 | 0.67 | 32 | 1 |
| 3 | 1 | 89 | 66 | 23 | 94 | 28.1 | 0.17 | 21 | 0 |
| 4 | 0 | 137 | 40 | 35 | 168 | 43.1 | 2.29 | 33 | 1 |
Figure 4Heatmap of diabetes dataset.
Figure 5Prediction steps.
Figure 6Accuracy analysis on diabetes dataset.
Figure 7Accuracy analysis on COVID-19 dataset.
Figure 8Accuracy analysis on heart disease dataset.
Figure 9F-measure analysis on diabetes dataset.
Figure 10F-measure analysis on COVID-19 dataset.
Figure 11F-measure analysis on heart disease dataset.