| Literature DB >> 32871492 |
Abolfazl Mollalo1, Behrooz Vahedi2, Shreejana Bhattarai3, Laura C Hopkins4, Swagata Banik5, Behzad Vahedi6.
Abstract
OBJECTIVE: Although lower respiratory infections (LRI) are among the leading causes of mortality in the US, their association with underlying factors and geographic variation have not been adequately examined.Entities:
Keywords: Accuracy assessment; Decision trees; GIS; Hotspots; Lower respiratory infections; US
Mesh:
Year: 2020 PMID: 32871492 PMCID: PMC7442929 DOI: 10.1016/j.ijmedinf.2020.104248
Source DB: PubMed Journal: Int J Med Inform ISSN: 1386-5056 Impact factor: 4.046
Fig. 1Principle of linearly separable SVM using maximum margin.
Fig. 2A non-linear boundary in the input space (left) and a maximum margin hyperplane in feature space (right).
Results of the global Moran’s I and General G statistic of age-adjusted LRI mortality rates, continental US, 1980-2014.
| Year | Index | Z-score | Type of distribution | P-value | ||
|---|---|---|---|---|---|---|
| Moran’s | General | Moran’s | General | |||
| 1980 | 0.38 | 0.0019 | 36.31 | 8.27 | Clustered | ∼ 0 |
| 1985 | 0.36 | 0.0019 | 34.59 | 8.40 | Clustered | ∼ 0 |
| 1990 | 0.37 | 0.0019 | 35.04 | 9.57 | Clustered | ∼ 0 |
| 1995 | 0.41 | 0.0018 | 39.50 | 12.10 | Clustered | ∼ 0 |
| 2000 | 0.49 | 0.0018 | 47.00 | 15.50 | Clustered | ∼ 0 |
| 2005 | 0.53 | 0.0018 | 51.06 | 18.81 | Clustered | ∼ 0 |
| 2010 | 0.58 | 0.0018 | 55.79 | 22.24 | Clustered | ∼ 0 |
| 2014 | 0.61 | 0.0018 | 58.35 | 24.68 | Clustered | ∼ 0 |
Fig. 3Location of hotspots of LRI mortality rates in the continental US using Getis-Ord Gi* hotspot detection technique, 1980-2014.
Fig. 4Location of counties that were persistently identified as hotspots of LRI mortality rates by Getis-Ord Gi* hotspot detection technique, 1980-2014.
Evaluation metrics associated with each of the employed machine learning classifiers.
| Accuracy | Precision | Recall | F1-Score | ROC AUC | PR AUC | FPR | |
|---|---|---|---|---|---|---|---|
| Classifier | |||||||
| LR | 0.84 | 0.75 | 0.87 | 0.78 | 0.86 | 0.72 | 0.17 |
| RF | 0.92 | 0.87 | 0.82 | 0.84 | 0.82 | 0.83 | 0.03 |
| GBDT | 0.92 | 0.87 | 0.83 | 0.85 | 0.83 | 0.84 | 0.04 |
| KNN | 0.90 | 0.84 | 0.8 | 0.82 | 0.8 | 0.82 | 0.05 |
| SVM | 0.91 | 0.83 | 0.86 | 0.84 | 0.86 | 0.82 | 0.07 |
Fig. 5Results of the precision-recall curve for employed machine learning classifiers. The orange dash line annotates the average precision.
Fig. 6Relative variable importance analysis using the gradient boosting and random forest decision trees. A detailed description of x-axis codes is provided in Supplementary Material.