| Literature DB >> 36006685 |
Xianglong Xu1,2,3, Zhen Yu2,3,4, Zongyuan Ge4, Eric P F Chow1,2,5, Yining Bao3, Jason J Ong1,2,3, Wei Li6, Jinrong Wu7, Christopher K Fairley1,2,3, Lei Zhang1,2,3.
Abstract
BACKGROUND: HIV and sexually transmitted infections (STIs) are major global public health concerns. Over 1 million curable STIs occur every day among people aged 15 years to 49 years worldwide. Insufficient testing or screening substantially impedes the elimination of HIV and STI transmission.Entities:
Keywords: HIV; algorithm; chlamydia; development; gonorrhea; machine learning; model; prediction; predictive; risk; risk assessment; sexual health; sexual transmission; sexually transmitted; sexually transmitted infections; syphilis; validation; web-based
Mesh:
Year: 2022 PMID: 36006685 PMCID: PMC9459839 DOI: 10.2196/37850
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 7.076
Characteristics of clinic consultations in the training and testing data set.
| Variables | HIV (n=88,642 consultations) | Syphilis (n=92,291 consultations) | Gonorrhea (n=97,473 consultations) | Chlamydia (n=115,845 consultations) | |
|
| |||||
|
| Female | 26,651 (30.1) | 27,134 (29.4) | 31,282 (32.1) | 38,548 (33.3) |
|
| Male | 61,991 (69.9) | 65,157 (70.6) | 66,191 (67.9) | 77,297 (66.7) |
| Age at consultation (years), median (IQR) | 29.0 (24.0-35.0) | 29.0 (25.0-35.0) | 28.0 (24.0-35.0) | 28.0 (24.0-34.0) | |
|
| |||||
|
| Australia | 39,148 (44.2) | 40,990 (44.4) | 43,881 (45.0) | 51,162 (44.2) |
|
| Overseas | 46,003 (51.9) | 47,670 (51.7) | 49,835 (51.1) | 60,272 (52.0) |
|
| Missing | 3491 (3.9) | 3631 (3.9) | 3757 (3.9) | 4411 (3.8) |
|
| |||||
|
| No | 56,175 (63.4) | 57,413 (62.2) | 54,595 (56.0) | 68,584 (59.2) |
|
| Yes | 25,067 (28.3) | 27,150 (29.4) | 34,751 (35.7) | 38,930 (33.6) |
|
| Missing | 7383 (8.3) | 7728 (8.4) | 8127 (8.3) | 8331 (7.2) |
|
| |||||
|
| Not applicable (female) | 26,651 (30.1) | 27,134 (29.4) | 31,282 (32.1) | 38,548 (33.3) |
|
| No | 16,508 (18.6) | 17,089 (18.5) | 15,245 (15.6) | 26,975 (23.3) |
|
| Yes | 45,483 (51.3) | 48,068 (52.1) | 50,946 (52.3) | 50,322 (43.4) |
aSTI: sexually transmitted infection.
Figure 1Development of machine learning algorithms. The architecture of the gradient boosting machine was adapted from Feng et al [35]. LASSO: least absolute shrinkage and selection operator.
Figure 2Importance of the top 10 predictors in the prediction of HIV or sexually transmission infections (STIs) using a gradient boosting machine, for detecting (A) HIV, (B) syphilis, (C) gonorrhea, and (D) chlamydia.
Figure 3Receiver operating characteristic curve performance of the HIV and sexually transmitted infection (STI) risk prediction tool on (A) testing data analysis from 2015-2018, (B) external data validation analysis from 2019, and (C) external data validation analysis from 2020-2021. AUC: area under the curve.
Figure 4Graphical user interface elements of the HIV and sexually transmitted infection (STI) risk prediction tool, called MySTIRisk. A prototype version of the tool is available at [48]. Machine learning algorithms are used to predict a person’s risk of chlamydia, gonorrhea, syphilis, and HIV.