Hsueh-Chun Lin1, Yu-Chen Kuo2, Meng-Yu Liu3. 1. Department of Health Services Administration, China Medical University, 91 Hsueh-Shih Road, Taichung 40402, Taiwan, ROC. Electronic address: snowlin@mail.cmu.edu.tw. 2. Institute of Information System and Applications, National Tsing Hua University, Hsinchu, Taiwan. 3. Department of Health Services Administration, China Medical University, 91 Hsueh-Shih Road, Taichung 40402, Taiwan, ROC.
Abstract
BACKGROUND AND OBJECTIVE: Many studies regarding health analysis request structured datasets but the legacy resources provide scattered data. This study aims to establish a health informatics transformation model (HITM) based upon intelligent cloud computing with the self-developed analytics modules by open source technique. The model was exemplified by the open data of type 2 diabetes mellitus (DM2) with related cardiovascular diseases. METHODS: The Apache-SPARK framework was employed to generate the infrastructure of the HITM, which enables the machine learning (ML) algorithms including random forest, multi-layer perceptron classifier, support vector machine, and naïve Bayes classifier as well as the regression analysis for intelligent cloud computing. The modeling applied the MIMIC-III open database as an example to design the health informatics data warehouse, which embeds the PL/SQL-based modules to extract the analytical data for the training processes. A coupling analysis flow can drive the ML modules to train the sample data and validate the results. RESULTS: The four modes of cloud computation were compared to evaluate the feasibility of the cloud platform in accordance with its system performance for more than 11,500 datasets. Then, the modeling adaptability was validated by simulating the featured datasets of obesity and cardiovascular-related diseases for patients with DM2 and its complications. The results showed that the run-time efficiency of the platform performed in around one minute and the prediction accuracy of the featured datasets reached 90%. CONCLUSIONS: This study helped contribute the modeling for efficient transformation of health informatics. The HITM can be customized for the actual clinical database, which provides big data for training, with the proper ML modules for a predictable process in the cloud platform. The feedback of intelligent computing can be referred to risk assessment in health promotion.
BACKGROUND AND OBJECTIVE: Many studies regarding health analysis request structured datasets but the legacy resources provide scattered data. This study aims to establish a health informatics transformation model (HITM) based upon intelligent cloud computing with the self-developed analytics modules by open source technique. The model was exemplified by the open data of type 2 diabetes mellitus (DM2) with related cardiovascular diseases. METHODS: The Apache-SPARK framework was employed to generate the infrastructure of the HITM, which enables the machine learning (ML) algorithms including random forest, multi-layer perceptron classifier, support vector machine, and naïve Bayes classifier as well as the regression analysis for intelligent cloud computing. The modeling applied the MIMIC-III open database as an example to design the health informatics data warehouse, which embeds the PL/SQL-based modules to extract the analytical data for the training processes. A coupling analysis flow can drive the ML modules to train the sample data and validate the results. RESULTS: The four modes of cloud computation were compared to evaluate the feasibility of the cloud platform in accordance with its system performance for more than 11,500 datasets. Then, the modeling adaptability was validated by simulating the featured datasets of obesity and cardiovascular-related diseases for patients with DM2 and its complications. The results showed that the run-time efficiency of the platform performed in around one minute and the prediction accuracy of the featured datasets reached 90%. CONCLUSIONS: This study helped contribute the modeling for efficient transformation of health informatics. The HITM can be customized for the actual clinical database, which provides big data for training, with the proper ML modules for a predictable process in the cloud platform. The feedback of intelligent computing can be referred to risk assessment in health promotion.
Authors: Nicholas Jones; Simon de Lusignan; Jienchi Dorward; Rachel Byford; Harshana Liyanage; John Briggs; Filipa Ferreira; Oluwafunmi Akinyemi; Gayatri Amirthalingam; Chris Bates; Jamie Lopez Bernal; Gavin Dabrera; Alex Eavis; Alex J Elliot; Michael Feher; Else Krajenbrink; Uy Hoang; Gary Howsam; Jonathan Leach; Cecilia Okusi; Brian Nicholson; Philip Nieri; Julian Sherlock; Gillian Smith; Mark Thomas; Nicholas Thomas; Manasa Tripathy; William Victor; John Williams; Ian Wood; Maria Zambon; John Parry; Shaun O'Hanlon; Mark Joy; Chris Butler; Martin Marshall; F D Richard Hobbs Journal: JMIR Public Health Surveill Date: 2020-07-02