Literature DB >> 31984341

MAV-clic: management, analysis, and visualization of clinical data.

Zeeshan Ahmed1, Minjung Kim2, Bruce T Liang3.   

Abstract

OBJECTIVES: Develop a multifunctional analytics platform for efficient management and analysis of healthcare data.
MATERIALS AND METHODS: Management, Analysis, and Visualization of Clinical Data (MAV-clic) is a Health Insurance Portability and Accountability Act of 1996 (HIPAA)-compliant framework based on the Butterfly Model. MAV-clic extracts, cleanses, and encrypts data then restructures and aggregates data in a deidentified format. A graphical user interface allows query, analysis, and visualization of clinical data.
RESULTS: MAV-clic manages healthcare data for over 800 000 subjects at UConn Health. Three analytic capabilities of MAV-clic include: creating cohorts based on specific criteria; performing measurement analysis of subjects with a specific diagnosis and medication; and calculating measure outcomes of subjects over time. DISCUSSION: MAV-clic supports clinicians and healthcare analysts by efficiently stratifying subjects to understand specific scenarios and optimize decision making.
CONCLUSION: MAV-clic is founded on the scientific premise that to improve the quality and transition of healthcare, integrative platforms are necessary to analyze heterogeneous clinical, epidemiological, metabolomics, proteomics, and genomics data for precision medicine.
© The Author(s) 2018. Published by Oxford University Press on behalf of the American Medical Informatics Association.

Entities:  

Keywords:  HIPAA; analysis; data mining; database; healthcare; management

Year:  2018        PMID: 31984341      PMCID: PMC6951942          DOI: 10.1093/jamiaopen/ooy052

Source DB:  PubMed          Journal:  JAMIA Open        ISSN: 2574-2531


INTRODUCTION

Healthcare data include information about patients’ lifestyles, medical histories, visits to the practices, lab tests, imaging tests, diagnoses, medications, surgical procedures, metabolomics and genomics profiles, consulted providers and claims. Healthcare analytics has the potential to revolutionize the field of medicine by improving the quality and transition of care, improving outcomes by reducing costs, detecting diseases at earlier stages, developing a better understanding of biological mechanisms, and modeling complex biological interactions through integration and analysis of data with holistic approach. The ability to stratify patients, understand scenarios, and optimize decision-making would consistently improve based on the myriad data obtained during the care-delivery process. Healthcare data analytics begins with the process of database development: this includes data collection, preparation (extraction, cleansing, discretization, validation, integration, and transformation), modeling, validation, and resulting in the creation of patient knowledge bases. To effectively implement analytic processes, various big data management challenges must be overcome. These include the inadequacy of clinical data; existence of multiple data standards, structures and types; rapid growth in heterogeneous data; understanding of analysis algorithms for clinical data interpretation, exploration, and drawing inference; unavailability of effective open source tools that combine various approaches to model biological interactions; integration of clinical and analytic systems; interdisciplinary field barriers; high cost; and implementation of secure frameworks for the healthcare data collection, simplification, synchronizations, raw to knowledge conversion management, analysis, reporting etc. Establishment of healthcare data analytics system can help achieve goals for better care and health of populations at lower costs with better work–life balance for clinicians and staff. However, it is not forthright, as significant efforts are required from experts in various disciplines (eg data science, biomedical informatics, bioinformatics, biostatistics, genomics, metabolomics, clinical science, etc.) and from within multiple organizational units (eg hospitals, research and bench/wet laboratories, insurance companies, information technology providers, data security, etc.). Another challenge is to establish an efficient and secure workflow that can connect all organizational units to streamline transparent data flow and sharing. In past decades, various systems have been developed both in academic (eg dRiskKB, PhenoPredict, MeSHDD, 3D-MICE, CRISP, TCMRs, PhOSCo, OpenEMR, GNU Health, ClearHealth, OpenClinica, openCDMS, TrialDB, OpenMRS, FreeMED), and commercial (eg NextGen, Epic, Cerner, GE Healthcare, eClinicalWorks, athenahealth, McKesson, Allscripts, Care360, Practice Fusion, Meditech, Greenway Health, etc.) sectors. Academic systems put significant value on analytics; while commercial systems focus on supporting clinical operations. However, both commercial and academic sectors are unable to identify problems by their effects, and significantly help in clinical decision-making, healthcare process implementation, and cost reduction. Promoting significant medical transformation in public health, here, we present a new clinical operation and research based scientific platform, named: Management, Analysis, and Visualization of Clinical Data (MAV-clic). It is based on the vision of allowing research for innovation and sustainability to solve public health problems and challenges, while focusing on high quality research.

OBJECTIVES

Innovative healthcare data analytic platforms are necessary to improve the quality and transition of healthcare by analyzing heterogeneous healthcare data of huge volume, velocity, variety, and veracity, and to obtain actionable care gap-based information about patients, developing communication and co-ordination across healthcare units, providers, nurses, quality inspectors, researchers, analysts, and administration. The overall concept of this research was to help support and implement a new healthcare data analytic and research process that can connect people from different backgrounds and specialists with electronic health records (EHR) to facilitate analytical queries and informative outputs. Another goal was to design and implement a proficient Extract-Transform-Load (ETL) strategy to derive meaningful information across different EHR systems. With vast differences across EHRs, the goal is to unite them on a single platform. The ability to deliver these analytics services are increasingly compromised by tight fiscal conditions, therefore, we aimed to build the capacity of our institution to innovate and invent solutions to complex and previously intractable healthcare problems at an affordable price.

METHODS

MAV-clic is Health Insurance Portability and Accountability Act of 1996 (HIPAA) compliant platform, which implements healthcare data analytic processes. Its product line architecture (Figure 1) is based on the Butterfly model and developed using JAVA programming language, where all major modules are capable of performing individual key roles and can integrate with each other. The data oriented functions of MAV-clic are represented by different components: Store, Structure, Secure, Handle, Process, Track, Visualize, and Search. It implements healthcare and users data security, which includes: Application and Data Criticality; Risk Management and Analysis; Information System Activity Review; Contingency Plan; Device and Media Controls; and Access Controls. The user-friendly graphical interface of MAV-clic offers multi-role-based operation, which is divided in to six different modules: Main, Users, Analysis, Measures, Databases, and ETL (details are provided in attached Supplementary Material S1).
Figure 1.

MAV-clic HIPAA-compliant product line architecture for clinical data handling, extraction, cleansing, transfer, load, store, structure, standardization, management, processing, analysis, quality assessment, visualization, security, tracking, searching, and reporting.

MAV-clic HIPAA-compliant product line architecture for clinical data handling, extraction, cleansing, transfer, load, store, structure, standardization, management, processing, analysis, quality assessment, visualization, security, tracking, searching, and reporting. One of the most difficult and complex tasks of implementing healthcare data analytics, is the extraction of data from multiple databases, as each database has large volume and a complex structure that includes different numbers of relations attributed with different numbers of fields and data types. MAV-clic interfaces multiple databases for a streamline data flow with secured and controlled accessibility, transparency, and full audit trail. It provides a dynamic data classification module to organize all integrated databases, which is capable of automatically understanding the structure of a source databases, removing data abnormalities, performing data simplification, and efficiently establishing direct sequential and parallel High Performance Computing (HPC) -based data transfer (Figure 2). The data are structured (normalized relational schemas, standardized naming conventions) and secured in terms of the encryption, archiving, backup, event logging, limited access, and password management in identified and deidentified formats. In order to turn these data into actionable intelligence, MAV-clic provides a holistic view of the data by applying our proposed approach to consolidate healthcare data into a secure data warehouse to perform big data analytics.
Figure 2.

MAV-clic big data workflow, handling input from multiple data sources, dynamic transfer of data using high-performance computing, partitioning of extracted heterogeneous data into different databases, and sharing of restructured and aggregated data.

MAV-clic big data workflow, handling input from multiple data sources, dynamic transfer of data using high-performance computing, partitioning of extracted heterogeneous data into different databases, and sharing of restructured and aggregated data. MAV-clic consists of three main modules: Cohort building, Data analysis, and Measurement analysis (Figure 3A). In the cohort building module, multiple features are offered to build groups and ontologies based on patients’ personal details (eg gender, age, marital status, language, race, religion, smoking, and other habits), regional information (eg zip code, street, city, county, state, country, etc.), and medical history. Once the cohort is built, care providers can analyze their patients’ data based on customized date and time, visit to practices, diagnoses, lab tests, prescribed medications, surgical procedures, and consulted providers in the data analysis module. It is capable of tracking one specific patient in the selected cohort as well as the cohort itself. In the measurement analysis module, customized functions are offered to report electronic clinical quality measures (eCQMs) that the Center for Medicare and Medicaid Services (CMS) has proposed to help uncover insights from patient data that demonstrate value in evidence-based medicine, such as improving outcomes and reducing costs (details are provided in attached Supplementary Material S2). The confluence of MAV-clic modules can create space for a new era of open data and discovery in public healthcare by making the most of the new opportunities, utilizing resources maximally and sensibly, and identifying our capabilities to build capacity.
Figure 3.

(A) MAV-clic features for cohort building and ontology making for patient data analysis using CMS proposed and customized measures. (B) Cohort building example: patients who visited UConn Health within the last 7 days for any diagnosis, any medication, and with any provider. (C) Measurement analysis example: patients visited a specific doctor within the last 365 days for particular diagnoses and lab tests. (D) Calculated measure outcome example: customized measurement analysis of patients diagnosed for diabetes, sinusitis, and aged over 65 years.

(A) MAV-clic features for cohort building and ontology making for patient data analysis using CMS proposed and customized measures. (B) Cohort building example: patients who visited UConn Health within the last 7 days for any diagnosis, any medication, and with any provider. (C) Measurement analysis example: patients visited a specific doctor within the last 365 days for particular diagnoses and lab tests. (D) Calculated measure outcome example: customized measurement analysis of patients diagnosed for diabetes, sinusitis, and aged over 65 years.

RESULTS

MAV-clic has developed for healthcare data analytics and research at UConn Health, fulfilling the requirements of data owners and users in implementing the health information system. It efficiently extracts data from the NextGen (deployed EHR system) having over 7000 relations and other inhouse databases, then cleanses and stores the extracted data in Microsoft SQL and MySQL data clusters. It is managing dataset of over 800 000 subjects, which includes demographics, medical visits, diagnoses, lab tests, prescribed medications, and procedures. Furthermore, it includes detailed information about all associated providers to practices at UConn Health, diagnosis (International Classification of Diseases) and medications (National Drug Codes) codes. In this manuscript, we present three examples to help potential users better understand the analytic capabilities of the MAV-clic system. Example 1 (Figure 3B) explains the cohort building of subjects, who visited UConn Health within the last 7 days for any diagnosis, medication, and treated by any provider. Aggregated, approximate results show that over 23 000 subjects were entered and updated in the system. Out of those, over 16 000 physically appeared and were diagnosed with over 18 000 diagnoses and 14 000 medications were prescribed. Example 2 (Figure 3C) presents a measurement analysis, which includes the information about subjects’ visits to a specific doctor within the last 365 days for particular diagnoses and lab tests. The selected cohort of subjects are used as an input to evaluate the quality measures of “High Risk Elderly Patients,” “Blood Urea Nitrogen,” “Diabetes HbA1c > 9%,” and “Adults Sinusitis,” Aggregated results show that over 1000 subjects have visited a selected doctor during last 365 days and were prescribed over 9000 medications for over 5000 diagnoses. The current version of MAV-clic supports 3 eCQMs (CMS156v6, CMS122v5, and PQRS331) for measurement analysis. Example 3 (Figure 3D) shows a calculated measure outcome for each, which includes 3 aggregated cases reported in a deidentified form. Each report includes subjects (newly entered or pre-existing) who have visited UConn Health within the last 30 days and satisfy the following clinical conditions: Diagnosed with “diabetes” with ICD-9 Codes E13, E11, E10, O24 and/or related codes (diabetes) with Hemoglobin A1C Tested: 1) with Hemoglobin A1C Tested ≥ 9, 2) with Hemoglobin A1C Tested ≥ 9 && ≤ 7, and 3) with Hemoglobin A1C Tested ≤ 7. Diagnosed for “sinusitis” and related codes: 1) prescribed medicine within last days ≥ 10, 2) prescribed medicine within last days ≥ 5, and 3) prescribed med with last days ≥ 2. Elder Age ≥ 65 and: 1) prescribed med quantity ≥2, 2) prescribed med quantity ≥4, and 3) prescribed med quantity ≥8. MAV-clic reports include information about customized eCQMs, denominators, numerators, and percentages drawn in chronological line charts (Figure 3D). These reports can be exported and shared in different file formats, including PDF and CSV (details are provided in attached Supplementary Material S3).

DISCUSSION

To facilitate and improve public sector clinical research and practice, there is a critical need for pure academic frameworks that can connect operational and analytical systems in a way that experts from multiple domains can perform measurement and descriptive analysis. MAV-clic offers numerous conceptual and technological innovations as it addresses a major gap in the field by establishing a state-of-the-art application. It is a new platform at UConn Health, supporting comparative research to determine clinically relevant quality improvements and to evaluate cost-effectiveness in the healthcare system. It is a multidatabase management system, capable of handling different levels and types of healthcare data. Programmed analytic processes in MAV-clic help build cohorts by collecting patients’ demographics and medical histories, which can facilitate application of different quality measures, visualization of patterns, and reporting of summaries in an automated and timely manner. The well-organized data management features in MAV-clic allow users to analyze complex and disparate healthcare data. Its potential benefits include: best strategies to diagnose and treat patients especially at risk for medical complications; patient recruitment candidates matching to the treatments; analyzing clinical trials and patient records to identify follow-on indications; analyzing large volumes of clinical data to predict and prevent crisis; and analyzing patient profiles and medical history to perform proactive care. All the combinations from thousands of diagnosis codes, millions of medications, and dozens of lab tests create vast numbers of scenarios that can now be efficiently managed and analyzed by MAV-clic.

CONCLUSION

MAV-clic is a pure academic application to facilitate and improve public sector clinical research and practice. It is based on the vision of allowing research for innovation and sustainability to solve public health problems and challenges, while focusing on high quality research. It has the potential for not only solving many complex healthcare data oriented problems of implementing secure networking, management, pervasive computing, advanced analytics, process modeling, data representation, integrity, privacy, reliability, and exchange but establishing collaborative research environment that can lead to new fundamental insights and advancements in healthcare by analyzing original as well as aggregated healthcare data.

SUPPLEMENTARY MATERIAL

Supplementary material is available at Journal of the American Medical Informatics Association online.

CONTRIBUTORS

ZA perceived the idea and did all work on the software and infrastructure design and implementation and related aspects of MAV-clic. ZA and MK did analysis and performance evaluation of MAV-clic. BL guided study. ZA drafted manuscript and all authors participated in writing and review. Click here for additional data file.
  25 in total

1.  Navigating the road to implementation of the Health Insurance Portability and Accountability Act.

Authors:  Shiloh Turner; Serena Foong
Journal:  Am J Public Health       Date:  2003-11       Impact factor: 9.308

2.  On educating about medical data management - the other side of the electronic health record.

Authors:  R Haux; P Knaup; F Leiner
Journal:  Methods Inf Med       Date:  2007       Impact factor: 2.176

3.  Development of a HIPAA-compliant environment for translational research data and analytics.

Authors:  Wayne Bradford; John F Hurdle; Bernie LaSalle; Julio C Facelli
Journal:  J Am Med Inform Assoc       Date:  2013-08-02       Impact factor: 4.497

4.  Sharing clinical research data in the United States under the Health Insurance Portability and Accountability Act and the Privacy Rule.

Authors:  James D Miller
Journal:  Trials       Date:  2010-11-19       Impact factor: 2.279

Review 5.  Computational solutions for omics data.

Authors:  Bonnie Berger; Jian Peng; Mona Singh
Journal:  Nat Rev Genet       Date:  2013-05       Impact factor: 53.242

6.  Problems, challenges and promises: perspectives on precision medicine.

Authors:  David J Duffy
Journal:  Brief Bioinform       Date:  2015-08-05       Impact factor: 11.622

Review 7.  Big data analytics in healthcare: promise and potential.

Authors:  Wullianallur Raghupathi; Viju Raghupathi
Journal:  Health Inf Sci Syst       Date:  2014-02-07

8.  The CRISP colorectal cancer risk prediction tool: an exploratory study using simulated consultations in Australian primary care.

Authors:  Jennifer G Walker; Adrian Bickerstaffe; Nadira Hewabandu; Sanjay Maddumarachchi; James G Dowty; Mark Jenkins; Marie Pirotta; Fiona M Walter; Jon D Emery
Journal:  BMC Med Inform Decis Mak       Date:  2017-01-19       Impact factor: 2.796

9.  Criteria for the use of omics-based predictors in clinical trials.

Authors:  Lisa M McShane; Margaret M Cavenagh; Tracy G Lively; David A Eberhard; William L Bigbee; P Mickey Williams; Jill P Mesirov; Mei-Yin C Polley; Kelly Y Kim; James V Tricoli; Jeremy M G Taylor; Deborah J Shuman; Richard M Simon; James H Doroshow; Barbara A Conley
Journal:  Nature       Date:  2013-10-17       Impact factor: 49.962

10.  MeSHDD: Literature-based drug-drug similarity for drug repositioning.

Authors:  Adam S Brown; Chirag J Patel
Journal:  J Am Med Inform Assoc       Date:  2017-05-01       Impact factor: 4.497

View more
  6 in total

Review 1.  Applications of artificial intelligence multiomics in precision oncology.

Authors:  Ruby Srivastava
Journal:  J Cancer Res Clin Oncol       Date:  2022-07-07       Impact factor: 4.553

Review 2.  Artificial Intelligence, Healthcare, Clinical Genomics, and Pharmacogenomics Approaches in Precision Medicine.

Authors:  Habiba Abdelhalim; Asude Berber; Mudassir Lodi; Rihi Jain; Achuth Nair; Anirudh Pappu; Kush Patel; Vignesh Venkat; Cynthia Venkatesan; Raghu Wable; Matthew Dinatale; Allyson Fu; Vikram Iyer; Ishan Kalove; Marc Kleyman; Joseph Koutsoutis; David Menna; Mayank Paliwal; Nishi Patel; Thirth Patel; Zara Rafique; Rothela Samadi; Roshan Varadhan; Shreyas Bolla; Sreya Vadapalli; Zeeshan Ahmed
Journal:  Front Genet       Date:  2022-07-06       Impact factor: 4.772

3.  ICU Cockpit: a platform for collecting multimodal waveform data, AI-based computational disease modeling and real-time decision support in the intensive care unit.

Authors:  Jens Michael Boss; Gagan Narula; Christian Straessle; Jan Willms; Jan Azzati; Dominique Brodbeck; Rahel Luethy; Susanne Suter; Christof Buehler; Carl Muroi; David Jule Mack; Marko Seric; Daniel Baumann; Emanuela Keller
Journal:  J Am Med Inform Assoc       Date:  2022-06-14       Impact factor: 7.942

4.  RNA-seq driven expression and enrichment analysis to investigate CVD genes with associated phenotypes among high-risk heart failure patients.

Authors:  Zeeshan Ahmed; Saman Zeeshan; Bruce T Liang
Journal:  Hum Genomics       Date:  2021-11-13       Impact factor: 4.639

5.  Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine.

Authors:  Zeeshan Ahmed; Khalid Mohamed; Saman Zeeshan; XinQi Dong
Journal:  Database (Oxford)       Date:  2020-01-01       Impact factor: 3.451

6.  Practicing precision medicine with intelligently integrative clinical and multi-omics data analysis.

Authors:  Zeeshan Ahmed
Journal:  Hum Genomics       Date:  2020-10-02       Impact factor: 4.639

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.