| Literature DB >> 35874460 |
Kavishwar B Wagholikar1,2, David Zelle3, Layne Ainsworth4, Kira Chaney3, Alexander J Blood1,3, Angela Miller2, Rupendra Chulyadyo4, Michael Oates4, William J Gordon1,3,4, Samuel J Aronson4, Benjamin M Scirica1,3, Shawn N Murphy1,2.
Abstract
Analysis of health data typically requires development of queries using structured query language (SQL) by a data-analyst. As the SQL queries are manually created, they are prone to errors. In addition, accurate implementation of the queries depends on effective communication with clinical experts, that further makes the analysis error prone. As a potential resolution, we explore an alternative approach wherein a graphical interface that automatically generates the SQL queries is used to perform the analysis. The latter allows clinical experts to directly perform complex queries on the data, despite their unfamiliarity with SQL syntax. The interface provides an intuitive understanding of the query logic which makes the analysis transparent and comprehensible to the clinical study-staff, thereby enhancing the transparency and validity of the analysis. This study demonstrates the feasibility of using a user-friendly interface that automatically generate SQL for analysis of health data. It outlines challenges that will be useful for designing user-friendly tools to improve transparency and reproducibility of data analysis.Entities:
Keywords: Databases; Graphical user-interface; Reproducibility of analysis; Structured query language; Validity of analysis
Year: 2022 PMID: 35874460 PMCID: PMC9306316 DOI: 10.1016/j.imu.2022.100996
Source DB: PubMed Journal: Inform Med Unlocked ISSN: 2352-9148
Fig. 1.In the conventional approach the data-analyst developed a SQL query to generate the report, while in the proposed auto-SQL approach the data-analyst first denormalized the study database and then the domain expert used the i2b2-webclient graphical-user-interface that automatically generated the SQL for performing the analysis.
Reasons for exit from the Lipid therapy optimization program.
| Exit category | Description |
|---|---|
| Dropped out | Patient dropped out of the study before their therapy could be optimized |
| Entered maintenance | Patient successfully completed the program |
| MD-handoff | Patient was referred to their primary cardiologist for management |
| MD rejected program | Patient’s care provider did not consent to including the patient in the optimization program |
| Unreachable | Patient could not be contacted by the study team. |
Analytical report generated by conventional approach.
| Milestones | Not titrated before exit | Titrated before exit | Row total |
|---|---|---|---|
| Dropped out | 285 | 107 | 392 |
| Entered maintenance | 704 | 876 | 1580 |
| MD-handoff | 59 | 60 | 119 |
| MD rejected program | 18 | 5 | 23 |
| Unreachable | 718 | 278 | 996 |
| Total | 1784 | 1326 | 3110 |
Fig. 2.Graphical query interface from the i2b2 platform. The criteria for querying can be easily constructed by dragging terms from the hierarchical tree structure on the left to the widgets on the right. The SQL query is automatically generated in the back-end, which enables clinical staff that are not familiar with SQL to perform complex queries on the data.