| Literature DB >> 30109157 |
Dahee Chung1, Junggu Choi1, Jong-Hwan Jang1, Tae Young Kim1, JungHyun Byun1, Hojun Park1, Hong-Seok Lim2, Rae Woong Park1,3, Dukyong Yoon1,3.
Abstract
OBJECTIVES: Electrocardiogram (ECG) data are important for the study of cardiovascular disease and adverse drug reactions. Although the development of analytical techniques such as machine learning has improved our ability to extract useful information from ECGs, there is a lack of easily available ECG data for research purposes. We previously published an article on a database of ECG parameters and related clinical data (ECG-ViEW), which we have now updated with additional 12-lead waveform information.Entities:
Keywords: Adverse Drug Reaction; Database; Electrocardiogram; QT Interval; Waveform
Year: 2018 PMID: 30109157 PMCID: PMC6085199 DOI: 10.4258/hir.2018.24.3.242
Source DB: PubMed Journal: Healthc Inform Res ISSN: 2093-3681
Figure 1Example of an electrocardiogram (ECG) report. Alphanumeric values (demographic information, ECG parameter values, and interpretation) are located in the upper part of the report. Waveform data are given as time-series graphs with a grid, covering the middle and lower part of the report. One grid unit (1 mm × 1 mm square) corresponds to 0.1 mV × 0.04 seconds.
Figure 2Data transformation process for the waveform data. (A) Raw data in SVG format contains information regarding the exact position of the data point on the electrocardiogram. (B) The start point of all 12-lead waveforms was set to 0. (C) Because 1 mV in the raw data corresponds to y-axis values of 28, we divided all y-axis values by 28 to adjust the scale of the y-axis to mV. (D) Using linear interpolation (500 Hz), vector image data given as x- and y-coordinates were converted into equidistant time series data. Although the resulting waveform data consist of a series of values without timestamps, the timestamps could be calculated by counting the data points from the starting point because the starting point was provided in the file name or database, and one data point corresponds to 1/500 seconds.
Figure 3Comparison between the raw waveform and converted waveform data. Due to high-density interpolation, there was no significant difference between the raw data (which are based on x- and y-coordinates) and time series data.
Summary of demographic and ECG data covered by ECG-ViEW III (n = 447,445)
Values are presented as mean ± standard deviation or number (%).
ECG: electrocardiogram; ECG-ViEW: Electrocardiogram Vigilance with Electronic data Warehouse.