Literature DB >> 28800599

Extracting information from free-text electronic patient records to identify practice-based evidence of the performance of coronary stents.

Yoon Seob Kim1, Dukyong Yoon1,2, JungHyun Byun1, Hojun Park1, Ahram Lee1, Il Hyun Kim1, Sukhoon Lee3, Hong-Seok Lim4, Rae Woong Park1,2.   

Abstract

BACKGROUND AND
OBJECTIVE: Percutaneous coronary intervention (PCI) using drug-eluting stents (DES) is an indispensable treatment for coronary artery disease. However, to evaluate the performance of various types of stents for PCI, numerous resources are required. We extracted clinical information from free-text records and, using practice-based evidence, compared the efficacy of various DES.
MATERIALS AND METHODS: We developed a text mining tool based on regular expression and applied it to PCI reports stored in the electronic health records (EHRs) of Ajou University Hospital from 2010-2014. The PCI data were extracted from EHRs with a sensitivity of 0.996, a specificity of 1.000, and an F-measure of 0.995 when compared with a sample of 200 reports. Using these data, we compared the performance of stents by Kaplan-Meier analysis and the Cox hazard proportional regression.
RESULTS: In the self-validation analysis comparing the first-generation to the second-generation DES, the second-generation DES was superior to the first-generation DES (hazard ratio [HR]: 0.423, 95% confidence interval [CI]: 0.284-0.630) in terms of target vessel revascularization (TVR), showing similar findings to the established results of previous studies. Among the second-generation DES, the biodegradable-polymer DES tended to be superior, with a risk of TVR (HR: 0.568, 95% CI: 0.281-1.147) falling below than that for the durable-polymer DES approximately 1 year after the index procedure. The Endeavor stent had the highest TVR risk among the newer generation DES (HR: 2.576, 95% CI: 1.273-5.210).
CONCLUSIONS: In this study, we demonstrated how to construct a PCI data warehouse of PCI-related parameters obtained from free-text electronic records with high accuracy for use in the post surveillance of coronary stents in a time- and cost effective manner. Post surveillance of the practice based evidence in the PCI data warehouse indicated that the biodegradable-polymer DES might have a lower risk of TVR than the durable-polymer DES.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28800599      PMCID: PMC5553787          DOI: 10.1371/journal.pone.0182889

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Percutaneous coronary intervention (PCI) is an indispensable method for the treatment of coronary artery disease [1]. In 2010, it was estimated that 492,000 patients underwent PCI procedures in the United States, of which approximately 454,000 were PCI procedures using stents [2]. Because of the high incidence of PCI and stent use, even a small difference in the performance of stents could result in an enormous difference in patient outcome. With the rapid development of stent technology aimed at lowering the rate of stent failure, not enough is known about the outcomes of new products. Therefore, constant and rapid surveillance of the performance of emerging stents, and comparisons among various types of stents, is required, but doing so remains difficult. The earliest form of coronary angioplasty involved only balloon catheters, while coronary stents were first introduced in 1989 to prevent restenosis after angioplasty by maintaining lumen integrity [1, 3]. However, in-stent restenosis and the need for frequent revascularization arose as bare metal stents (BMS) began to be used [4]. The first-generation drug eluting stents (DES), which interfere with cell proliferation, were thus developed, further lowering the risk of restenosis [5]. However, these stents were shown to increase the risk of late stent thrombosis and delay arterial healing due to their interfering with the cellular environment [6, 7]. Moreover, the structure of the first-generation Sirolimus-eluting stents was also vulnerable to mechanical complications such as stent fracture [8-10]. Subsequently, second-generation DES were designed to decrease the risk of late target lesion revascularization, improving the reliability of DES [11, 12]. Biodegradable polymer-coated stents were developed to improve long-term outcome by providing temporary function and stepping away of the polymers [13]. It has been shown that biodegradable polymers improve the safety and efficacy of second-generation stents compared to first-generation DES [14-16]. However, further studies are needed to address the differences between the newer generation durable polymer DES (DP-DES) and biodegradable polymer DES (BP-DES) [17]. After clinical adoption of the first-generation DES (which had shortcomings), technological advances have included improvements in materials and design, the employed polymer, and the eluted drugs, rendering adverse clinical events very rare. Therefore, it is very important, but has become increasingly difficult, to evaluate and compare performance among stents [17]. A decrease in the incidence of major adverse cardiac events (MACE) has made it difficult to compare the performance of various stents with clear statistical validity. Also, techniques for revascularization are undergoing rapid evolution, which render many studies obsolete or difficult to interpret [18]. This problem derives mainly from the fact that traditional research methods, such as prospective cohort studies, require vast resources. This limitation warrants a new research method that carries less economic burden and is less time-consuming. Thus, a retrospective cohort study using information from electronic health records (EHRs) was considered, as records that have been originally stored for practice purposes can also be used for research. However, using EHRs poses a barrier as coronary angiography (CAG) reports are written in free-text; free-text files cannot be directly analyzed using existing tools, and in the present study, CAG reports were too numerous to review manually. We overcame this obstacle by developing a method of analysis using regular expression. Regular expression refers to a formal programming language for specifying strings, and is used practically to identify particular strings in texts [19]. Using this method, we extracted variables related to PCI from CAG reports originally stored in EHRs. Patient characteristics, such as age, sex, and a history of diabetes mellitus and hypertension, were also processed from the EHRs. The data were used to build a PCI data warehouse for statistical analyses. Using free-text CAG reports from the Ajou University Hospital between February 2010 and October 2014, we extracted information and conducted a retrospective cohort study of a real-world population that had undergone PCI. To validate our data, we examined and compared the clinical performance of different stents and analyzed the hazard ratios (HR) of target vessel revascularization (TVR) between first- and second-generation DES. Following validation of our program, we explored the efficacy and safety of BP-DES versus DP-DES with the risk of TVR as the primary endpoint.

Materials and methods

Ethics statement

This study was approved and informed consent was waived by the Institutional Review Board of Ajou University Hospital because the anonymized data was analyzed retrospectively.

Data source

Patients who underwent PCI procedures were identified from the EHRs of Ajou University Hospital. A total of 13,567 coronary reports (CAG reports with or without descriptions of PCI procedures performed after diagnostic CAG) written between February 2010 and October 2014 were analyzed by regular expression. Patient characteristics including clinical and demographic information, drug prescriptions, and diagnoses were also extracted from EHRs.

Data processing of coronary angiography reports

We extracted information from the free-text PCI reports and loaded it into a data warehouse; we applied regular expression using the R programming language to complete this task. The ‘stringr’ package developed by Hadley Wickham was used primarily for extracting patterns [20]. A table of pre-defined vessel and stent terms was created, taking advantage of the fact that PCI reports had recurring structures around these words (e.g., newline, colon, slash). The terms in their presumed locations were extracted, manually reviewed by a cardiologist, and categorized (Table 1). The various forms of the terms on the list were then used for extraction.
Table 1

Vessel and stent terms used in PCI reports and their categories in this study.

Vessel/stent categoriesVessel/stent terms used in PCI reports
VessselLADLAD, Di, D1, D2, diagonal/Dx
LCxLCx, OM, RI/Ramus
LMLM, LM-LAD, LM-LCx
RCARCA, PDA, PLV, PLB
StentBMSVision, Genoss, Coroflex Blue, Zeta
DESFirst generationCypher, Coroflex Please, Taxus
Second generationDurable polymerResolute family, Promus family, Xience family, Endeavor
Biodegradable polymerBiomatrix, Desyne, Nobori, Orsiro

LAD, left anterior descending artery; LCx, left circumflex; LM, left main; RCA, right coronary artery; BMS, bare metal stent; DES, drug-eluting stent; D1, the first diagonal artery; D2, the second diagonal artery; Dx, diagonal artery; OM, obtuse marginal; PDA, posterior descending artery; PLV, posterior left ventricular branch; PLB, posterior lateral branch.

LAD, left anterior descending artery; LCx, left circumflex; LM, left main; RCA, right coronary artery; BMS, bare metal stent; DES, drug-eluting stent; D1, the first diagonal artery; D2, the second diagonal artery; Dx, diagonal artery; OM, obtuse marginal; PDA, posterior descending artery; PLV, posterior left ventricular branch; PLB, posterior lateral branch. The overall process is shown in Fig 1. First, we extracted all words between newline (\n) and colon (Step 1). These words mainly referred to the target vessels in the procedure. We then compared the extracted words with the pre-defined vessel categories shown in Table 1 (Step 2). After identifying the target vessels, we went on to find the names, diameters, and lengths of the stents, and ascertained whether a non-compliant adjuvant balloon was applied (Step 3). The stent names were extracted by matching them with pre-defined stent names (Step 4), and the diameters and lengths of the stents were captured by extracting two numbers followed by “mm”, which were separated by “/” (Step 5). If any of these three variables (stent name, diameter, and length) were missing, it was determined that there was no record of inserting a stent in the corresponding vessel. Finally, we identified whether a non-compliant high-pressure balloon was applied by checking whether “HP” or the name of a non-compliant high-pressure balloon was written in the report (Step 6). Script samples and a CAG report are shown in S1 File.
Fig 1

The percutaneous coronary intervention (PCI)-related parameter information extraction process from free-text PCI procedure reports.

The scripts for data extraction were first developed by three individual programmers. Upon completion, their results were compared side-by-side, and the scripts underwent two cycles of revision for higher conformity. Subsequently, the matching parts of the three tables were used for the PCI data warehouse. The parts that did not reach consensus were filled in through manual review of the original reports. The completed PCI data warehouse itself was validated by manual review. To evaluate the degree of agreement among the three different programmers and between information retrieved by each script and the data filled out manually from 200 randomly chosen samples (the gold standard), we compared the list of pairs that consisted of the target vessel and the stent categories inserted into the target vessel. The kappa values for the primary, secondary, and tertiary pairwise comparisons of the extracted results between the three programmers were 0.659, 0.974, and 0.986, respectively. As shown in Table 2, the mean script sensitivities, specificities, and F-measures approached those of the gold standard after two cycles of revision. All of the comparisons were performed after all revisions were complete, eliminating the possibility that the revisions were biased.
Table 2

Mean sensitivities, specificities, and F-measures for each iterative round.

RoundMean SensitivityMean SpecificityMean F-measure
Primary0.7410.9990.864
Secondary0.9601.0000.959
Tertiary0.9911.0000.991

Study covariates and outcome variables

The outcome variable for this study was TVR within 2 years after index PCI with stent implantation. TVR was defined as cases where patients had repeated PCI procedures on a target vessel. The patient characteristic covariates included age, sex, hypertension, type 2 diabetes mellitus, and the duration of dual antiplatelet therapy (DAPT). The procedural characteristic covariates included the total length and average diameter of the implanted stent(s) in each target vessel, the approach site (radial artery or femoral artery), and whether the PCI was emergent. Patients with type 2 diabetes were extracted according to the process defined by Kho et al. [21]. Hypertension was defined as either the assignment of the ICD-10 diagnostic code I10 or prescription of at least one type of antihypertensive drug.

Statistical analysis

First, an analysis for data warehouse validation was performed to confirm that the PCI data warehouse reflected established real-world findings. We compared the risk of TVR between first- and second-generation DES using Kaplan-Meier analysis and the log-rank test, and all study covariates were adjusted using the Cox proportional hazard regression model. We used the PCI data warehouse to analyze the differences between DP-DES and BP-DES using the same statistical methods as described above. Within-class comparisons of stent groups were also performed. The types of stents that were used in greater than 100 procedures, as determined from the EHRs, were selected and categorized, including the Promus family, the Xience family, the Resolute family, the Endeavor, the Nobori, and the Biomatrix stents. Prior to each analysis, cases where a vessel was treated with more than one category of stents were excluded (i.e., vessels that were treated with both BP-DES and DP-DES). All statistical analyses were performed using the R package (R Development Core Team, Vienna, Austria). A p-value < 0.05 was considered significant.

Results

Data processing

The final consensus version of the data extraction program by three different programmers, including a manual review of elements for which consensus was not attained (220 of 5,612 descriptions, 3.9%), was used to generate the PCI data warehouse. A negative predictive value of 1.000 and a positive predictive value of 0.995 were observed when these data were compared with the gold standard data of the target vessel and stent categories. Sensitivity was 0.996, specificity was 1.000, and the F-measure was 0.995, indicating that the data warehouse can be used in research settings.

PCI data warehouse construction

A PCI data warehouse containing information from PCI reports was constructed, and included information from 5,612 target vessels of 3,817 patients who underwent index PCI between 2010 and 2014. The patient and procedural characteristics of the index procedures are shown in Table 3.
Table 3

Baseline, procedural, and outcome findings.

VariableAllDP-DESBP-DESp-value*
No. of patients, n38172468568-
Age, mean ± SD62.75 ± 11.7762.19 ± 11.8063.14 ± 11.610.080
Male, n (%)2724 (71%)1799 (73%)386 (68%)0.021
Hypertension, n (%)1756 (46%)1126 (46%)263 (46%)0.806
Diabetes mellitus, n (%)746 (20%)489 (20%)116 (20%)0.788
Average duration of DAPT (days), mean ± SD572.59 ± 444.77559.29 ± 425.71549.12 ± 384.320.578
Duration of DAPT over 12 months, n (%)2326 (61%)1510 (61%)347 (61%)1.000
No. of treated vessels, n (%)
    LAD2565 (67%)1474 (60%)349 (61%)0.480
    RCA1346 (35%)806 (33%)156 (27%)0.019
    LCx1288 (34%)639 (26%)145 (26%)0.900
    LM413 (11%)312 (13%)37 (7%)< 0.001
Average diameter of stents per target vessel (mm), mean ± SD2.61 ± 1.193.11 ± 1.193.07 ± 1.190.020
Total length of stents per target vessel (mm), mean ± SD27.56 ± 19.2434.17 ± 19.2428.64 ± 19.24< 0.001
No. of stents per target vessel, mean ± SD1.06 ± 0.651.28 ± 0.651.23 ± 0.650.029
Post-stent non-compliant balloon use, n (%)1970 (52%)1335 (54%)293 (52%)0.301
Procedures with femoral artery approach, n (%)3087 (81%)1998 (81%)450 (79%)0.378
Vessels treated by major stent types, n (%)
    Xience family995 (40%)
    Promus family1143 (46%)
    Resolute family978 (40%)
    Endeavor162 (7%)
    Nobori482 (85%)
    Biomatrix185 (33%)
    Orsiro22 (4%)
Outcomes
    Cases of TVR, n (%)222 (5.8%)97 (3.9%)18 (3.2%)0.462
    Observation period, days730.31 ± 463.78693.98 ± 463.78724.12 ± 463.780.098

SD, standard deviation; DP-DES, durable polymer drug eluting stents; BP-DES, biodegradable polymer drug eluting stents; DAPT, dual antiplatelet therapy; LAD, left anterior descending artery; RCA, right coronary artery; LCx, left circumflex; LM, left main; TVR, target vessel revascularization.

*DP-DES group and BP-DES group were compared by Pearson’s chi-squared test for categorical variables and by Student’s t-test for continuous variables

SD, standard deviation; DP-DES, durable polymer drug eluting stents; BP-DES, biodegradable polymer drug eluting stents; DAPT, dual antiplatelet therapy; LAD, left anterior descending artery; RCA, right coronary artery; LCx, left circumflex; LM, left main; TVR, target vessel revascularization. *DP-DES group and BP-DES group were compared by Pearson’s chi-squared test for categorical variables and by Student’s t-test for continuous variables

Validation of findings to demonstrate similar findings to previous studies

To validate our data warehouse, we compared the TVR risk of the second-generation DES with that of the first-generation DES using Kaplan-Meier analysis and Cox proportional hazard regression. The TVR risk of second-generation stents was lower (adjusted HR: 0.423; 95% confidence interval (CI): 0.284–0.630), which was in accordance with previous studies (Fig 2A). The known predictors of TVR, including age, hypertension, diabetes mellitus, emergent PCI, and DAPT over 12 months, are also shown as significant (Table 4). These results indicate that the data in the constructed database are reliable and could be used for further analyses.
Fig 2

Comparison of target vessel revascularization free survival between the first- and second-generation stents.

Table 4

Hazard ratios of first- versus second-generation stents and the observed variables in PCI patients.

VariableHR (95% CI)p-value
Second-generation stents (reference, first-generation stents)0.423 (0.284–0.630)< 0.001
Age0.978 (0.962–0.994)0.009
Sex (male)1.010 (0.652–1.564)0.966
Hypertension3.469 (2.294–5.245)< 0.001
Diabetes mellitus2.100 (1.420–3.107)< 0.001
Total stent length1.005 (0.983–1.027)0.661
Average stent diameter0.943 (0.571–1.556)0.818
Non-compliant high pressure balloon use1.349 (0.919–1.980)0.126
Target vessel (reference, LAD)
    LCx0.697 (0.401–1.211)0.200
    LM0.415 (0.163–1.055)0.065
    RCA1.121 (0.732–1.716)0.599
No. of stents used0.833 (0.419–1.656)0.601
Radial artery approach0.714 (0.405–1.259)0.244
Emergent PCI3.270 (2.025–5.280)< 0.001
DAPT over 12 months0.389 (0.270–0.559)< 0.001

HR, hazard ratio; CI, confidence interval; LAD, left anterior descending artery; RCA, right coronary artery; LCx, left circumflex; LM, left main; DAPT, dual antiplatelet therapy; PCI, percutaneous coronary intervention.

HR, hazard ratio; CI, confidence interval; LAD, left anterior descending artery; RCA, right coronary artery; LCx, left circumflex; LM, left main; DAPT, dual antiplatelet therapy; PCI, percutaneous coronary intervention.

New findings: BP-DES versus DP-DES

The baseline clinical characteristics of the BP-DES and DP-DES groups were similar (Table 3). Compared with the DP-DES group, RCA and LM were less frequently treated in the BP-DES group, and patients in the DP-DES group had a lower number of stents with smaller diameters and shorter lengths per target vessel. The rate of TVR was not significantly different between the BP-DES and DP-DES groups (3.2% vs. 3.9%, p = 0.462). We compared the risk of TVR between the BP-DES and DP-DES groups. In patients treated with BP-DES, the TVR-free survival tended to be higher (log rank, p = 0.059) and the HR of TVR was lower without statistical significance (adjusted HR 0.568, 95% CI 0.281–1.147, p = 0.114) compared to patients treated with DP-DES (Table 5). However, in the Kaplan-Meier plot, the TVR-free survival rates of the two types of stents diverged at approximately 1 year after the index procedure was performed (Fig 2B).
Table 5

Hazard ratios of DP-DES vs. BP-DES and the observed variables in PCI patients.

VariableHR (95% CI)p-value
Biodegradable stents (reference, durable stents)0.568 (0.281–1.147)0.114
Age0.980 (0.961–0.999)0.040
Sex (male)1.008 (0.597–1.702)0.975
Hypertension5.111 (2.983–8.757)< 0.001
Diabetes mellitus2.195 (1.386–3.477)0.001
Total stent length1.007 (0.982–1.032)0.588
Average stent diameter1.031 (0.577–1.843)0.918
Non-compliant high pressure balloon use1.392 (0.879–2.203)0.158
Target vessel (reference, LAD)
    LCx0.417 (0.196–0.887)0.023
    LM0.319 (0.097–1.052)0.061
    RCA0.967 (0.589–1.588)0.894
No. of stents used0.823 (0.381–1.774)0.618
Radial artery approach0.411 (0.178–0.951)0.038
Emergent PCI3.861 (2.273–6.559)< 0.001
DAPT over 12 months0.389 (0.251–0.603)< 0.001

HR, hazard ratio; CI, confidence interval; LAD, left anterior descending artery; RCA, right coronary artery; LCx, left circumflex; LM, left main; DAPT, dual antiplatelet therapy; PCI, percutaneous coronary intervention.

HR, hazard ratio; CI, confidence interval; LAD, left anterior descending artery; RCA, right coronary artery; LCx, left circumflex; LM, left main; DAPT, dual antiplatelet therapy; PCI, percutaneous coronary intervention.

Within-class comparisons of DP-DES and BP-DES

Fig 2C shows the within-class comparison of TVR-free survival among the second-generation DES using Cox proportional hazard regression. The Endeavor stents showed a significantly higher HR (adjusted HR 2.576, 95% CI 1.273–5.210) than the other second-generation stents (Table 6). Other comparisons did not yield statistical significance, although BP-DES (i.e., Nobori and Biomatrix) showed a trend for lower HRs.
Table 6

Adjusted TVR risk of each stent compared to the other second generation stents.

VariableAdjusted HR* (95%CI)p-value
Xience family0.717 (0.392–1.312)0.280
Promus family1.096 (0.691–1.740)0.696
Resolute family1.145 (0.697–1.880)0.594
Endeavor2.576 (1.273–5.210)0.008
Nobori0.721 (0.328–1.585)0.416
Biomatrix0.376 (0.091–1.552)0.176

HR, hazard ratio; CI, confidence interval.

*Adjusted for age, sex, hypertension, diabetes mellitus, total length and average diameter of stents, non-compliant high pressure balloon use, target vessel (LAD, LCx, LM, or RCA), the number of stent used, approach artery, emergent percutaneous coronary intervention, and dual antiplatelet therapy over 12 months.

HR, hazard ratio; CI, confidence interval. *Adjusted for age, sex, hypertension, diabetes mellitus, total length and average diameter of stents, non-compliant high pressure balloon use, target vessel (LAD, LCx, LM, or RCA), the number of stent used, approach artery, emergent percutaneous coronary intervention, and dual antiplatelet therapy over 12 months.

Discussion

In this study, we extracted PCI-related parameters and constructed a PCI data warehouse from free-text electronic patient records with high accuracy. Using this data warehouse, we identified practice-based evidence to assess the performance of various coronary stents. In a self-validation analysis, the present study reaffirmed the established findings that newer generation DES are superior to first-generation DES in terms of TVR. Moreover, we provided clinically meaningful evidence that BP-DES were associated with a lower TVR risk than DP-DES, particularly 1 year after the index procedure. The Endeavor family of stents was significantly associated with higher risk of TVR among the newer generation stents. Our findings support the results of previous studies and provide novel findings regarding the comparative efficacy of various newer types of DES in an unrestricted real-world clinical setting. A study based on the Swedish Coronary Angiography and Angioplasty Registry (SCAAR) showed similar results to this study when comparing older and newer generation DES [22]. Although the classification of Endeavor stents was different (old generation DES in the SCAAR study and second-generation DES in this study) the adjusted cumulative risk of target lesion revascularization (TLR) by PCI for up to 2 years (adjusted HR 0.60, 95% CI 0.51–0.70) was comparable to that of this study (adjusted HR 0.423, 95% CI 0.284–0.630). Moreover, when comparing each DES, the cumulative restenosis plot in the SCAAR study and the Kaplan-Meier plot in this study both showed that the TVR risk for Endeavor stents was higher than that of other stents [23]. Also, although there were no significant differences between the TVR HR of DP-DES and BP-DES for the 2-year outcome, the occurrence of TVR of the two stent types diverged after approximately 1 year. This reflects the results of the LEADERS trial, which showed a reduction in very late stent thrombosis (> 1 year) and a significant reduction in all-cause revascularization for BP-DES [24]. However, these results remain debatable, as some recent clinical trials state that the safety and efficacy of these stents did not differ [25, 26]. Previous studies have focused on comparing only a few types of stents; they were unable to make various comparisons due to the inherent limitations of clinical trials, which require numerous resources and are time-consuming. These studies were often confined to comparison of two or three types of stents and sometimes included only short follow-up periods (1 year), making it difficult to compare various stents from a single data source [27-29]. Also, as developments in DES technology occur rapidly, many studies become obsolete after just a few years, raising further questions regarding the timing and cost-effectiveness of prospective clinical trials. In contrast, we were able to include a large number of procedures, with low cost and within a short period, by extracting information from reports stored in EHRs. We were also able to obtain data regarding the newer BP-DES. To do so, we developed text-mining tools using regular expression. Three different researchers developed mining programs independently and subsequently compared their results to ensure reliable data. Our data showed high validity and reliability when compared with the gold standard and already established researches. The sensitivity and specificity were 0.996 and 1.000, respectively, and the F-measure was 0.995. We confirmed that the second-generation DES are associated with significantly lower TVR risk than first-generation DES. The high sensitivity and specificity we obtained are primarily the result of both the inherent characteristics of regular expression itself and the extensive reviews and revisions that we performed. High specificity was achieved because our expressions matched only the strings that were defined in the script, thus avoiding false positives. However, to ensure high sensitivity, it was paramount that all forms of abbreviations and spellings were pre-defined and taken into account. Although a pre-defined vessel and stent table was made, it was not sufficient to accommodate the various forms of the terms used. Therefore, the script review by the three programmers was of great importance to ensure high sensitivity. Using three individual scripts, it was possible to take into account the different forms other programmers had found, and an almost perfect match was obtained using the three scripts. The principal differences are the expressions used to match the patterns. A particular script may fail to extract certain words that other scripts extract, so these were revised during each round. This process was sufficient to reach a surprisingly high sensitivity. The construction of a PCI data warehouse opens up possibilities for further research. First, our data warehouse contained information concerning the implemented stents and a detailed description of each stent in each target vessel (i.e., brand, number of stents, average diameter, and total length). Moreover, the analyses were strengthened using covariates (i.e., age, sex, hypertension, diabetes mellitus, and DAPT duration). Lastly, a total of 5,612 treated vessels from 3,817 PCI procedures were included, which resulted in a sample size large enough to achieve statistical validity. The fact that EHRs stored solely for practice purposes can be used for research also opens up new possibilities. With proper extraction and organization, through application of technologies such as regular expression and natural language processing (NLP), the majority of EHRs could be used in a research setting. This study encourages the use of the vast data that are stored in the EHRs of hospitals, even those in free-text form. There are several known NLP tools, which extract information through semantic analysis of medical records, such as the Linguistic Strings Project and the Medical Language Extraction and Encoding system (MedLEE) [30, 31]. However, most of these were developed for texts in English, while the majority of medical records in Korea are written in both English and Korean. There are also terms and abbreviations that are used only in Korea, making it difficult to apply existing tools directly to Korean medical records. However, the development of new NLP tools to analyze Korean EHRs would be expensive and labor-intensive. In addition, the CAG and PCI reports that we used did not contain complete sentences, but rather, were in a semi-structured format. Thus, pattern matching, not semantic analysis of the text, was the method of choice, particularly regular expression. When semi-structured medical records are written in both English and a local language (here, Korean), NLP tools cannot be used, and regular expression may be cost-effective and reliable when employed to extract information. This study was not without limitations. First, the data warehouse was based solely upon procedures and prescriptions performed at Ajou University Hospital. Although patients are likely to return for follow-ups, some may choose different hospitals for further treatment. Others who were initially treated at another hospital (e.g., emergency reasons) may have subsequently visited Ajou University Hospital, possibly resulting in underreported outcomes and insufficient information on baseline characteristics. This is the reason why we only evaluated TVR rather than myocardial infarction and death, which are also major outcomes of PCI. The second limitation was that using regular expression may have caused unknown errors in cases with critical spelling mistakes on reports. Although most spelling mistakes were accounted for in our scripts, some extremely unique errors may have been missed, causing an imperfect sensitivity. Third, we only compared the list of pairs that consisted of the target vessel and stent categories to evaluate the degree of agreement between information retrieved using each script and the gold standard. Therefore, we could not determine how well the other variables were extracted other than the two mentioned above. However, it makes sure that the complete set of information was extracted, because if any of the three variables (stent name, diameter, and length) were missing, we determined that there was no record of a stent insertion in the corresponding vessel. The other variables (route of approach [radial or femoral], whether emergent PCI was performed or not, and whether a non-compliant high-pressure balloon was applied or not) were not evaluated in any way.

Conclusions

A PCI data warehouse, constructed using regular expression reports on CAG and PCI stored in EHRs, was useful and reliable to evaluate the clinical performance of coronary stents. In terms of TVR, our study confirmed previous results demonstrating that second-generation DES are superior to first-generation DES, and demonstrated that Endeavor stents showed the highest event rate among the newer generation DES. BP-DES had a similar efficacy to DP-DES, but were associated with a superior long-term outcome after 1 year of observation. It is encouraging for further research that free-text reports originally stored for diagnostic purposes can be processed into forms that are appropriate for research.

Demonstration of the regular expression script.

(DOCX) Click here for additional data file.

A Sample data set of the PCI data warehouse.

(XLSX) Click here for additional data file.
  28 in total

1.  Initial clinical experience with an everolimus eluting platinum chromium stent (Promus Element) in unselected patients from the Swedish Coronary Angiography and Angioplasty Registry (SCAAR).

Authors:  Giovanna Sarno; Bo Lagerqvist; Jörg Carlsson; Göran Olivecrona; Johan Nilsson; Fredrik Calais; Matthias Götberg; Tage Nilsson; Iwar Sjögren; Stefan James
Journal:  Int J Cardiol       Date:  2012-01-13       Impact factor: 4.164

2.  A tale of coronary artery disease and myocardial infarction.

Authors:  Elizabeth G Nabel; Eugene Braunwald
Journal:  N Engl J Med       Date:  2012-01-05       Impact factor: 91.245

3.  Sirolimus-eluting versus uncoated stents in acute myocardial infarction.

Authors:  Christian Spaulding; Patrick Henry; Emmanuel Teiger; Kevin Beatt; Ezio Bramucci; Didier Carrié; Michel S Slama; Bela Merkely; Andrejs Erglis; Massimo Margheri; Olivier Varenne; Ana Cebrian; Hans-Peter Stoll; David B Snead; Christoph Bode
Journal:  N Engl J Med       Date:  2006-09-14       Impact factor: 91.245

4.  2013 ESC guidelines on the management of stable coronary artery disease: the Task Force on the management of stable coronary artery disease of the European Society of Cardiology.

Authors:  Gilles Montalescot; Udo Sechtem; Stephan Achenbach; Felicita Andreotti; Chris Arden; Andrzej Budaj; Raffaele Bugiardini; Filippo Crea; Thomas Cuisset; Carlo Di Mario; J Rafael Ferreira; Bernard J Gersh; Anselm K Gitt; Jean-Sebastien Hulot; Nikolaus Marx; Lionel H Opie; Matthias Pfisterer; Eva Prescott; Frank Ruschitzka; Manel Sabaté; Roxy Senior; David Paul Taggart; Ernst E van der Wall; Christiaan J M Vrints; Jose Luis Zamorano; Stephan Achenbach; Helmut Baumgartner; Jeroen J Bax; Héctor Bueno; Veronica Dean; Christi Deaton; Cetin Erol; Robert Fagard; Roberto Ferrari; David Hasdai; Arno W Hoes; Paulus Kirchhof; Juhani Knuuti; Philippe Kolh; Patrizio Lancellotti; Ales Linhart; Petros Nihoyannopoulos; Massimo F Piepoli; Piotr Ponikowski; Per Anton Sirnes; Juan Luis Tamargo; Michal Tendera; Adam Torbicki; William Wijns; Stephan Windecker; Juhani Knuuti; Marco Valgimigli; Héctor Bueno; Marc J Claeys; Norbert Donner-Banzhoff; Cetin Erol; Herbert Frank; Christian Funck-Brentano; Oliver Gaemperli; José R Gonzalez-Juanatey; Michalis Hamilos; David Hasdai; Steen Husted; Stefan K James; Kari Kervinen; Philippe Kolh; Steen Dalby Kristensen; Patrizio Lancellotti; Aldo Pietro Maggioni; Massimo F Piepoli; Axel R Pries; Francesco Romeo; Lars Rydén; Maarten L Simoons; Per Anton Sirnes; Ph Gabriel Steg; Adam Timmis; William Wijns; Stephan Windecker; Aylin Yildirir; Jose Luis Zamorano
Journal:  Eur Heart J       Date:  2013-08-30       Impact factor: 29.983

5.  Biodegradable polymer drug-eluting stents reduce the risk of stent thrombosis at 4 years in patients undergoing percutaneous coronary intervention: a pooled analysis of individual patient data from the ISAR-TEST 3, ISAR-TEST 4, and LEADERS randomized trials.

Authors:  Giulio G Stefanini; Robert A Byrne; Patrick W Serruys; Antoinette de Waha; Bernhard Meier; Steffen Massberg; Peter Jüni; Albert Schömig; Stephan Windecker; Adnan Kastrati
Journal:  Eur Heart J       Date:  2012-03-24       Impact factor: 29.983

6.  Everolimus-eluting versus paclitaxel-eluting stents in coronary artery disease.

Authors:  Gregg W Stone; Ali Rizvi; William Newman; Kourosh Mastali; John C Wang; Ronald Caputo; Julie Doostzadeh; Sherry Cao; Charles A Simonton; Krishnankutty Sudhir; Alexandra J Lansky; Donald E Cutlip; Dean J Kereiakes
Journal:  N Engl J Med       Date:  2010-05-06       Impact factor: 91.245

7.  Incidence and Characteristics of Late Catch-Up Phenomenon Between Sirolimus-Eluting Stent and Everolimus-Eluting Stent: A Propensity Matched Study.

Authors:  Norihiro Kobayashi; Yoshiaki Ito; Masatsugu Nakano; Motoharu Araki; Keisuke Hirano; Masahiro Yamawaki; Hideyuki Takimura; Yasunari Sakamoto; Reiko Tsukahara; Toshiya Muramatsu
Journal:  J Interv Cardiol       Date:  2015-10-21       Impact factor: 2.279

8.  Paclitaxel-Eluting versus Everolimus-Eluting Coronary Stents in Diabetes.

Authors:  Upendra Kaul; Sripal Bangalore; Ashok Seth; Priyadarshini Arambam; Rajpal K Abhaichand; Rajpal K Abhaychand; Tejas M Patel; Darshan Banker; Atul Abhyankar; Ajit S Mullasari; Sanjay Shah; Rajneesh Jain; Premchand R Kumar; C G Bahuleyan
Journal:  N Engl J Med       Date:  2015-10-14       Impact factor: 91.245

9.  Architectural requirements for a multipurpose natural language processor in the clinical environment.

Authors:  C Friedman; S B Johnson; B Forman; J Starren
Journal:  Proc Annu Symp Comput Appl Med Care       Date:  1995

10.  Intravascular stents to prevent occlusion and restenosis after transluminal angioplasty.

Authors:  U Sigwart; J Puel; V Mirkovitch; F Joffre; L Kappenberger
Journal:  N Engl J Med       Date:  1987-03-19       Impact factor: 91.245

View more
  1 in total

1.  Text Processing for Detection of Fungal Ocular Involvement in Critical Care Patients: Cross-Sectional Study.

Authors:  Sally L Baxter; Adam R Klie; Bharanidharan Radha Saseendrakumar; Gordon Y Ye; Michael Hogarth
Journal:  J Med Internet Res       Date:  2020-08-14       Impact factor: 5.428

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.