Literature DB >> 24223403

Development of prostate cancer research database with the clinical data warehouse technology for direct linkage with electronic medical record system.

In Young Choi1, Seungho Park, Bumjoon Park, Byung Ha Chung, Choung-Soo Kim, Hyun Moo Lee, Seok-Soo Byun, Ji Youl Lee.   

Abstract

PURPOSE: In spite of increased prostate cancer patients, little is known about the impact of treatments for prostate cancer patients and outcome of different treatments based on nationwide data. In order to obtain more comprehensive information for Korean prostate cancer patients, many professionals urged to have national system to monitor the quality of prostate cancer care. To gain its objective, the prostate cancer database system was planned and cautiously accommodated different views from various professions.
METHODS: This prostate cancer research database system incorporates information about a prostate cancer research including demographics, medical history, operation information, laboratory, and quality of life surveys. And, this system includes three different ways of clinical data collection to produce a comprehensive data base; direct data extraction from electronic medical record (EMR) system, manual data entry after linking EMR documents like magnetic resonance imaging findings and paper-based data collection for survey from patients.
RESULTS: We implemented clinical data warehouse technology to test direct EMR link method with St. Mary's Hospital system. Using this method, total number of eligible patients were 2,300 from 1997 until 2012. Among them, 538 patients conducted surgery and others have different treatments.
CONCLUSIONS: Our database system could provide the infrastructure for collecting error free data to support various retrospective and prospective studies.

Entities:  

Keywords:  Hospital information systems; Information Storage and Retrieval; Prostate neoplasms; Retrospective studies

Year:  2013        PMID: 24223403      PMCID: PMC3814112          DOI: 10.12954/PI.12015

Source DB:  PubMed          Journal:  Prostate Int        ISSN: 2287-8882


INTRODUCTION

The National Statistics Office in Korea exclaimed that the prevalence of prostate cancer quadrupled between 2002 and 2008 [1,2]. The incidence of prostate cancer in Korea increased up to 24.9 per 100,000 men in 2009 in comparison with 13 per 100,000 in 2008. Environmental elements, western dietary habits, and the rise in average life expectancy are known as influential factors of the increased rate of prostate cancer patients. The Korean Urologic Association launched a campaign of prostate-specific antigen (PSA) screening test to raise public awareness on the increase in prostate cancer in Korean patients. The rapid increase of prostate cancer patients in Korea requires intensive disease progression and management. In spite of increased prostate cancer patients, little is known about the impact of treatments for prostate cancer patients and outcome of different treatments with nationwide data. In order to obtain more comprehensive information for Korean prostate cancer patients, multicenter longitudinal database had been proposed. There were similar projects in the United States and Japan. One of the most popular database for prostate cancer is the Cancer of the Prostate Strategic Urologic Research Endeavor (CaPSURE), which is the web-based database developed in 1995 for longitudinal observation of prostate cancer patients in natural settings in the United States. This project began with ten participating health care centers and increased to 26 centers in one year. Currently, the CaP-SURE is one of the most powerful prospective study groups for prostate cancer in the world, composed of approximately 14,000 registered prostate cancer patients. The most recent mover in Asia, the Japan Study Group for Prostate Cancer (J-CaP), was developed in 2001. In the case of J-CaP, the database is comprised of 17,872 prostate cancer patients from prospective studies and research to improve patient care. Against this background, many professionals urged to have national system to monitor the quality of prostate cancer care [3]. The requirement for multicenter observational prostate cancer database was also proposed in Korea. Observational databases are useful in evaluating large amounts of data in a timely manner, and evaluating clinical outcomes in real healthcare setting. The database is also used to improve the quality of clinical researches because data collection can assist investigators. To gain its objective, the database was carefully planned and cautiously accommodated different views from various professions. Our first step was to develop research database structure including data elements for a successful observational database. We defined the important questions to which providers want answers, and data elements need to be captured. And, the second step was to compare different ways of data collection and suggest more efficient methods. Data capture with less human effort is important step to maintain the database for long period time. The purpose of this study is to propose the multi center observational research database structure incorporating clinical factors and patient self reports and suggest effective ways of data collection linking with clinical information systems. Our objective was to develop research database system having easy data access, transparent scientific reproducibility, and interoperability between multiple centers.

MATERIALS AND METHODS

To develop database structure for multicenter prostate registry system, we analyzed previously developed database systems as follows.

Materials

CaPSURE database

According to cancer statistics in 2012, prostate cancer was the second leading cause of cancer-related death among men in the United States [4]. The CaPSURE was founded in 1995 as a disease registry of men with all stages of prostate cancer [5]. Currently, a group of 31 urological practice sites enroll patients in CaPSURE, and 40 sites including community based site, 4 Veterans Affairs medical centers are involved. CaPSURE collects approximately 1,000 clinical and patient reported variables. The clinical information includes history of prostate cancer diagnosis, biopsies, pathological findings, staging tests, primary and subsequent treatments, clinic procedures, Karnofsky performance status scores and medications. At each clinic visit the urologist completes a progress record, including current disease status, new prostate or unrelated diagnoses, disease signs and symptoms, and changes in medications. Results of imaging studies and laboratory tests are recorded when they are determined. In addition to the clinical data, the patient information is also collected. At enrollment each patient completes a questionnaire about sociodemographic parameters, comorbidities [6], and baseline health-related quality of life (HRQoL). Every 6 months thereafter patients are mailed a follow-up questionnaire, and HRQoL questionnaires including the Medical Outcomes Study Short Form-36 (SF-36) [7] for general HRQoL and the University of California-Los Angeles Prostate Cancer Index [8] for disease specific HRQoL are collected. Since 1999, the survey on patient satisfaction with care [9,10] and fear of cancer recurrence [11] are also included.

J-Cap database

In Japan, the J-CaP database was established in 2001 with financial support from Japan Kidney Foundation. And the Japanese Urological Association commenced a study to gather information about hormone therapy administered to Japanese patients and to analyze the outcomes of treatment. The purposes of this study group were to gather information about the hormone therapy administered to Japanese prostate cancer patients living in Japan and to analyze the outcomes of treatment in order to create a guideline for optimal hormone therapy. This study analyzes different forms primary androgen deprivation therapy (PADT), including combined androgen blockade therapy, for the treatment of prostate cancer within Japan. The J-CaP registry is a large, multicenter, population-based database of men newly starting PADT for prostate cancer. The following clinical information captured over the Internet; date of birth, family history, date of PSA reading, PSA value, PSA kit name, testosterone value, biopsy date, Gleason score, histological grade, clinical stage, case history, details of hormone therapy, whether or not there has been progress observation, whether or not surgery was carried out, date of surgery, operative procedure, whether or not radiotherapy is being conducted, irradiation method, irradiation date, progress. The interim analysis of the registration status of the patients and their background variables was reported in 2003 [12], and treatment patterns with PADT have been reported along with an interim analysis of prognosis in 2007 [13]. As of 2005, J-CaP included data for 26,272 patients from 406 institutions comprising 77 university hospitals (67% of those in Japan), 267 general hospitals and 62 private hospitals. Around 50% of new prostate cancer patients treated with hormone therapy in Japan was registered with J-CaP at that time.

CPDR database

The Center for Prostate Disease Research (CPDR) was established in 1992, by the United States Congress-Public Law 102–172 [14]. The participating institutions of the CPDR are the Department of Defense, multisite program within the School of Medicine, Department of Surgery, at the Uniformed Services University of the Health Sciences, Bethesda, MD, with its primary clinical program located at Walter Reed Army Medical Center, Washington, D.C. and a scientific laboratory in Rockville, MD. For this program, urologists, cancer biologists, genitor-urinary pathologists, epidemiologists, biostaticians, medical- and bio-informaticians are involved. The goal of the CPDR-Clinical Center Program is to combine prostate screening, clinical diagnosis, data collection, education and counseling, and prostate disease clinical trial research. To address this goal, the National Multicenter Database had been developed. The National Multicenter Database of the CPDR is comprised of several military medical centers including the Army, Air Force, and Navy, and a civilian institution. The data base contains approximately 500 data fields in 48 tables that include registration, patient contact information, pretreatment diagnosis, cancer staging, treatment types, follow-up and recurrence, QoL issues, and many others. Data are collect for standard clinical care, which includes hormonal therapy, radiation therapy, chemotherapy, and surgery. Each major treatment type has a subset of questions asked. The majority of the information is collected during the treatment state. With an average follow-up of 8 years and over 1,200 prostate specimens processed, the CPDR database is rapidly becoming a national prostate cancer research resource. Their efforts had led to more than 300 peer-reviewed publications, numerous scientific presentations, more than 20 clinical trials ranging from disease prevention to QoL.

Methods

Staged approach of database structure development

Developing research data base structure for multi center prostate cancer research is a complex undertaking. We benchmarked previous research project by searching PubMed database for prostate cancer registry system. We entered the keyword as “prostate cancer registry system”, “prostate cancer database system”, and “prostate cancer retrospective research”. Then, large number of articles used CaPSURE database system. Then, we reviewed CaPSURE database system and explore the similarity and difference of both systems and develop basic category for database system. Later, we finalized data elements by working together with physicians of the related departments and refine the set of data element. We had a weekly meeting with doctors from urology, pathology, radiology and radiology oncology for two months to explore database design structure. Total numbers of participants were around 20 people. During the meeting, we asked the participants to gain consensus regarding important items to which patients and providers want answers to understand data elements to be captured. Finally, our research database structure includes major outcome results for prostate cancer such as Table 1. The database incorporates all information about a prostate cancer research; demographic data, medical history, clinical information, laboratory, survey, and follow-up data. The final database results to include approximately 222 clinical and patient-reported items. Our database structure has flexibility to add new measurements when appropriate and to ensure variables to compare outcomes across other healthcare organizations.
Table 1

Data elements of prostate cancer database system

1. DemographicsAge, sex, height, weight, body mass index
2. Medical historyOperation history, preexisting comorbidities, postdiagnosis comorbidities
3. Cancer stageClinical TNM stage, pathological stage
4. LaboratoryProstate-specific antigen, hemoglobin, UDS finding
5. Patient self-reported surveyIPSS, IIEF
6. RadiologyMRI (initial, follow-up), CT (initial, follow-up), TRUS
7. PathologyPreoperative Bx, postoperative Bx
8. Treatment typeActive surveillance, surgery, hormonal therapy, brachytherapy, cryotherapy
9. MedicationMedication category, duration, route

UDS, urodynamic study; IPSS, International Prostate Symptom Score; IIEF, International Index of Erectile Function; MRI, magnetic resonance imaging; CT, computed tomography; TRUS, transrectal ultrasonography; Bx, biopsy.

Data collection method

After determining the data elements for multicenter research, we then develop the strategy of data collection within time-lines. Most of the previous research database is completed by manual entry of physicians or clinical research coordinators or data entry staffs. This data entry method is very labor intensive and cumbersome, so it usually fails to capture relevant data at early times. We concluded that high quality data collection strategy is fundamental to achieve relevant information at early times. Our system includes three different ways of clinical data collection to produce a comprehensive data base; direct data extraction from electronic medical record (EMR) system, manual data entry after linking EMR documents like magnetic resonance imaging findings and paper-based data collection for survey from patients. We combine various data collection methods for different types of data elements. For example, preoperative PSA value can be collected from EMR system. Table 2 shows different data collection methods to include research data into the database system. 27% of total data elements can be collected through direct EMR link, and 70% can be completed through indirect EMR link.
Table 2

Data collection methods of prostate cancer database system

Data collection methodsItemsPercent
Direct electronic medical record extraction5926.6
Indirect electronic medical record link (unstructured)15569.8
Patient reports (paper or mobile link in future)83.6
Total222100
Our integrated data collection and data management will contribute to prevent redundant entry of the same information such as direct linking with hospital information systems using clinical data warehousing technique.

Clinical data warehouse for direct link with EMR system

Implementing the direct extraction program may decrease the performance of EMR system, and thus it is very difficult to add programs into the hospital’s operating system. We suggest using clinical data warehouse (CDW) technology to access and extract research information with less effort. The CDW is the method to develop clinical database that is optimized for distribution, mass storage and complex query processing [15]. It also provides comprehensive views of clinical data for specific purpose. A CDW can provide numerous benefits to researchers with quality data collection, and decision support capability by quick and efficient access to patient information and linkage to multiple operational data sources. Using CDW methodology, accurate and high quality prostate cancer patients’ data can be collected from EMR system and feeds them to central prostate cancer registry system. All eligible patients with newly diagnosed prostate cancer can be electronically transferred into the prostate cancer registry database from EMR system. And all registered patient information will be periodically updated (Fig. 1).
Fig. 1

Clinical data warehouse for electronic medical record link. CMC, catholic medical center; ODS, operational data store; DW, data warehouse; DB, database; ETL, extraction transaction loading.

RESULTS

Our prostate cancer research database system is developed with Microsoft SQL server running on the Microsoft NT servers and is programmed with Java for user interface development. All data and research records are maintained in Internet Data Center which is a facility having telecommunications and storage systems, backup power supplies and redundant data communications connections. This database system provides three different ways of data collection depending on the information technology infrastructure; direct manual entry; Excel upload, and direct EMR link using CDW technology (Fig. 2). For example, a hospital A already has affluent human resource for prostate cancer registry, and prefers manual entry. Then they can access web browser and enter each data field. A hospital prefers to enter data into Excel file and later upload Excel file into the web system. Other hospital wants to download research data automatically and import the extracted file into the research data base system.
Fig. 2

Integrated prostate cancer database system. EMR, electronic medical record; CDW, clinical data warehouse; MRI, magnetic resonance imaging.

We implemented CDW technology to test direct EMR link method with St. Mary’s Hospital system. We defined the EMR variables mapped with the research data elements. Relevant data was transferred into the EMR system of St. Mary’s Hospital using CDW technology and extracted data into the prostate cancer database system. To validate CDW technology, we selected sample data and compared with EMR system. Using this method, total number of eligible patients were 2,300 from 2008 until 2012. Among them, 538 patients conducted surgery and others have different treatments. After completion of the data control for registered patients, analytic reports can be prepared for prospective studies and research for improved prostate cancer patient care. These data summaries evaluate clinical occurrences, patient QoL, economic impact, and oncology outcomes, as well as compare types of treatment by stage and practice, among patients with prostate cancer in other institutions of Korea. Our system can provide visualization integrating valuable information from different data sources. Researchers interpret clinical effectiveness in place and can be the turning point in uncovering new insights and knowledge about a patient or a disease (Fig. 3).
Fig. 3

Patient characteristics by treatment.

DISCUSSION

Our suggested database structure is applicable for any hospital which wants to link their EMR system directly with our research system and can be a representative database to understand prostate cancer patients and treatment patters in Korea. Our system can provide complete treatment histories and patient information and can allow for comparison of different outcomes. As the number of enrolled patients is increased, the system will contribute to compare primary indicator for prostate cancer patients with other institutions in Korea and other databases such as CaPSURE from United States and J-CaP from Japan. For example, UCSF-CAPRA (University of California, San Francisco Cancer of the Prostate Risk Assessment) is a risk assessment tool developed from a cohort of radical prostatectomy patients (n=1,439) in the CaPSURE database. The Japanese tool named as ‘J-CAPRA’ is developed for patients undergoing PADT and is applied for those with both localized and advanced disease [16]. The longitudinal observation database is an important source to investigate therapeutic efficacy and patient outcomes in the real clinical settings and therefore can be an invaluable complement of randomized clinical trials. Our database system could provide the infrastructure for collecting data on the quality of prostate cancer care. Our large database system, like J-CaP and CaPSURE, can provide valuable real-world information and would help advance clinical management of prostate cancer patients in the future.
  15 in total

1.  The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection.

Authors:  J E Ware; C D Sherbourne
Journal:  Med Care       Date:  1992-06       Impact factor: 2.983

2.  Older patients' health status and satisfaction with medical care in an HMO population.

Authors:  J A Hall; M Feldstein; M D Fretwell; J W Rowe; A M Epstein
Journal:  Med Care       Date:  1990-03       Impact factor: 2.983

3.  Fifth Joint Meeting of J-CaP and CaPSURE: advancing the global understanding of prostate cancer and its management.

Authors:  Hideyuki Akaza; Peter Carroll; Matthew R Cooperberg; Shiro Hinotsu
Journal:  Jpn J Clin Oncol       Date:  2012-01-04       Impact factor: 3.019

4.  Risk assessment among prostate cancer patients receiving primary androgen deprivation therapy.

Authors:  Matthew R Cooperberg; Shiro Hinotsu; Mikio Namiki; Kazuto Ito; Jeanette Broering; Peter R Carroll; Hideyuki Akaza
Journal:  J Clin Oncol       Date:  2009-08-10       Impact factor: 44.544

5.  Cancer statistics, 2012.

Authors:  Rebecca Siegel; Deepa Naishadham; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2012-01-04       Impact factor: 508.702

6.  An instrument to measure patient satisfaction with healthcare in an observational database: results of a validation study using data from CaPSURE.

Authors:  D P Lubeck; M S Litwin; J M Henning; S D Mathias; L Bloor; P R Carroll
Journal:  Am J Manag Care       Date:  2000-01       Impact factor: 2.229

7.  Validity of cancer registry data for measuring the quality of breast cancer care.

Authors:  Jennifer L Malin; Katherine L Kahn; John Adams; Lorna Kwan; Marianne Laouri; Patricia A Ganz
Journal:  J Natl Cancer Inst       Date:  2002-06-05       Impact factor: 13.506

8.  Current status of endocrine therapy for prostate cancer in Japan analysis of primary androgen deprivation therapy on the basis of data collected by J-CaP.

Authors:  Shiro Hinotsu; Hideyuki Akaza; Michiyuki Usami; Osamu Ogawa; Susumu Kagawa; Tadaichi Kitamura; Taiji Tsukamoto; Seiji Naito; Mikio Namiki; Yoshihiko Hirao; Masaru Murai; Hidetoshi Yamanaka
Journal:  Jpn J Clin Oncol       Date:  2007-10-26       Impact factor: 3.019

Review 9.  The contemporary management of prostate cancer in the United States: lessons from the cancer of the prostate strategic urologic research endeavor (CapSURE), a national disease registry.

Authors:  Matthew R Cooperberg; Jeanette M Broering; Mark S Litwin; Deborah P Lubeck; Shilpa S Mehta; James M Henning; Peter R Carroll
Journal:  J Urol       Date:  2004-04       Impact factor: 7.450

10.  The UCLA Prostate Cancer Index: development, reliability, and validity of a health-related quality of life measure.

Authors:  M S Litwin; R D Hays; A Fink; P A Ganz; B Leake; R H Brook
Journal:  Med Care       Date:  1998-07       Impact factor: 2.983

View more
  10 in total

1.  Development of a large urban longitudinal HIV clinical cohort using a web-based platform to merge electronically and manually abstracted data from disparate medical record systems: technical challenges and innovative solutions.

Authors:  Alan E Greenberg; Harlen Hays; Amanda D Castel; Thilakavathy Subramanian; Lindsey Powers Happ; Maria Jaurretche; Jeff Binkley; Mariah M Kalmin; Kathy Wood; Rachel Hart
Journal:  J Am Med Inform Assoc       Date:  2015-12-31       Impact factor: 4.497

2.  Development of Clinical Data Mart of HMG-CoA Reductase Inhibitor for Varied Clinical Research.

Authors:  Hun Sung Kim; Hyunah Kim; Yoo Jin Jeong; Tong Min Kim; So Jung Yang; Sun Jung Baik; Seung Hwan Lee; Jae Hyoung Cho; In Young Choi; Kun Ho Yoon
Journal:  Endocrinol Metab (Seoul)       Date:  2017-02-28

3.  Architecture and Implementation of a Clinical Research Data Warehouse for Prostate Cancer.

Authors:  Martin G Seneviratne; Tina Seto; Douglas W Blayney; James D Brooks; Tina Hernandez-Boussard
Journal:  EGEMS (Wash DC)       Date:  2018-06-01

4.  The Korea Cancer Big Data Platform (K-CBP) for Cancer Research.

Authors:  Hyo Soung Cha; Jip Min Jung; Seob Yoon Shin; Young Mi Jang; Phillip Park; Jae Wook Lee; Seung Hyun Chung; Kui Son Choi
Journal:  Int J Environ Res Public Health       Date:  2019-06-28       Impact factor: 3.390

5.  Using natural language processing to construct a metastatic breast cancer cohort from linked cancer registry and electronic medical records data.

Authors:  Albee Y Ling; Allison W Kurian; Jennifer L Caswell-Jin; George W Sledge; Nigam H Shah; Suzanne R Tamang
Journal:  JAMIA Open       Date:  2019-09-18

6.  Prostate-specific antigen kinetics after primary stereotactic body radiation therapy using CyberKnife for localized prostate cancer.

Authors:  Yong Hyun Park; In Young Choi; Sei Chul Yoon; Hong Seok Jang; Hyong Woo Moon; Sung-Hoo Hong; Sae Woong Kim; Tae-Kon Hwang; Ji Youl Lee
Journal:  Prostate Int       Date:  2015-02-12

7.  Effect of Nerve-Sparing Radical Prostatectomy on Urinary Continence in Patients With Preoperative Erectile Dysfunction.

Authors:  Yong Hyun Park; Oh Seong Kwon; Sung-Hoo Hong; Sae Woong Kim; Tae-Kon Hwang; Ji Youl Lee
Journal:  Int Neurourol J       Date:  2016-03-16       Impact factor: 2.835

8.  Is lymphovascular invasion a powerful predictor for biochemical recurrence in pT3 N0 prostate cancer? Results from the K-CaP database.

Authors:  Yong Hyun Park; Yejin Kim; Hwanjo Yu; In Young Choi; Seok-Soo Byun; Cheol Kwak; Byung Ha Chung; Hyun Moo Lee; Choung Soo Kim; Ji Youl Lee
Journal:  Sci Rep       Date:  2016-05-05       Impact factor: 4.379

9.  Development of South Australian-Victorian Prostate Cancer Health Outcomes Research Dataset.

Authors:  Rasa Ruseckaite; Kerri Beckmann; Michael O'Callaghan; David Roder; Kim Moretti; John Zalcberg; Jeremy Millar; Sue Evans
Journal:  BMC Res Notes       Date:  2016-01-22

Review 10.  The Role of Clinical Virology Laboratory and the Clinical Virology Laboratorian in Ensuring Effective Surveillance for Influenza and Other Respiratory Viruses: Points to Consider and Pitfalls to Avoid.

Authors:  Steven J Drews
Journal:  Curr Treat Options Infect Dis       Date:  2016-07-05
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.