| Literature DB >> 21966505 |
Brigitte Walther1, Safayet Hossin, John Townend, Neil Abernethy, David Parker, David Jeffries.
Abstract
BACKGROUND: Traditionally, clinical research studies rely on collecting data with case report forms, which are subsequently entered into a database to create electronic records. Although well established, this method is time-consuming and error-prone. This study compares four electronic data capture (EDC) methods with the conventional approach with respect to duration of data capture and accuracy. It was performed in a West African setting, where clinical trials involve data collection from urban, rural and often remote locations. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2011 PMID: 21966505 PMCID: PMC3179496 DOI: 10.1371/journal.pone.0025348
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Schematic of the Graeco Latin Square Design.
| S1 | S2 | S3 | S4 | S5 | ||
|
| M1 O1 | M2 O4 | M3 O2 | M4 O5 | M5 O3 |
|
|
| M2 O2 | M3 O5 | M4 O3 | M5 O1 | M1 O4 |
|
|
| M3 O3 | M4 O1 | M5 O4 | M1 O2 | M2 O5 |
|
|
| M4 O4 | M5 O2 | M1 O5 | M2 O3 | M3 O1 |
|
|
| M5 O5 | M1 O3 | M2 O1 | M3 O4 | M4 O2 |
|
Interviewer/Fieldworker 1–5 (F1–F5).
Interviewee 1–5 (S1–S5).
Method 1 (M1): face-to-face interview & MRC standard paper-based data capturing and processing.
Method 2 (M2): face-to-face interview & EDC using a netbook.
Method 3 (M3): face-to-face interview & EDC using a tablet-PC.
Method 4 (M4): face-to-face interview & EDC using a PDA.
Method 5 (M5): telephone interview & EDC using a laptop.
Order (O1–O5): Each interviewer/field worker interviewed 5 interviewees/volunteers per day. O1–O5 indicate the order in which the interviews were conducted (O1: 1st interview at a particular day, O2: 2nd interview, O3: 3rd interview, etc.).
Figure 1Error rates (%) per questionnaire.
The graph above presents the error rates (%) per interview for the standard paper-based data collection method and the four electronic data capture methods; netbook, tablet PC, PDA, and telephone interview in combination with EDC, in the 1st, 2nd, and 3rd study week. The bars represent median error rates per method and week.
Error rate (%) per week - Trend over time.
| Method | 1st week | 2nd week | 3rd week | 3rd week compared to 1st week | Trend over time | |||
| Mean | CI95% | Mean | CI95% | Mean | CI95% | p | padjusted
| |
|
| 5.2 | 3.4–7.1 | 4.3 | 2.6–6.0 | 3.6 | 2.0–5.2 | 0.189 | 0.210 |
|
| 8.8 | 6.5–11.2 | 5.9 | 4.0–7.9 | 5.0 | 3.2–6.9 | 0.013 | 0.012 |
|
| 6.3 | 3.7–8.9 | 4.9 | 3.1–6.7 | 5.2 | 3.4–7.1 | 0.499 | 0.642 |
|
| 13.2 | 10.3–16.0 | 6.7 | 4.6–8.7 | 7.9 | 5.7–10.2 | 0.005 | 0.003 |
|
| 10.5 | 7.9–13.0 | 9.9 | 7.4–12.4 | 6.3 | 4.3–8.3 | 0.013 | 0.014 |
Wilson 95% confidence interval for the error rate.
Error rate(%) for three interviews using the tablet.
Error rate(%) for four interviews using the PDA.
**p-value for the test of proportions comparing the error rates for the first and third week of the study.
***p-value, mixed effect model adjusting for the clustering effects for ‘fieldworker’ and ‘order’ in which the interviews were conducted.
Figure 2Error rate (%).
The overall error rates per 100 fields (%) and 95% confidence Intervals (Wilson) for the standard paper-based as well as the electronic data capturing methods in the first, second and third week of the study (1st week: blue, 2nd week: red, and 3rd week: green) are presented in graph A. The five smaller graphs present the error rate by field type and study week (B: text fields, C: date, D: single select, E: numerical, F: skip).
Figure 3Missing values.
The overall error rate (%) was defined as error per 100 fields and included the missing values. In the graph above the overall error rate is split into error rate, defined as incorrect entries per 100 fields (white), and missing rate, defined as missing values per 100 fields (black). The missing rates for the 1st, 2nd and 3rd week are presented in the three smaller graphs.
Duration of the interviews.
| Method | 1st week | 2nd week | 3rd week | 3rd week compared to 1st week | Trend over study period | |||
| Median (min) | Range | Median (min) | Range | Median (min) | Range | p | padjusted
| |
|
| 21 | 17–29 | 16 | 10–20 | 17 | 12–21 | 0.056 | 0.133 |
|
| 43 | 30–66 | 45 | 24–57 | 38 | 27–50 | 0.078 | 0.168 |
|
| 66 | 33–78 | 45 | 32–75 | 33 | 23–50 | 0.109 | 0.001 |
|
| 64.5 | 49–88 | 49 | 41–80 | 40 | 29–67 | 0.068 | <0.0001 |
|
| 54 | 35–64 | 40 | 27–48 | 32 | 22–41 | 0.042 | <0.0001 |
*Minimum and maximum duration of the interviews.
**Comparison of the median duration of the interviews between the first and the last week of the study period for each device using the Wilcoxon signed-rank test.
***Mixed effect model fitting device, week, sequence and interaction terms for device/week, adjusting for random effects for the 25 combinations of fieldworker and device. P-values indicate if there was a significant reduction in duration of the interviews over the study period of three weeks.
Median duration for three interviews using the tablet PC.
During the first week of the study 3 of 5 scheduled interviews for the EDC method using the tablet PC were conducted. The p-value of p = 0.109 for the non-parametric test might therefore not be a reliable estimate.
Median duration for four interviews using the PDA.
Figure 4Duration of data entry per questionnaire.
The figure presents the time (min) a data entry clerk needed to transcribe the data from one paper Case Report Form to an electronic record. The median time is represented by a red line.
Comparison of cost estimates for standard and Electronic Data Capture (EDC) methods.
| Type of costs | Standard Method | EDC using Netbook | EDC using Tablet |
|
|
| 1.052 | 0.942 |
| training of nurses/fieldworkers and data entry clerks | training of nurses/fieldworkers | training of nurses/fieldworkers | |
|
| desktop | netbook | tablet PC |
| server space | use of ruggedized devices | use of ruggedized devices | |
| replacement device(s) | replacement device(s) | ||
| server space | server space | ||
|
| office space for data entry clerks | additional costs for transportation (cars/motor-bikes) | additional costs for transportation (cars/motor-bikes) |
| archiving paper CRFs |
*Salary costs per correctly entered field were calculated according to the formulas which are presented in the methods section. The time a data supervisor (standard method) or a data manager (EDC) would need for the synchronization of the data bases and the performance of range and consistency checks was assumed to be 5 min/questionnaire. Duration of double entry of the data of one record was calculated by multiplying the median duration for one entry by two. The values for the interview duration and error rates for the standard and EDC methods were taken from the last week of the study. The costs for netbook and tablet PC are presented as relative percent increase/decrease compared to the standard method.
**The duration of the interviews using EDC was longer compared to the standard paper-based method. The number of interviews per fieldworker and working day might therefore be reduced with EDC compared to the standard method. The costs for transportation might increase consequently.