Literature DB >> 25135245

Open-source electronic data capture system offered increased accuracy and cost-effectiveness compared with paper methods in Africa.

David G Dillon1, Fraser Pirie2, Stephen Rice3, Cristina Pomilla1, Manjinder S Sandhu1, Ayesha A Motala2, Elizabeth H Young4.   

Abstract

OBJECTIVES: Existing electronic data capture options are often financially unfeasible in resource-poor settings or difficult to support technically in the field. To help facilitate large-scale multicenter studies in sub-Saharan Africa, the African Partnership for Chronic Disease Research (APCDR) has developed an open-source electronic questionnaire (EQ). STUDY DESIGN AND
SETTING: To assess its relative validity, we compared the EQ against traditional pen-and-paper methods using 200 randomized interviews conducted in an ongoing type 2 diabetes case-control study in South Africa.
RESULTS: During its 3-month validation, the EQ had a lower frequency of errors (EQ, 0.17 errors per 100 questions; paper, 0.73 errors per 100 questions; P-value ≤0.001), and a lower monetary cost per correctly entered question, compared with the pen-and-paper method. We found no marked difference in the average duration of the interview between methods (EQ, 5.4 minutes; paper, 5.6 minutes).
CONCLUSION: This validation study suggests that the EQ may offer increased accuracy, similar interview duration, and increased cost-effectiveness compared with paper-based data collection methods. The APCDR EQ software is freely available (https://github.com/apcdr/questionnaire).
Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Data capture; Electronic questionnaire; Epidemiology; Open-source; Sub-Saharan Africa; Survey

Mesh:

Year:  2014        PMID: 25135245      PMCID: PMC4271740          DOI: 10.1016/j.jclinepi.2014.06.012

Source DB:  PubMed          Journal:  J Clin Epidemiol        ISSN: 0895-4356            Impact factor:   6.437


Introduction

Existing electronic data capture options are often financially unfeasible in resource-poor settings or difficult to support technically in the field. To help facilitate large-scale multicenter studies in sub-Saharan Africa, the African Partnership for Chronic Disease Research (APCDR) has developed an open-source electronic questionnaire (EQ). This validation study compares the APCDR EQ with traditional pen-and-paper methods to assess the relative efficiency and accuracy of the EQ. Our results suggest that the EQ may offer increased accuracy, similar interview duration, and increased cost-effectiveness compared with paper-based data collection methods. The EQ software is freely available (https://github.com/apcdr/questionnaire). In the past decade, electronic data capture systems have been increasingly adopted over traditional pen-and-paper methods for field survey data collection [1]. By combining the act of data collection with that of data input, electronic data capture devices may lower the cost and time of study monitoring and data management, as well as reduce the amount of data cleaning necessary on return from the field [2], [3], [4]. However, when doing research in a resource-poor setting such as sub-Saharan Africa (SSA), the use of electronic data capture systems may be problematic. Access to high-quality electronic devices and software may be financially unfeasible or restricted because of proprietary licensing, or existing software may be difficult to use for the creation of study-specific data collection forms and require specialist knowledge to maintain once in the field [2]. To address these limitations, the African Partnership for Chronic Disease Research (APCDR) has developed novel software to electronically capture questionnaire data in the field. This open-source data collection system was specifically designed to address the difficulties of working in SSA, and we anticipate that the electronic questionnaire (EQ) may provide an efficient and cost-effective alternative to traditional data collection methods. The validation study presented here aims to compare the APCDR EQ with traditional pen-and-paper methods to assess the relative efficiency and accuracy of the EQ.

Materials and methods

Study overview

We performed the EQ validation study between December 2012 and February 2013 within an ongoing type 2 diabetes case–control (T2DCC) study in Durban, South Africa. The default data collection method in the T2DCC study consisted of paper-based questionnaires and manual double entry of data into a single electronic database. We validated the EQ in the case collection arm of the T2DCC study. Two hundred consecutive diabetes cases, eligible for inclusion in the T2DCC study, were randomized to be interviewed using either the EQ or the existing pen-and-paper methods. All 200 interviews were conducted by the same research nurse. After the EQ validation study was finished, the T2DCC study continued to recruit using existing pen-and-paper methods. The EQ validation study was approved as a minor amendment to the T2DCC by the Biomedical Research Ethics Committee of the University of KwaZulu-Natal (study reference: BF078/08).

The APCDR EQ

The EQ is a C#-based program with supporting extensible markup language (XML) documentation containing the question list and visual formatting. Creation and alteration of question lists is done directly in XML, which is a programming language that allows the creation of design documents in a format that is both human and machine readable. As the main EQ program is coded separately from the question list, multiple XML documents can be created, allowing a single EQ program to run the same questionnaire with several language options or even several different questionnaires. Importantly, XML was specifically designed to emphasize simplicity and usability [5], providing a relatively low barrier to entry for the design and maintenance of questionnaires while still providing access to powerful options, such as Boolean logic question skips and automatic variable range control. Data collected by the EQ are stored in a MySQL database or Excel spreadsheet. Data transfer is facilitated through USB connections, avoiding the need for a constant Internet connection. Furthermore, the EQ program is being adapted for use across multiple platforms (eg, Android and Windows) and across multiple devices (eg, tablet, PC, and mobile phone). The source code for the EQ program and example XML documentation (Windows version) are freely available (https://github.com/apcdr/questionnaire) under the Affero General Public License free software license. Additionally, subsequent updated versions of the EQ, including additional applications such as audio recording and Global Positioning System capabilities, will also be available from the same Web site. In this relative validation study, the question list consisted of 46 questions, which gathered information on sociodemographic indicators, medical history, and anthropometric measurements. We programed the EQ question list to duplicate the existing paper questionnaire from the T2DCC study as closely as possible (Appendix A; see at www.jclinepi.com). A sample screen of the EQ program is shown in Fig. 1.
Fig. 1

Sample electronic questionnaire screen presenting a multiple-choice question. GPS, Global Positioning System.

Sample electronic questionnaire screen presenting a multiple-choice question. GPS, Global Positioning System.

Statistical analyses

We defined data collection errors as nonsensical or impossible inputs, missing data, or inputs that were inconsistent or incompatible with previous responses during the interview. Minor errors were classified a priori as differences of 1 year or less in date questions, whereas all remaining errors were classified as major errors. We calculated P-values for error comparisons between the EQ and paper methods using Pearson chi-square test or Fisher exact test, as appropriate. Cuzick's nonparametric test for trend across ordered groups was used to calculate P-values for the trend in duration of interviews over time. In all instances, P-values <0.05 (two sided) were considered statistically significant. Based on an error frequency estimate of one error per 100 questions for the traditional pen-and-paper method used in the T2DCC study, these analyses have more than 80% power to detect a change of 0.50 errors per 100 questions at the 5% level between the two methods. We based all salary cost analyses on formulae and estimates proposed by Walther et al. [6] for the comparison of electronic data capture systems with traditional pen-and-paper methods in an SSA country. The data were analyzed using Stata, version 11 (Stata Corporation, Texas, USA) and Microsoft Excel 2010.

Results

We collected data on 105 participants using the EQ and 95 participants using the paper method over 3 consecutive months. The slight discrepancy between groups was because of mistakes in implementing the randomization procedure; however, when we compared demographic characteristics between the EQ and paper randomization groups, we did not find any statistically significant differences (Appendix B; see at www.jclinepi.com). The number of errors found for both the EQ and paper methods is presented in Table 1. The EQ had a lower number of major errors per 100 questions (EQ, 0.00 errors; paper, 0.59 errors; P < 0.001), as well as a lower overall number of errors per 100 questions (EQ, 0.17 errors; paper, 0.73 errors; P < 0.001). When using the pen-and-paper method, at least one error occurred every three interviews on average (33.4% of interviews contained at least one error), whereas for the EQ an error occurred once every 14 interviews on average (7.6% of interviews contained at least one error).
Table 1

Frequency of errors in 200 interviews, by month and method

Error typeNumber of errors per 100 questions
Percent of interviews containing at least one error
Month 1Month 2Month 3OverallMonth 1Month 2Month 3Overall
All errors
 Paper0.930.530.960.7338.122.444.033.4
 EQ0.170.870.300.177.74.013.87.6
Major errors
 Paper0.830.440.690.5933.318.432.027.4
 EQ0.000.000.000.000.00.00.00.0
Minor errors
 Paper0.100.890.260.144.84.112.06.3
 EQ0.170.870.300.177.74.013.87.6

Abbreviation: EQ, electronic questionnaire.

Minor errors classified as differences of 1 year or less in date calculations. Major errors classified as all other error types. P-values compare the number of errors for the EQ and paper methods for each error type using Pearson chi-square test.

P ≤ 0.001.

Frequency of errors in 200 interviews, by month and method Abbreviation: EQ, electronic questionnaire. Minor errors classified as differences of 1 year or less in date calculations. Major errors classified as all other error types. P-values compare the number of errors for the EQ and paper methods for each error type using Pearson chi-square test. P ≤ 0.001. Average duration of participant interviews, stratified by month and data collection method, is presented in Table 2. The overall average interview duration was similar for each method (EQ, 5.4 minutes; paper, 5.6 minutes), and the difference was not found to be statistically significant. There appeared to be no trend in interview duration over time for either data collection method.
Table 2

Duration of interview, by month and method (n = 200)

MethodOverall
Month 1
Month 2
Month 3
Trend over time
Mean (range)P-value
EQ (n = 105)5.4 (3.7–8.8)5.7 (4.2–8.8)5.2 (4.0–7.4)5.5 (3.7–8.7)0.100
Paper (n = 95)5.6 (2.0–15.0)4.8 (2.0–7.0)6.0 (4.0–15.0)5.4 (4.0–10.0)0.056

Abbreviation: EQ, electronic questionnaire.

Duration reported in minutes.

Duration of interview, by month and method (n = 200) Abbreviation: EQ, electronic questionnaire. Duration reported in minutes. A summary of estimated economic costs for each data collection method is presented in Table 3. All cost calculations were standardized to 1.00 for the paper method, for ease of comparison. Using the formulae proposed by Walther et al., the EQ salary cost per correctly entered question was almost half the cost of using pen-and-paper methods. In contrast, the initial technology costs for the EQ were 2.47 times larger than those estimated for paper methods. However, based on the costs incurred in the study presented here (Fig. 2), the EQ would recoup the difference in initial start-up costs within 6 months; this time frame would decrease if the number of questions, or the number of interviews per month, was increased.
Table 3

Estimated economic cost per method

Type of costPaperEQ
Salary cost per correctly entered questiona1.000.51
Initial technology costsbDesktopDesktop + 2 tablets
1.002.47
Additional overheadsStorage space for paper hard copies; office space for data entry clerkHardware maintenance and upkeep

Abbreviation: EQ, electronic questionnaire.

All costs standardized to 1.00 for the paper questionnaire.

Salary costs per correctly entered question were calculated using the formulae presented by Walther et al. This includes the following assumptions: (1) Minimum staffing requirements are one field worker, one data entry clerk, and one data supervisor for the pen-and-paper method; and one field worker and one data manager for the EQ method. (2) Based on the EQ study budget, a data entry clerk would cost £250 per month. (3) Although no direct measurements were taken for the time required for double data entry and data quality control during the validation study, applicable estimates are available from published SSA studies. Based on these data, it was assumed that double data entry took 11.6 minutes and data quality control took 5 minutes per paper questionnaire, whereas data quality control took 3 minutes per EQ questionnaire.

Technology costs are based on actual incurred costs of £308 per tablet and £420 per desktop.

Fig. 2

Worked example of estimated time taken to recoup costs using the EQ method. EQ, electronic questionnaire. aSalary estimates are based on formulae presented by Walther et al. (see Table 3). bEquipment costs are taken from Table 3.

Estimated economic cost per method Abbreviation: EQ, electronic questionnaire. All costs standardized to 1.00 for the paper questionnaire. Salary costs per correctly entered question were calculated using the formulae presented by Walther et al. This includes the following assumptions: (1) Minimum staffing requirements are one field worker, one data entry clerk, and one data supervisor for the pen-and-paper method; and one field worker and one data manager for the EQ method. (2) Based on the EQ study budget, a data entry clerk would cost £250 per month. (3) Although no direct measurements were taken for the time required for double data entry and data quality control during the validation study, applicable estimates are available from published SSA studies. Based on these data, it was assumed that double data entry took 11.6 minutes and data quality control took 5 minutes per paper questionnaire, whereas data quality control took 3 minutes per EQ questionnaire. Technology costs are based on actual incurred costs of £308 per tablet and £420 per desktop. Worked example of estimated time taken to recoup costs using the EQ method. EQ, electronic questionnaire. aSalary estimates are based on formulae presented by Walther et al. (see Table 3). bEquipment costs are taken from Table 3.

Discussion

We describe an open-source adaptable electronic data capture system and compare it with traditional paper-based data collection methods in a cohort of 200 individuals. The results of this study suggest that the EQ may offer increased accuracy, similar interview duration, and increased data collection cost-effectiveness over paper-based methods. Overall, the APCDR EQ appears to offer a feasible and cost-effective alternative to paper-based data collection methods in SSA. One advantage of the EQ is its increased data collection accuracy. Although both the EQ and paper methods contained a relatively low proportion of errors, the proportion of errors using the EQ system was less than a third of the proportion using the paper method. Additionally, all the EQ errors were classed as minor, whereas most of the errors made when using the paper method were considered major. Major errors, which included missing or nonsensical data, often require either dropping the piece of data in question or resurveying the participant to obtain corrected information, thus necessitating a choice between reduced statistical power or additional study costs. The increased accuracy of the EQ appears to reduce the frequency of such cost–benefit decisions. Importantly, neither was there any statistically significant difference between the average interview durations for both methods nor was there a trend over time in interview duration for either data collection method. Previous research has proposed that, even assuming electronic data capture methods were as time efficient as paper methods, a lag would occur at the beginning of a study where using the electronic method would take markedly longer as study staff familiarized themselves with the system [6], [7]. There are several possible explanations why the data from the EQ validation study does not show this lag, including a well-run staff training program, software that was easy to use, or study staff who were already technically adept. Regardless, it does not appear that there is any appreciable time lag in data collection when moving from paper methods to the EQ in this setting. Although determining the true economic cost efficiency of each data collection method is quite complex and study specific, this analysis has attempted to highlight and compare the major costs for both the EQ and paper methods. The EQ has higher start-up costs than the paper method, although it appears to have lower running costs once in place. Given that the initial start-up costs are a one-off expenditure, and the salary cost per correctly entered question is ongoing and cumulative, these data suggest that the EQ may be a cost-effective alternative to paper methods in longer running studies. Our study has a number of strengths and potential limitations, which should be considered. We conducted this validation exercise within the context of an ongoing case–control study; our use of a pre-existing research questionnaire, real study participants, and the existing study team strengthens the relative validity of the study. We used a single staff member to administer the study questionnaires and so cannot exclude bias as a result of preference between the two data collection methods. Conversely, using a single staff member brought consistency across the duration of the study. Our findings are in keeping with other studies that have shown the utility of electronic data capture and other informatics tools to facilitate data checks and early detection and correction of faulty procedures and data management [3]. For example, Singleton et al. [8], like us, identified that electronic capture systems often comprised web browsers, requiring keyboard and mouse input. These limitations led the authors to develop a touchscreen interface for their research studies, taking into consideration portability, limited screen space, and potentially inexperienced and low literacy users. This software, however, is not readily obtainable. By contrast—and with these same considerations in mind, which are particularly relevant to working in SSA and other low- and middle-income countries (LMICs)—we have developed our EQ as a publicly available resource and endeavored to overcome some of the barriers to using electronic data capture instruments identified by others [2], [9]. E-health technologies, and their evaluation, are becoming widespread in LMICs [10]; in this context, we anticipate that the benefits of easy-to-use simple electronic data capture systems such as the EQ will be increasingly realized. Other electronic data capture systems are available in addition to the APCDR EQ. For example, programs such as REDCap and Microsoft Access, among others, have been used extensively for electronic data capture and storage. REDCap offers a secure web-based system of data capture, allowing the user to input data from anywhere in the world. It is widely used, with good technical support available and extensive training resources. Similarly, Microsoft Access is a commonly used program. Among its advantages, it is easy to integrate with other programs including non-Microsoft products such as Oracle and Sybase, and it has comprehensive technical support available on the Internet. However, we created the APCDR EQ as our studies based in SSA highlighted certain limitations in existing data capture programs such as those mentioned previously. Specifically, programs such as REDCap require a constant Internet connection to be fully functioning, because of the online nature of the data collection program, and is not easy to customize. Microsoft Access requires licensing fees and does not give full access to the source code for expansion and adaptation of the software. By contrast, in developing the APCDR EQ, we aimed to provide a robust adaptable program that would operate at full capacity without an Internet connection, which was easy to maintain without compromising participant privacy and data security and thus was directly suited for use in challenging and isolated field environments such as in SSA. In our study, the APCDR EQ appears to be more accurate and cost effective than currently used pen-and-paper methods, while maintaining data collection rates. With its multiplatform support and easily designed and maintained XML questionnaires, the EQ provides a simple, yet robust, option for data collection in resource-poor settings. We now plan to develop the EQ further, including exploring options for greater Internet connectivity and use of mobile networks to facilitate real-time data collection. The EQ software and supporting documentation are freely available (https://github.com/apcdr/questionnaire).
  9 in total

1.  Wireless data collection of self-administered surveys using tablet computers.

Authors:  Kyle W Singleton; Mars Lan; Corey Arnold; Mani Vahidi; Lisa Arangua; Lillian Gelberg; Alex A T Bui
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Informatics tools to improve clinical research study implementation.

Authors:  Cynthia A Brandt; Stephanie Argraves; Roy Money; Gowri Ananth; Nina M Trocky; Prakash M Nadkarni
Journal:  Contemp Clin Trials       Date:  2006-01-04       Impact factor: 2.226

3.  Implementation of electronic data capture systems: barriers and solutions.

Authors:  James A Welker
Journal:  Contemp Clin Trials       Date:  2007-01-11       Impact factor: 2.226

4.  Comparison of paper-based and electronic data collection process in clinical trials: costs simulation study.

Authors:  Ivan Pavlović; Tomaz Kern; Damijan Miklavcic
Journal:  Contemp Clin Trials       Date:  2009-04-02       Impact factor: 2.226

Review 5.  E-health technologies show promise in developing countries.

Authors:  Joaquin A Blaya; Hamish S F Fraser; Brian Holt
Journal:  Health Aff (Millwood)       Date:  2010-02       Impact factor: 6.301

6.  Application of handheld devices to field research among underserved construction worker populations: a workplace health assessment pilot study.

Authors:  Alberto J Caban-Martinez; Tainya C Clarke; Evelyn P Davila; Lora E Fleming; David J Lee
Journal:  Environ Health       Date:  2011-04-01       Impact factor: 5.984

Review 7.  Health Informatics in Developing Countries: Going beyond Pilot Practices to Sustainable Implementations: A Review of the Current Challenges.

Authors:  Daniel Luna; Alfredo Almerares; John Charles Mayan; Fernán González Bernaldo de Quirós; Carlos Otero
Journal:  Healthc Inform Res       Date:  2014-01-31

8.  Comparison of electronic data capture (EDC) with the standard data capture method for clinical trial data.

Authors:  Brigitte Walther; Safayet Hossin; John Townend; Neil Abernethy; David Parker; David Jeffries
Journal:  PLoS One       Date:  2011-09-23       Impact factor: 3.240

Review 9.  Who's using PDAs? Estimates of PDA use by health care providers: a systematic review of surveys.

Authors:  Chantelle Garritty; Khaled El Emam
Journal:  J Med Internet Res       Date:  2006-05-12       Impact factor: 5.428

  9 in total
  22 in total

1.  We must harness technology to deliver the musculoskeletal disease epidemiology that is urgently needed across sub-Saharan Africa.

Authors:  Doug L Fink; David Oladele; Oseme Etomi; Hakeem Olaosebikan; Ida Dzifa Dey; Olufemi Olawale Adelowo
Journal:  Clin Rheumatol       Date:  2018-02-08       Impact factor: 2.980

2.  Contactable Non-responders Show Different Characteristics Compared to Lost to Follow-Up Participants: Insights from an Australian Longitudinal Birth Cohort Study.

Authors:  Shu-Kay Ng; Rani Scott; Paul A Scuffham
Journal:  Matern Child Health J       Date:  2016-07

3.  Implementing a Redcap-based research data collection system for mental health.

Authors:  Arturo Marroquin Rivera; Juan Camilo Rosas-Romero; Sergio Mario Castro; Fernando Suárez-Obando; Jeny Aguilera-Cruz; Sophia Marie Bartels; Sena Park; William Chandler Torrey; Carlos Gómez-Restrepo
Journal:  Rev Colomb Psiquiatr (Engl Ed)       Date:  2021-07-10

4.  A spectrum of sharing: maximization of information content for brain imaging data.

Authors:  Vince D Calhoun
Journal:  Gigascience       Date:  2015-01-29       Impact factor: 6.524

5.  Electronic data capture in a rural African setting: evaluating experiences with different systems in Malawi.

Authors:  Carina King; Jenny Hall; Masford Banda; James Beard; Jon Bird; Peter Kazembe; Ed Fottrell
Journal:  Glob Health Action       Date:  2014-10-30       Impact factor: 2.640

Review 6.  Open-Source Electronic Health Record Systems for Low-Resource Settings: Systematic Review.

Authors:  Assel Syzdykova; André Malta; Maria Zolfo; Ermias Diro; José Luis Oliveira
Journal:  JMIR Med Inform       Date:  2017-11-13

7.  Mobile electronic versus paper case report forms in clinical trials: a randomized controlled trial.

Authors:  Robert Fleischmann; Anne-Marie Decker; Antje Kraft; Knut Mai; Sein Schmidt
Journal:  BMC Med Res Methodol       Date:  2017-12-01       Impact factor: 4.615

8.  H3Africa multi-centre study of the prevalence and environmental and genetic determinants of type 2 diabetes in sub-Saharan Africa: study protocol.

Authors:  K Ekoru; E H Young; C Adebamowo; N Balde; B J Hennig; P Kaleebu; S Kapiga; N S Levitt; M Mayige; J C Mbanya; M I McCarthy; O Nyan; M Nyirenda; J Oli; K Ramaiya; L Smeeth; E Sobngwi; C N Rotimi; M S Sandhu; A A Motala
Journal:  Glob Health Epidemiol Genom       Date:  2016-03-08

9.  Study profile: the Durban Diabetes Study (DDS): a platform for chronic disease research.

Authors:  T R Hird; E H Young; F J Pirie; J Riha; T M Esterhuizen; B O'Leary; M I McCarthy; M S Sandhu; A A Motala
Journal:  Glob Health Epidemiol Genom       Date:  2016-02-05

10.  The Treatment In Morning versus Evening (TIME) study: analysis of recruitment, follow-up and retention rates post-recruitment.

Authors:  David A Rorie; Robert W V Flynn; Isla S Mackenzie; Thomas M MacDonald; Amy Rogers
Journal:  Trials       Date:  2017-11-23       Impact factor: 2.279

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.