| Literature DB >> 23304481 |
Janet E Squires1, Alison M Hutchinson, Anne-Marie Bostrom, Kelly Deis, Peter G Norton, Greta G Cummings, Carole A Estabrooks.
Abstract
Researchers strive to optimize data quality in order to ensure that study findings are valid and reliable. In this paper, we describe a data quality control program designed to maximize quality of survey data collected using computer-assisted personal interviews. The quality control program comprised three phases: (1) software development, (2) an interviewer quality control protocol, and (3) a data cleaning and processing protocol. To illustrate the value of the program, we assess its use in the Translating Research in Elder Care Study. We utilize data collected annually for two years from computer-assisted personal interviews with 3004 healthcare aides. Data quality was assessed using both survey and process data. Missing data and data errors were minimal. Mean and median values and standard deviations were within acceptable limits. Process data indicated that in only 3.4% and 4.0% of cases was the interviewer unable to conduct interviews in accordance with the details of the program. Interviewers' perceptions of interview quality also significantly improved between Years 1 and 2. While this data quality control program was demanding in terms of time and resources, we found that the benefits clearly outweighed the effort required to achieve high-quality data.Entities:
Year: 2012 PMID: 23304481 PMCID: PMC3529418 DOI: 10.1155/2012/303816
Source DB: PubMed Journal: Nurs Res Pract ISSN: 2090-1429
Figure 1Data quality control program.
Number of interviews for data collector by province and nursing home, partial wave 2 (July 16th, 2009–November 23th, 2009).
| Data collector | Province | Total | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | |||||||||||||||||
| Facility | Facility | Facility | |||||||||||||||||
| S3 | A2 | T5 | E5 | B9 | H3 | G7 | D2 | Total | R7 | W9 | F3 | K4 | Total | Z8 | T9 | Q8 | Total | ||
| 1 | 11 | 27 | 36 | 74 | 74 | ||||||||||||||
| 15 | 8 | 4 | 24 | 36 | 36 | ||||||||||||||
| 16 | 28 | 16 | 41 | 85 | 85 | ||||||||||||||
| 29 | 2 | 2 | 2 | ||||||||||||||||
| 30 | 12 | 10 | 20 | 2 | 26 | 70 | 70 | ||||||||||||
| 35 | 9 | 37 | 8 | 54 | 54 | ||||||||||||||
|
| |||||||||||||||||||
| 26 | 9 | 4 | 13 | 26 | 26 | ||||||||||||||
| 34 | 25 | 16 | 22 | 13 | 76 | 76 | |||||||||||||
|
| |||||||||||||||||||
| 13 | 6 | 7 | 13 | 13 | |||||||||||||||
| 18 | 28 | 6 | 17 | 51 | 51 | ||||||||||||||
| 27 | 21 | 9 | 6 | 36 | 36 | ||||||||||||||
|
| |||||||||||||||||||
| Total | 25 | 19 | 84 | 10 | 36 | 20 | 65 | 62 | 321 | 34 | 20 | 35 | 13 | 102 | 55 | 15 | 30 | 100 | 523 |
Figure 2Missing values by item for all provinces (July 16th, 2009–Nov 23rd, 2009).
Figure 3Skewness by item for all provinces (July 16th, 2009–November 23rd, 2009). Comment presented in the report: The skewness graph indicates if the answers to any item are skewed or not. Negative is left skewed (tend to have small values) and positive is right skewed (tend to have large values). In the skewness it shows that there is no significant difference among all three provinces.
Missing data over the two years of the TREC Project 1.
| Missing rate | Year 1 | Year 2 | Total (Year 1 + Year 2) | ||||
|---|---|---|---|---|---|---|---|
| Frequency | Percent | Frequency | Percent | Frequency | Percent | Cumulative percent | |
| No missing | 0 | 0 | 1135 | 75.2 | 1135 | 37.8 | 37.9 |
| 0%~1% | 0 | 0 | 222 | 14.7 | 222 | 7.4 | 45.3 |
| 1%~2% | 0 | 0 | 95 | 6.3 | 95 | 3.2 | 48.5 |
| 2%~3% | 1010 | 67.6 | 33 | 2.2 | 1043 | 34.7 | 83.3 |
| 3%~4% | 381 | 25.5 | 8 | 0.5 | 389 | 13.0 | 96.3 |
| 4%~5% | 70 | 4.7 | 8 | 0.5 | 78 | 2.6 | 98.9 |
| 5%~10% | 26 | 1.7 | 3 | 0.2 | 29 | 1.0 | 99.9 |
| >10% | 6 | 0.4 | 6 | 0.4 | 12 | 0.4 | 100.0 |
|
| |||||||
| Total | 1493 | 100.0 | 1510 | 100.0 | 3003 | 100.0 | |