| Literature DB >> 28073765 |
Susan Hurley1, Andrew Hertz2, David O Nelson1, Michael Layefsky1, Julie Von Behren2, Leslie Bernstein3, Dennis Deapen4, Peggy Reynolds2.
Abstract
Large-scale environmental epidemiologic studies often rely on exposure estimates based on linkage to residential addresses. This approach, however, is limited by the lack of residential histories typically available for study participants. Our objective was to evaluate the feasibility of using address data from LexisNexis (a division of RELX, Inc., Dayton, Ohio), a commercially available credit reporting company, to construct residential histories for participants in the California Teachers Study (CTS), a prospective cohort study initiated in 1995-1996 to study breast cancer (n = 133,479). We evaluated the degree to which LexisNexis could provide retrospective addresses prior to study enrollment, as well as the concordance with existing prospective CTS addresses ascertained at the time of the completion of 4 self-administered questionnaires. For approximately 80% of CTS participants, LexisNexis provided at least 1 retrospective address, including nearly 25,000 addresses completely encompassed by time periods prior to enrollment. This approach more than doubled the proportion of the study population for whom we had an address of residence during the childbearing years-an important window of susceptibility for breast cancer risk. While overall concordance between the prospective addresses contained in these 2 data sources was good (85%), it was diminished among black women and women under the age of 40 years.Entities:
Keywords: data collection; environmental epidemiology; residential history; residential mobility; validation studies
Mesh:
Year: 2017 PMID: 28073765 PMCID: PMC5860230 DOI: 10.1093/aje/kww108
Source DB: PubMed Journal: Am J Epidemiol ISSN: 0002-9262 Impact factor: 4.897
Existing Prospectively Collected Address Information Available From Routine Follow-up Activities (n = 245,545 Unique Address Records) Carried Out From Enrollment (1995–1996) Through June 1, 2011, for Participants in the California Teachers Study (n = 133,479)
| Characteristic | CTS Participants ( | |
|---|---|---|
| No. | % | |
| Total no. of unique addresses per individual | ||
| 1 | 70,187 | 53 |
| 2 | 35,146 | 26 |
| 3 | 16,192 | 12 |
| 4 | 6,899 | 5 |
| 5 | 2,890 | 2 |
| 6 | 1,298 | 1 |
| 7 | 518 | <1 |
| 8 | 204 | <1 |
| >8 | 145 | <1 |
| Age at enrollment, years[ | ||
| <20 | 0 | 0 |
| 20–29 | 5,548 | 4 |
| 30–39 | 16,535 | 12 |
| 40–49 | 33,384 | 25 |
| 50–59 | 31,845 | 24 |
| 60–69 | 23,064 | 17 |
| 70–79 | 15,984 | 12 |
| ≥80 | 7,119 | 5 |
Abbreviation: CTS, California Teachers Study.
a Youngest age for which an address was available.
Problems and Limitations of 358,520 Addresses Provided by LexisNexis[a] for 133,479 California Teachers Study Participants Enrolled in 1995–1996
| Problem/Limitation[ | No. of Addresses ( | No. of Participants ( |
|---|---|---|
| No address provided | 2,558 | |
| Duplicate address | 15,694 | 12,274 |
| Nonresidential address | 42,577 | 30,422 |
| Address date fell after date of death | 9,068 | 742 |
| Address date fell prior to date of birth | 1,245 | 1,245 |
| Multiple unique addresses for the same time period | 14,991 | 7,307 |
| Address was missing date | 196 | 16 |
Abbreviation: CTS, California Teachers Study.
a LexisNexis is a division of RELX, Inc., Dayton, Ohio.
b Categories are not mutually exclusive.
Extent and Scope of Address Data Provided by LexisNexis[a] for 133,479 California Teachers Study Participants Enrolled in 1995–1996, by Race/Ethnicity and Age at Baseline[b]
| Characteristic | No. of Participants | Participants for Whom Unique Addresses Were Provided by LexisNexis[ | |||||
|---|---|---|---|---|---|---|---|
| No Address | Prebaseline Address | Address at Age <40 Years | |||||
| No. of Persons | % | No. of Persons | % | No. of Persons | % | ||
| All participants | 133,479 | 2,559 | 2 | 107,314 | 80 | 56,286 | 42 |
| Race/ethnicity[ | |||||||
| White | 115,871 | 2,250 | 2 | 92,971 | 80 | 46,730 | 40 |
| Black | 3,553 | 37 | 1 | 3,050 | 86 | 1,641 | 46 |
| Hispanic | 5,409 | 37 | 1 | 4,344 | 80 | 3,551 | 66 |
| Native American | 1,302 | 110 | 8 | 933 | 72 | 333 | 26 |
| Asian/Pacific Islander | 4,495 | 34 | 1 | 3,815 | 85 | 2,657 | 59 |
| Age at baseline, years | |||||||
| 20–39 | 22,083 | 78 | <1 | 14,318 | 65 | 20,626 | 93 |
| 40–49 | 33,384 | 153 | <1 | 28,138 | 84 | 21,337 | 64 |
| 50–59 | 31,845 | 218 | <1 | 27,641 | 87 | 10,873 | 34 |
| 60–69 | 23,064 | 267 | 1 | 19,951 | 87 | 2,585 | 11 |
| 70–79 | 15,984 | 576 | 4 | 12,905 | 81 | 506 | 3 |
| ≥80 | 7,119 | 1,267 | 18 | 4,361 | 61 | 359 | 5 |
Abbreviation: CTS, California Teachers Study.
a LexisNexis is a division of RELX, Inc., Dayton, Ohio.
b The distribution of address data varied significantly by race/ethnicity and age at baseline (Pearson χ2 test: P < 0.001).
c Data were restricted to nonduplicate residential addresses and excluded addresses with dates that fell after the end of CTS follow-up (June 11, 2011) or after the date of death.
d Data for participants with unknown/missing/other information on race/ethnicity (n = 2,849) are not shown.
Results From Retrospective Analysis of Residential Addresses Provided by LexisNexis[a] With Start Dates Prior to the Date of Enrollment (1995–1996) for all California Teachers Study Participants (n = 123,828 Addresses for 133,479 Participants)
| Characteristic | No. of CTS Participants | % of CTS Participants |
|---|---|---|
| Total no. of unique retrospective addresses per person | ||
| 0 | 29,853 | 22 |
| 1 | 86,032 | 64 |
| 2 | 15,400 | 12 |
| 3 | 1,876 | 1 |
| 4 | 258 | <1 |
| 5 | 41 | <1 |
| 6–14 | 19 | <1 |
| Earliest calendar year for which an address was available | ||
| 1990 or later | 26,182 | 19 |
| 1985–1989 | 30,578 | 23 |
| 1980–1984 | 22,516 | 17 |
| 1975–1979 | 11,318 | 8 |
| 1970–1974 | 6,625 | 5 |
| 1965–1969 | 2,832 | 2 |
| Before 1965 | 3,575 | 3 |
| No address provided | 29,853 | 22 |
| Youngest age for which a full address was available, years | ||
| ≤19 | 1,527 | 1 |
| 20–29 | 15,484 | 12 |
| 30–39 | 31,499 | 24 |
| 40–49 | 24,052 | 18 |
| 50–59 | 15,085 | 11 |
| 60–69 | 10,074 | 8 |
| 70–79 | 3,911 | 3 |
| ≥80 | 760 | 1 |
| Invalid (<0 years) | 1,234 | 1 |
| No address provided | 29,853 | 22 |
| Preenrollment time at baseline address, years[ | ||
| ≤1 | 4,385 | 3 |
| >1–5 | 23,929 | 18 |
| >5–10 | 30,422 | 23 |
| >10–15 | 19,976 | 15 |
| >15–20 | 9,653 | 7 |
| >20–25 | 6,089 | 5 |
| >25–30 | 2,663 | 2 |
| >30–40 | 1,748 | 1 |
| >40 | 1,727 | 1 |
| Missing data[ | 3,034 | 1 |
| No address provided | 29,853 | 22 |
Abbreviation: CTS, California Teachers Study.
a LexisNexis is a division of RELX, Inc., Dayton, Ohio.
b Duration of residence at the participant's baseline address prior to study enrollment.
c Preenrollment time at baseline address could not be calculated for 3,034 participants because their CTS enrollment address was not considered a residential address.
Results From Prospective Analysis of the Accuracy of LexisNexis[a] Address Data for Participants in the California Teachers Study (CTS), as Captured by the Concordance of LexisNexis Addresses With CTS Addresses Among CTS Participants for Whom LexisNexis was Able to Provide an Address, 1995–2011[b,c]
| CTS Study Participants | Address Concordance | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| At Baseline (1995–1996) | At Questionnaire 2 (1997) | At Questionnaire 3 (2000) | At Questionnaire 4 (2005–2006) | |||||||||
| No. of Matches[ | Total No.[ | %[ | No. of Matches | Total No. | % | No. of Matches | Total No. | % | No. of Matches | Total No. | % | |
| All participants | 83,421 | 97,305 | 86 | 66,471 | 78,114 | 85 | 63,460 | 75,539 | 84 | 46,033 | 62,490 | 74 |
| Race/ethnicity[ | ||||||||||||
| White | 72,488 | 84,031 | 86 | 58,407 | 68,218 | 86 | 56,032 | 66,299 | 85 | 41,077 | 55,210 | 74 |
| Black | 2,248 | 2,806 | 80 | 1,556 | 1,994 | 78 | 1,357 | 1,766 | 77 | 814 | 1,316 | 62 |
| Hispanic | 3,310 | 4,071 | 81 | 2,380 | 2,957 | 80 | 2,274 | 2,848 | 80 | 1,497 | 2,247 | 67 |
| Native American | 686 | 800 | 86 | 481 | 568 | 85 | 443 | 539 | 82 | 258 | 374 | 69 |
| Asian/Pacific Islander | 3,026 | 3,606 | 84 | 2,430 | 2,903 | 84 | 2,214 | 2,695 | 82 | 1,634 | 2,255 | 72 |
| Age at baseline, years | ||||||||||||
| 20–39 | 10,166 | 13,222 | 77 | 8,750 | 11,113 | 79 | 9,418 | 11,541 | 82 | 6,842 | 9,161 | 75 |
| 40–49 | 22,192 | 25,589 | 87 | 17,012 | 19,789 | 86 | 16,174 | 19,147 | 84 | 11,475 | 16,586 | 69 |
| 50–59 | 21,351 | 24,989 | 85 | 16,472 | 19,577 | 84 | 15,539 | 18,933 | 82 | 12,326 | 17,315 | 71 |
| 60–69 | 15,665 | 17,931 | 87 | 12,889 | 14,929 | 86 | 12,367 | 14,519 | 85 | 9,727 | 12,482 | 78 |
| 70–79 | 10,503 | 11,674 | 90 | 8,735 | 9,820 | 89 | 7,935 | 9,072 | 87 | 4,990 | 6,118 | 82 |
| ≥80 | 3,544 | 3,900 | 91 | 2,613 | 2,886 | 91 | 2,027 | 2,327 | 87 | 673 | 828 | 81 |
Abbreviation: CTS, California Teachers Study.
a LexisNexis is a division of RELX, Inc., Dayton, Ohio.
b Data were restricted to nonduplicate residential addresses and excluded addresses with dates that fell after the end of CTS follow-up (June 11, 2011) or after the date of death.
c The distribution of address concordance varied significantly by race/ethnicity and age at baseline (Pearson χ2 test: P < 0.001).
d Number of CTS participants for whom the LexisNexis address exactly matched the CTS address for the date on which the questionnaire was completed.
e Number of CTS participants for whom LexisNexis provided an address for the date on which the questionnaire was completed; it varied, because not all participants completed all questionnaires.
f (No. of matches/total no.) × 100.
g Data for participants with unknown/missing/other information on race/ethnicity (n = 1,991) are not shown.