| Literature DB >> 35587478 |
Clemens Noelke1, Michael Outrich2, Mikyung Baek2, Jason Reece3, Theresa L Osypuk4, Nancy McArdle1, Robert W Ressler1, Dolores Acevedo-Garcia1.
Abstract
In the 1930's, the Home Owner Loan Corporation (HOLC) drafted maps to quantify variation in real estate credit risk across US city neighborhoods. The letter grades and associated risk ratings assigned to neighborhoods discriminated against those with black, lower class, or immigrant residents and benefitted affluent white neighborhoods. An emerging literature has begun linking current individual and community health effects to government redlining, but each study faces the same measurement problem: HOLC graded area boundaries and neighborhood boundaries in present-day health datasets do not match. Previous studies have taken different approaches to classify present day neighborhoods (census tracts) in terms of historical HOLC grades. This study reviews these approaches, examines empirically how different classifications fare in terms of predictive validity, and derives a predictively optimal present-day neighborhood redlining classification for neighborhood and health research.Entities:
Mesh:
Year: 2022 PMID: 35587478 PMCID: PMC9119533 DOI: 10.1371/journal.pone.0267606
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Census tracts and HOLC-rated polygons in Manhattan, New York City.
Note: The map image was created using the 2019 TIGER/Line Shapefiles (machine readable data files) / prepared by the U.S. Census Bureau and HOLC rating data published by the Digital Scholarship Lab under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (https://creativecommons.org/licenses/by-nc-sa/4.0/).
54 classifications of 2010 census tracts in terms of historical HOLC rating status.
| Rank-ordered (set 6) | 50 percentage points (set 1) | 33 percentage points (set 2) | 25 percentage points (set 3) | 20 percentage points (set 4) | 10 percentage points (set 5) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| ||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| ||
|
|
| 4 | C | 8 | C0 | 12 | C1 | 12 | C1 | 16 | C1 | 29 | C3 |
|
|
|
|
| 28 | C0-B0 | 52 | C1-B0 | 58 | C1-B1 | 77 | C1-B1 | 189 | C3-B3 |
|
|
| 40 | C-B-A | 63 | C0-B0-A0 | 115 | C1-B0-A0 | 146 | C1-B1-A0 | 195 | C1-B1-A1 | 473 | C3-B3-A2 |
|
|
| 64 | C-B-A-D | 106 | C0-B0-A0-D0 | 178 | C1-B0-A0-D0 | 233 | C1-B1-A0-D0 | 308 | C1-B1-A1-D0 | 679 | C3-B3-A2-D1 |
|
|
| 5 | C | 10 | C1 | 15 | C1 | 15 | C2 | 20 | C2 | 40 | C5 |
|
|
| 24 | C-U | 44 | C1-U0 | 89 | C1-U0 | 102 | C2-U0 | 137 | C2-U0 | 342 | C5-U1 |
|
|
|
|
| 139 | C1-U0-D0 | 246 | C1-U0-D0 | 335 | C2-U0-D0 | 465 | C2-U0-D0 | 1154 | C5-U1-D1 |
|
|
| 194 | C-U-D-B | 318 | C1-U0-D0-B0 | 508 | C1-U0-D0-B0 | 684 | C2-U0-D0-B0 | 904 | C2-U0-D0-B0 | 1988 | C5-U1-D1-B1 |
|
|
| 289 | C-U-D-B-A | 455 | C1-U0-D0-B0-A0 | 661 | C1-U0-D0-B0-A0 | 858 | C2-U0-D0-B0-A0 | 1084 | C2-U0-D0-B0-A0 | 2138 | C5-U1-D1-B1-A0 |
1If yes, classifications include unrated portion of census tracts, i.e., the portion not covered by an A, B, C, or D rated polygon, as a separate grade (“U”).
2The number of ratings considered when classifying tracts. 1 = only HOLC polygon covering the largest area of the tract is considered, 2 = the polygons covering the largest and second largest area of a tract are considered, etc.
3The number of distinct classes into which tracts are classified.
4An example labeled class value for a given classification.
Fig 2Core based statistical areas.
Percentage of population in census tracts with 1% or more HOLC rating coverage. Note: The map image was created using the 2019 TIGER/Line Shapefiles (machine readable data files) / prepared by the U.S. Census Bureau and HOLC rating data published by the Digital Scholarship Lab under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (https://creativecommons.org/licenses/by-nc-sa/4.0/).
Descriptive statistics.
| Excluded Tracts | Included Tracts | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
|
| ||||||
| Proportion covered by A-rated polygon | 0.00 | 0.02 | 58,417 | 4.19 | 14.76 | 14,639 |
| Proportion covered by B-rated polygon | 0.00 | 0.02 | 58,417 | 13.91 | 26.09 | 14,639 |
| Proportion covered by C-rated polygon | 0.00 | 0.03 | 58,417 | 30.72 | 35.01 | 14,639 |
| Proportion covered by D-rated polygon | 0.00 | 0.02 | 58,417 | 19.32 | 31.64 | 14,639 |
| Proportion unrated | 100.00 | 0.05 | 58,417 | 31.86 | 32.60 | 14,639 |
|
| ||||||
| Life expectancy | 78.50 | 3.75 | 52,367 | 77.47 | 4.77 | 13,295 |
| Physical health | 12.12 | 3.64 | 15,678 | 13.68 | 4.59 | 11,284 |
| Mental health | 12.59 | 3.15 | 15,678 | 13.86 | 3.62 | 11,284 |
| Binge drinking | 17.55 | 3.76 | 15,678 | 17.77 | 4.64 | 11,284 |
| Cancer | 5.64 | 1.92 | 15,678 | 5.30 | 1.49 | 11,284 |
| Asthma | 9.19 | 1.44 | 15,678 | 10.43 | 2.12 | 11,284 |
| Coronary heart disease | 5.61 | 2.02 | 15,678 | 6.11 | 2.13 | 11,284 |
| Smoking | 17.04 | 5.23 | 15,678 | 19.63 | 6.76 | 11,284 |
| Diabetes | 10.02 | 3.60 | 15,678 | 11.95 | 4.88 | 11,284 |
| Limited physical activity | 23.70 | 8.32 | 15,678 | 28.00 | 10.02 | 11,284 |
| Obesity | 29.01 | 6.95 | 15,678 | 31.68 | 9.54 | 11,284 |
| COI 2.0 | 52.19 | 27.20 | 57,583 | 37.22 | 30.84 | 14,630 |
| Household income rank (p50) | 50.61 | 5.79 | 57,374 | 46.73 | 8.13 | 14,580 |
| Residence in low poverty neighborhood (p50) | 48.95 | 17.95 | 57,366 | 41.87 | 16.81 | 14,575 |
| Household income rank (p25) | 43.49 | 6.68 | 57,374 | 40.41 | 8.18 | 14,580 |
| Household income in top 20% (p25) | 12.81 | 7.79 | 57,374 | 11.40 | 9.02 | 14,580 |
| Residence in low poverty neighborhood (p25) | 44.36 | 19.52 | 57,366 | 36.77 | 17.54 | 14,575 |
Note: Tracts with 1% or more HOLC rating coverage are included in the analysis
*Dependent variable included in cross-validation analysis.
OLS regression coefficients from regressions of cross-validated mean squared prediction errors on experiment features, based on 2,628 cross-validation experiments.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| 2 ratings (Ref.) | ||||||
| 1 rating | 0.029 | 0.029 | ||||
| (0.003) | (0.001) | |||||
| 3 ratings | 0.008 | 0.008 | ||||
| (0.003) | (0.001) | |||||
| 4 ratings | 0.027 | 0.027 | ||||
| (0.003) | (0.001) | |||||
| 5 ratings | 0.057 | 0.047 | ||||
| (0.003) | (0.002) | |||||
| Rank ordered, set 6 (Ref.) | ||||||
| 50% bins, set 5 | 0.004 | 0.004 | ||||
| (0.003) | (0.002) | |||||
| 33% bins, set 4 | 0.015 | 0.015 | ||||
| (0.003) | (0.002) | |||||
| 25% bins, set 3 | 0.017 | 0.017 | ||||
| (0.003) | (0.002) | |||||
| 20% bins, set 2 | 0.022 | 0.022 | ||||
| (0.003) | (0.002) | |||||
| 10% bins, set 1 | 0.055 | 0.055 | ||||
| (0.003) | (0.002) | |||||
| Exclude unrated (Ref.) | ||||||
| Include unrated | 0.026 | 0.019 | ||||
| (0.002) | (0.001) | |||||
| COI 2.0 (Ref.) | ||||||
| Household income rank | 0.032 | 0.032 | ||||
| (0.003) | (0.002) | |||||
| Life expectancy | 0.077 | 0.077 | ||||
| (0.003) | (0.002) | |||||
| Low pov. neighborhood | 0.022 | 0.022 | ||||
| (0.003) | (0.002) | |||||
| Mental health | 0.043 | 0.043 | ||||
| (0.003) | (0.002) | |||||
| Physical health | 0.074 | 0.074 | ||||
| (0.003) | (0.002) | |||||
| Threshold = 5% (Ref.) | ||||||
| Threshold = 1% | 0.001 | 0.001 | ||||
| (0.003) | (0.002) | |||||
| Threshold = 10% | 0.000 | 0.000 | ||||
| (0.003) | (0.002) | |||||
| Threshold = 15% | 0.001 | 0.001 | ||||
| (0.003) | (0.002) | |||||
| Threshold = 25% | 0.002 | 0.002 | ||||
| (0.003) | (0.002) | |||||
| Threshold = 33% | 0.003 | 0.003 | ||||
| (0.003) | (0.002) | |||||
| Threshold = 50% | 0.003 | 0.003 | ||||
| (0.003) | (0.002) | |||||
| Constant | 0.869 | 0.867 | 0.873 | 0.846 | 0.886 | 0.796 |
| (0.002) | (0.002) | (0.001) | (0.002) | (0.002) | (0.002) | |
| Observations | 2,268 | 2,268 | 2,268 | 2,268 | 2,268 | 2,268 |
| R-squared | 0.16 | 0.15 | 0.08 | 0.38 | 0.00 | 0.74 |
Note:
*** p<0.001
** p<0.01
* p<0.05.
Fig 3Median MSE across experiments for each of the 54 classifications tested, plotted against the average degrees of freedom used by a given classification.
A) All classifications. b) Only classifications with less than 100 median degrees of freedom. Note: The yellow horizontal line is drawn at the minimum averaged MSE across the 54 different classifications. The yellow circle marks the best fitting classification. The blue circle is the most parsimonious classification that is closest to the minimum averaged MSE.
Detailed and collapsed rank-ordered, two-rating classification.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1. Only A | 223 | 0.87 | Only or mainly A (1–4) | 981 | 0.86 |
| 2. Mainly A, some B | 544 | 0.89 | |||
| 3. Mainly A, some C | 171 | 0.81 | |||
| 4. Mainly A, some D | 43 | 0.51 | |||
| 5. Only B | 785 | 0.41 | Only B (5) | 785 | 0.41 |
| 6. Mainly B, some A | 566 | 0.61 | Mainly B, some A (6) | 566 | 0.61 |
| 7. Mainly B, some C | 1,388 | 0.31 | Mainly B, some C or D (7–8) | 1,562 | 0.29 |
| 8. Mainly B, some D | 174 | 0.20 | |||
| 9. Only C | 2,760 | 0.09 | Only C (9) | 2,760 | 0.09 |
| 10. Mainly C, some A | 199 | 0.39 | |||
| 11. Mainly C, some B | 1,800 | 0.11 | |||
| 12. Mainly C, some D | 1,840 | -0.30 | Mainly C, some D (12) | 1,840 | -0.30 |
| 13. Only D | 2,156 | -0.31 | Only D (13) | 2,156 | -0.31 |
| 14. Mainly D, some A | 47 | 0.44 | |||
| 15. Mainly D, some B | 193 | -0.07 | |||
| 16. Mainly D, some C | 1,750 | -0.47 | Mainly D, some C (16) | 1,750 | -0.47 |
| Mainly C or D, some A (10 & 14) | 246 | 0.40 | |||
| Mainly C or D, some B (11 & 15) | 1,993 | 0.09 | |||
R-squared statistics from OLS regressions of 11 census tract level health and socio-economic outcomes on different census tract HOLC rating measures/classifications.
| Proportions | One rating | Rank-ordered, 2 ratings, collapsed | Rank-ordered, 2 ratings, detailed | Rank-ordered, 3 ratings, optimal | |
|---|---|---|---|---|---|
|
| 3 | 3 | 9 | 15 | 39 |
|
| |||||
| Drinking | 0.02 | 0.02 | 0.03 | 0.03 | 0.04 |
| Cancer | 0.12 | 0.11 | 0.13 | 0.13 | 0.13 |
| Asthma | 0.07 | 0.06 | 0.07 | 0.08 | 0.08 |
| CHD | 0.02 | 0.02 | 0.03 | 0.03 | 0.04 |
| Smoking | 0.10 | 0.09 | 0.12 | 0.12 | 0.13 |
| Diabetes | 0.07 | 0.06 | 0.08 | 0.08 | 0.08 |
| Phys. act. | 0.11 | 0.10 | 0.12 | 0.12 | 0.13 |
| Obesity | 0.06 | 0.05 | 0.08 | 0.08 | 0.09 |
|
| |||||
|
| |||||
| Income rank | 0.11 | 0.10 | 0.13 | 0.13 | 0.14 |
| Income in top 20% | 0.11 | 0.10 | 0.12 | 0.12 | 0.13 |
| Low pov. neighborhood | 0.15 | 0.13 | 0.15 | 0.15 | 0.16 |
|
|
Note: All outcome were standardized using the z-score transformation.
OLS regression coefficients and standard errors from regressions of census tract level adult household income ranks (Opportunity atlas) and diabetes diagnoses (500 cities) on two census tract HOLC classifications.
|
|
| |||
|---|---|---|---|---|
| Rank-ordered, one rating | Rank-ordered, two ratings (collapsed) | Rank-ordered, one rating | Rank-ordered, two ratings (collapsed) | |
| Only or mainly A (Ref.) | ||||
| Only or mainly B | -3.35 | 1.76 | ||
| (0.29) | (0.22) | |||
| Only or mainly C | -5.70 | 3.22 | ||
| (0.27) | (0.20) | |||
| Only or mainly D | -9.28 | 4.46 | ||
| (0.28) | (0.21) | |||
| Only B | -2.47 | 2.23 | ||
| (0.38) | (0.28) | |||
| Mainly B, some A | -2.16 | 0.75 | ||
| (0.41) | (0.31) | |||
| Mainly B, some C or D | -4.17 | 1.88 | ||
| (0.32) | (0.24) | |||
| Only C | -4.29 | 3.06 | ||
| (0.29) | (0.22) | |||
| Mainly C, some D | -8.44 | 4.21 | ||
| (0.31) | (0.23) | |||
| Only D | -8.83 | 4.03 | ||
| (0.30) | (0.22) | |||
| Mainly D, some C | -10.11 | 5.21 | ||
| (0.31) | (0.23) | |||
| Mainly C or D, some A | -3.98 | 1.35 | ||
| (0.57) | (0.42) | |||
| Mainly C or D, some B | -5.36 | 2.59 | ||
| (0.31) | (0.23) | |||
| Constant | 46.16 | 46.16 | 8.82 | 8.82 |
| (0.26) | (0.25) | (0.19) | (0.19) | |
| Observations | 13,927 | 13,927 | 10,891 | 10,891 |
| RMSE | 7.74 | 7.63 | 4.74 | 4.69 |
| R-squared | 0.10 | 0.13 | 0.06 | 0.08 |
Note:
*** p<0.001
** p<0.01
* p<0.05.