| Literature DB >> 32711532 |
Daniela Almeida1, David Gorender1, Maria Yury Ichihara1, Samila Sena1, Luan Menezes1, George C G Barbosa1,2, Rosimeire L Fiaccone1,3, Enny S Paixão4,5, Robespierre Pita1, Mauricio L Barreto1.
Abstract
BACKGROUND: Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals.Entities:
Year: 2020 PMID: 32711532 PMCID: PMC7382864 DOI: 10.1186/s12911-020-01192-0
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Flow chart data linkage tool
Number and percentage of linked records by year, Brazil, 2001-2015
| Year | Total | Linked | |
|---|---|---|---|
| N | % | ||
| 2,448,609 | 961,605 | 39.27 | |
| 2,319,071 | 1,175,223 | 50.68 | |
| 2,224,872 | 1,179,781 | 53.03 | |
| 2,165,661 | 1,144,809 | 52.86 | |
| 2,161,484 | 1,183,292 | 54.74 | |
| 2,050,534 | 1,271,179 | 61.99 | |
| 1,961,446 | 1,087,254 | 55.43 | |
| 1,936,675 | 1,077,781 | 55.65 | |
| 1,855,919 | 1,052,394 | 56.70 | |
| 1,778,515 | 1,067,417 | 60.02 | |
| 1,765,211 | 1,249,492 | 70.78 | |
| 1,662,414 | 1,251,251 | 75.27 | |
| 1,505,476 | 1,227,162 | 81.51 | |
| 1,271,156 | 1,043,499 | 82.09 | |
| 592,848 | 475,275 | 80.17 | |
| 27,699,891 | 16,447,414 | 59.38 | |
*from 2011 the maternal date of birth was available
Metrics of accuracy - Linkage for mother
| Year | Date of mother’s birth available | |||||||
|---|---|---|---|---|---|---|---|---|
| No | Yes | |||||||
| AUC (%) | Threshold | Specificity (%) | Sensitivity (%) | AUC (%) | Threshold | Specificity (%) | Sensitivity (%) | |
| 99.18 | 0,929 | 96,4 | 95,5 | --- | --- | --- | --- | |
| 98.04 | 0,928 | 92,2 | 96,2 | --- | --- | --- | --- | |
| 98.94 | 0,9300 | 95,2 | 96,5 | --- | --- | --- | --- | |
| 99.31 | 0,954 | 98,4 | 94,6 | --- | --- | --- | --- | |
| 99.34 | 0,947 | 97,4 | 96,1 | --- | --- | --- | --- | |
| 93.94 | 0,915 | 81,6 | 96,4 | --- | --- | --- | --- | |
| 96.04 | 0,954 | 90,3 | 96,1 | --- | --- | --- | --- | |
| 95.74 | 0,955 | 88,7 | 97,7 | --- | --- | --- | --- | |
| 96.63 | 0,950 | 87,4 | 98,2 | --- | --- | --- | --- | |
| 98.50 | 0,944 | 93,5 | 98,6 | --- | --- | --- | --- | |
| 95.59 | 0,955 | 86,6 | 97,6 | 99.36 | 0,940 | 96,9 | 98,7 | |
| 96.79 | 0,925 | 88,1 | 97,4 | 98.58 | 0,941 | 96,1 | 94,1 | |
| 97.19 | 0,952 | 88,5 | 98,6 | 98.25 | 0,920 | 95,6 | 94,7 | |
| 96.70 | 0,953 | 86,7 | 97,9 | 98.20 | 0,913 | 93,1 | 95,5 | |
| 97.28 | 0,955 | 88,3 | 98,4 | 99.15 | 0,933 | 97,1 | 94,5 | |
Associations between the characteristics of the cohort and the accuracy of the linkage
| Characteristics | 2001 | 2014 | ||||||
|---|---|---|---|---|---|---|---|---|
| Linked | Non-linked | Linked | Non-linked | |||||
| Missing | 11610 | 0.78 | 11610 | 0.78 | 50098 | 4.80 | 14134 | 6.21 |
| Public supply | 982902 | 66.10 | 982902 | 66.10 | 735652 | 70.50 | 150925 | 66.29 |
| Well | 361618 | 24.32 | 361618 | 24.32 | 171268 | 16.41 | 41885 | 18.40 |
| Other | 130874 | 8.80 | 130874 | 8.80 | 86481 | 8.29 | 20713 | 9.10 |
| Missing | 8188 | 0.85 | 22896 | 1.54 | 138178 | 13.24 | 37445 | 16.45 |
| Public collection | 378673 | 39.38 | 616471 | 41.46 | 439186 | 42.09 | 83050 | 36.48 |
| Septic tank | 158983 | 16.53 | 207798 | 13.97 | 137642 | 13.19 | 31112 | 13.67 |
| Rudimentary Pit | 253761 | 26.39 | 371292 | 24.97 | 285395 | 27.35 | 65959 | 28.97 |
| Ditch | 143119 | 14.88 | 237890 | 16.00 | 35164 | 3.37 | 8143 | 3.58 |
| Other | 18881 | 1.96 | 30657 | 2.06 | 7934 | 0.76 | 1948 | 0.86 |
| Missing | 5518 | 0.57 | 11616 | 0.78 | 50098 | 4.80 | 14134 | 6.21 |
| Collected | 698035 | 72.59 | 1035356 | 69.63 | 793622 | 76.05 | 162792 | 71.51 |
| Burnt / Buried | 173667 | 18.06 | 294548 | 19.81 | 174553 | 16.73 | 44385 | 19.50 |
| Landfill | 75287 | 7.83 | 127226 | 8.56 | 19263 | 1.85 | 4846 | 2.13 |
| Other | 9098 | 0.95 | 18258 | 1.23 | 5963 | 0.57 | 1500 | 0.66 |
| Missing | 33774 | 3.51 | 64030 | 4.31 | 42891 | 4.11 | 8903 | 8.30 |
| Pre-school | 149013 | 15.50 | 199390 | 13.41 | 14640 | 1.40 | 2946 | 1.29 |
| Literacy | 63098 | 6.56 | 84264 | 5.67 | 84 | 0.01 | 27 | 0.01 |
| Elementary school | 204054 | 21.22 | 455525 | 30.63 | 410 | 0.04 | 297 | 0.13 |
| High school | 956 | 0.10 | 2062 | 0.14 | 187 | 0.02 | 83 | 0.04 |
| College education | 52 | 0.01 | 108 | 0.01 | 0 | 0.00 | 2 | 0.00 |
| Illiteracy | 510658 | 53.10 | 681625 | 45.84 | 985287 | 94.42 | 215399 | 94.62 |
| Missing | 21525 | 2.24 | 25275 | 1.70 | 1 | 0.00 | 0 | 0.00 |
| Caucasian | 312717 | 32.52 | 474201 | 31.89 | 339022 | 32.49 | 64997 | 28.55 |
| Black | 56836 | 5.91 | 78562 | 5.28 | 35608 | 3.41 | 7596 | 3.34 |
| Asian | 3465 | 0.36 | 4667 | 0.31 | 4932 | 0.47 | 1203 | 0.53 |
| Brown | 562957 | 58.54 | 890725 | 59.90 | 654706 | 62.74 | 151017 | 66.34 |
| Indigenous | 4105 | 0.43 | 13574 | 0.91 | 9230 | 0.88 | 2844 | 1.25 |
| Male | 491672 | 51.13 | 763571 | 51.35 | 533728 | 51.15 | 114985 | 50.51 |
| Female | 469933 | 48.87 | 723433 | 48.65 | 509771 | 48.85 | 112672 | 49.49 |
| Missing | 123 | 0.01 | 405 | 0.03 | 126 | 0.01 | 26 | 0.01 |
| Urban | 724567 | 75.35 | 1079682 | 72.61 | 808507 | 77.48 | 168299 | 73.93 |
| Rural | 236915 | 24.64 | 406917 | 27.36 | 234866 | 22.51 | 59332 | 26.06 |
2001- before maternal date of birth was available, 2014- after maternal date of birth was available