| Literature DB >> 34664656 |
William R Hogan1, Elizabeth A Shenkman1, Temple Robinson2, Olveen Carasquillo3, Patricia S Robinson4, Rebecca Z Essner4, Jiang Bian1, Gigi Lipori5, Christopher Harle1,5, Tanja Magoc5, Lizabeth Manini1, Tona Mendoza1, Sonya White1, Alex Loiacono1, Jackie Hall1, Dave Nelson5.
Abstract
The OneFlorida Data Trust is a centralized research patient data repository created and managed by the OneFlorida Clinical Research Consortium ("OneFlorida"). It comprises structured electronic health record (EHR), administrative claims, tumor registry, death, and other data on 17.2 million individuals who received healthcare in Florida between January 2012 and the present. Ten healthcare systems in Miami, Orlando, Tampa, Jacksonville, Tallahassee, Gainesville, and rural areas of Florida contribute EHR data, covering the major metropolitan regions in Florida. Deduplication of patients is accomplished via privacy-preserving entity resolution (precision 0.97-0.99, recall 0.75), thereby linking patients' EHR, claims, and death data. Another unique feature is the establishment of mother-baby relationships via Florida vital statistics data. Research usage has been significant, including major studies launched in the National Patient-Centered Clinical Research Network ("PCORnet"), where OneFlorida is 1 of 9 clinical research networks. The Data Trust's robust, centralized, statewide data are a valuable and relatively unique research resource.Entities:
Keywords: Clinical Data Research Network; data analysis; data management; data quality; data warehouse
Mesh:
Year: 2022 PMID: 34664656 PMCID: PMC8922180 DOI: 10.1093/jamia/ocab221
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.Map of contributing partners to the OneFlorida Clinical Research Consortium, where each color represents a healthcare partner organization and each dot of that color represents a physical location operated by that organization, such as a clinic or hospital.
Sociodemographic characteristics of OneFlorida compared to Florida and the United States overall
| Population | OneFlorida | Florida | United States |
|---|---|---|---|
| Size |
|
|
|
| Gender | |||
| Male | 45.2% (7 768 107) | 48.9% | 49.2% |
| Female | 54.7% (9 410 081) | 51.1% | 50.8% |
| Unknown | 0.09% (15 397) | ||
| Age | |||
| <18 years | 27% (4 642 783) | 19.7% | 22.2% |
| 18–64 years | 54% (9 284 021) | 59.4% | 61.3% |
| 65+ years | 18.99% (3 264 282) | 20.9% | 16.5% |
| Unknown | 0.01% (2499) | ||
| Race | |||
| American Indian/Alaskan Native | 0.2% (33 043) | 0.3% | 0.9% |
| Asian | 1.4% (239 141) | 2.8% | 5.7% |
| Black | 18% (3 092 909) | 16.0% | 12.8% |
| White | 45.9% (7 897 865) | 74.5% | 72.0% |
| Native Hawaiian/Pacific Islander | 0.1% (10 480) | 0.1% | 0.2% |
| Other | 23.2% (3 993 438) | 3.4% | 5.0% |
| Unknown | 11.2% (1 926 709) | ||
| Ethnicity | |||
| Hispanic | 23% (4 029 625) | 26.4% | 18.4% |
Notes: OneFlorida numbers are cumulative since 2012, whereas Florida and US numbers are current state. The high percentages of other and unknown in OneFlorida are due to data limitations that reflect a suboptimal state of the art in collecting these data, driven by basic compliance with federal data standards as the primary consideration.
Figure 2.Overall scheme of the Data Trust, showing data sources flowing into the Data Trust, and research workflows for using the data (solid lines represent ongoing routine data flows and dashed lines represent data sources under development, but nonetheless used for specific studies in the past). A robust honest broker system enables patient re-identification for IRB approved protocols to enable participant recruitment. PROs: Patient-Reported Outcomes; ETL: extract, transform, and load; IDs: identifiers; IRB: Institutional Review Board.
Figure 3.Geographical distribution of coverage of Florida by the OneFlorida Data Trust. The figure was generated from OneFlorida patients seen during 2017–2021, totaling 9.2 million patients thus far into 2021. 1 dot = 100 patients.