Kelly D Myers1, Joshua W Knowles2, David Staszak3, Michael D Shapiro4, William Howard3, Mrinal Yadava4, David Zuzick5, Latoya Williamson5, Nigam H Shah6, Juan M Banda6, Joe Leader7, William C Cromwell8, Ed Trautman9, Michael F Murray10, Seth J Baum11, Seth Myers3, Samuel S Gidding5, Katherine Wilemon5, Daniel J Rader12. 1. The Familial Hypercholesterolemia Foundation, Pasadena, CA, USA; Atomo, Austin, TX, USA. Electronic address: km@thefhfoundation.org. 2. The Familial Hypercholesterolemia Foundation, Pasadena, CA, USA; Division of Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA, USA. 3. Atomo, Austin, TX, USA. 4. Department of Medicine, Center for Preventive Cardiology, Knight Cardiovascular Institute, Oregon Health & Science University, Portland, OR, USA. 5. The Familial Hypercholesterolemia Foundation, Pasadena, CA, USA. 6. Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA. 7. Geisinger Health System, Danville, PA, USA. 8. Lipoprotein & Metabolic Disorders Institute, Raleigh, NC, USA. 9. Laboratory Corporation of America Holdings, Burlington, NC, USA. 10. Yale Center for Genomic Health, New Haven, CT, USA. 11. Department of Integrated Medical Sciences, Charles E Schmidt College of Medicine, Florida Atlantic University, Boca Raton, FL, USA. 12. The Familial Hypercholesterolemia Foundation, Pasadena, CA, USA; Department of Genetics, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA; Department of Medicine, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA; Department of Pediatrics, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA.
Abstract
BACKGROUND: Cardiovascular outcomes for people with familial hypercholesterolaemia can be improved with diagnosis and medical management. However, 90% of individuals with familial hypercholesterolaemia remain undiagnosed in the USA. We aimed to accelerate early diagnosis and timely intervention for more than 1·3 million undiagnosed individuals with familial hypercholesterolaemia at high risk for early heart attacks and strokes by applying machine learning to large health-care encounter datasets. METHODS: We trained the FIND FH machine learning model using deidentified health-care encounter data, including procedure and diagnostic codes, prescriptions, and laboratory findings, from 939 clinically diagnosed individuals with familial hypercholesterolaemia (395 of whom had a molecular diagnosis) and 83 136 individuals presumed free of familial hypercholesterolaemia, sampled from four US institutions. The model was then applied to a national health-care encounter database (170 million individuals) and an integrated health-care delivery system dataset (174 000 individuals). Individuals used in model training and those evaluated by the model were required to have at least one cardiovascular disease risk factor (eg, hypertension, hypercholesterolaemia, or hyperlipidemia). A Health Insurance Portability and Accountability Act of 1996-compliant programme was developed to allow providers to receive identification of individuals likely to have familial hypercholesterolaemia in their practice. FINDINGS: Using a model with a measured precision (positive predictive value) of 0·85, recall (sensitivity) of 0·45, area under the precision-recall curve of 0·55, and area under the receiver operating characteristic curve of 0·89, we flagged 1 331 759 of 170 416 201 patients in the national database and 866 of 173 733 individuals in the health-care delivery system dataset as likely to have familial hypercholesterolaemia. Familial hypercholesterolaemia experts reviewed a sample of flagged individuals (45 from the national database and 103 from the health-care delivery system dataset) and applied clinical familial hypercholesterolaemia diagnostic criteria. Of those reviewed, 87% (95% Cl 73-100) in the national database and 77% (68-86) in the health-care delivery system dataset were categorised as having a high enough clinical suspicion of familial hypercholesterolaemia to warrant guideline-based clinical evaluation and treatment. INTERPRETATION: The FIND FH model successfully scans large, diverse, and disparate health-care encounter databases to identify individuals with familial hypercholesterolaemia. FUNDING: The FH Foundation funded this study. Support was received from Amgen, Sanofi, and Regeneron.
BACKGROUND: Cardiovascular outcomes for people with familial hypercholesterolaemia can be improved with diagnosis and medical management. However, 90% of individuals with familial hypercholesterolaemia remain undiagnosed in the USA. We aimed to accelerate early diagnosis and timely intervention for more than 1·3 million undiagnosed individuals with familial hypercholesterolaemia at high risk for early heart attacks and strokes by applying machine learning to large health-care encounter datasets. METHODS: We trained the FIND FH machine learning model using deidentified health-care encounter data, including procedure and diagnostic codes, prescriptions, and laboratory findings, from 939 clinically diagnosed individuals with familial hypercholesterolaemia (395 of whom had a molecular diagnosis) and 83 136 individuals presumed free of familial hypercholesterolaemia, sampled from four US institutions. The model was then applied to a national health-care encounter database (170 million individuals) and an integrated health-care delivery system dataset (174 000 individuals). Individuals used in model training and those evaluated by the model were required to have at least one cardiovascular disease risk factor (eg, hypertension, hypercholesterolaemia, or hyperlipidemia). A Health Insurance Portability and Accountability Act of 1996-compliant programme was developed to allow providers to receive identification of individuals likely to have familial hypercholesterolaemia in their practice. FINDINGS: Using a model with a measured precision (positive predictive value) of 0·85, recall (sensitivity) of 0·45, area under the precision-recall curve of 0·55, and area under the receiver operating characteristic curve of 0·89, we flagged 1 331 759 of 170 416 201 patients in the national database and 866 of 173 733 individuals in the health-care delivery system dataset as likely to have familial hypercholesterolaemia. Familial hypercholesterolaemia experts reviewed a sample of flagged individuals (45 from the national database and 103 from the health-care delivery system dataset) and applied clinical familial hypercholesterolaemia diagnostic criteria. Of those reviewed, 87% (95% Cl 73-100) in the national database and 77% (68-86) in the health-care delivery system dataset were categorised as having a high enough clinical suspicion of familial hypercholesterolaemia to warrant guideline-based clinical evaluation and treatment. INTERPRETATION: The FIND FH model successfully scans large, diverse, and disparate health-care encounter databases to identify individuals with familial hypercholesterolaemia. FUNDING: The FH Foundation funded this study. Support was received from Amgen, Sanofi, and Regeneron.
Authors: Noura S Abul-Husn; Kandamurugu Manickam; Laney K Jones; Eric A Wright; Dustin N Hartzel; Claudia Gonzaga-Jauregui; Colm O'Dushlaine; Joseph B Leader; H Lester Kirchner; D'Andra M Lindbuchler; Marci L Barr; Monica A Giovanni; Marylyn D Ritchie; John D Overton; Jeffrey G Reid; Raghu P R Metpally; Amr H Wardeh; Ingrid B Borecki; George D Yancopoulos; Aris Baras; Alan R Shuldiner; Omri Gottesman; David H Ledbetter; David J Carey; Frederick E Dewey; Michael F Murray Journal: Science Date: 2016-12-23 Impact factor: 47.728
Authors: David S Wald; Jonathan P Bestwick; Joan K Morris; Ken Whyte; Lucy Jenkins; Nicholas J Wald Journal: N Engl J Med Date: 2016-10-27 Impact factor: 91.245
Authors: Børge G Nordestgaard; M John Chapman; Steve E Humphries; Henry N Ginsberg; Luis Masana; Olivier S Descamps; Olov Wiklund; Robert A Hegele; Frederick J Raal; Joep C Defesche; Albert Wiegman; Raul D Santos; Gerald F Watts; Klaus G Parhofer; G Kees Hovingh; Petri T Kovanen; Catherine Boileau; Maurizio Averna; Jan Borén; Eric Bruckert; Alberico L Catapano; Jan Albert Kuivenhoven; Päivi Pajukanta; Kausik Ray; Anton F H Stalenhoef; Erik Stroes; Marja-Riitta Taskinen; Anne Tybjærg-Hansen Journal: Eur Heart J Date: 2013-08-15 Impact factor: 29.983
Authors: Muin J Khoury; Scott Bowen; W David Dotson; Emily Drzymalla; Ridgely F Green; Robert Goldstein; Katherine Kolor; Leandris C Liburd; Laurence S Sperling; Rebecca Bunnell Journal: Genet Med Date: 2022-04-28 Impact factor: 8.864
Authors: Gemme Campbell-Salome; Laney K Jones; Max F Masnick; Nephi A Walton; Catherine D Ahmed; Adam H Buchanan; Andrew Brangan; Edward D Esplin; David G Kann; Ilene G Ladd; Melissa A Kelly; Iris Kindt; H Lester Kirchner; Mary P McGowan; Megan N McMinn; Ana Morales; Kelly D Myers; Matthew T Oetjens; Alanna Kulchak Rahm; Tara J Schmidlen; Amanda Sheldon; Emilie Simmons; Moran Snir; Natasha T Strande; Nicole L Walters; Katherine Wilemon; Marc S Williams; Samuel S Gidding; Amy C Sturm Journal: Circ Genom Precis Med Date: 2021-01-22
Authors: Laney K Jones; Nicole Walters; Andrew Brangan; Catherine D Ahmed; Michael Gatusky; Gemme Campbell-Salome; Ilene G Ladd; Amanda Sheldon; Samuel S Gidding; Mary P McGowan; Alanna K Rahm; Amy C Sturm Journal: J Pers Med Date: 2021-06-21