Rachel L Draelos1,2, Jordan E Ezekian3, Farica Zhuang1, Mary E Moya-Mendez3, Zhushan Zhang3, Michael B Rosamilia3, Perathu K R Manivannan3, Ricardo Henao4,5, Andrew P Landstrom3,6. 1. Department of Computer Science, Trinity College of Arts and Sciences (R.L.D., F.Z.), Duke University. 2. Medical Scientist Training Program (R.L.D.), Duke University School of Medicine, Durham, NC. 3. Department of Pediatrics, Division of Cardiology (J.E.Z., M.E.M.-M., Z.Z., M.B.R., P.K.R.M., A.P.L.), Duke University School of Medicine, Durham, NC. 4. Department of Electrical and Computer Engineering, Pratt School of Engineering (R.H.), Duke University. 5. Department of Biostatistics and Bioinformatics (R.H.), Duke University School of Medicine, Durham, NC. 6. Department of Cell Biology (A.P.L.), Duke University School of Medicine, Durham, NC.
Abstract
BACKGROUND: Cardiac channelopathies such as catecholaminergic polymorphic tachycardia and long QT syndrome predispose patients to fatal arrhythmias and sudden cardiac death. As genetic testing has become common in clinical practice, variants of uncertain significance (VUS) in genes associated with catecholaminergic polymorphic ventricular tachycardia and long QT syndrome are frequently found. The objective of this study was to predict pathogenicity of catecholaminergic polymorphic ventricular tachycardia-associated RYR2 VUS and long QT syndrome-associated VUS in KCNQ1, KCNH2, and SCN5A by developing gene-specific machine learning models and assessing them using cross-validation, cellular electrophysiological data, and clinical correlation. METHODS: The GENe-specific EnSemble grId Search framework was developed to identify high-performing machine learning models for RYR2, KCNQ1, KCNH2, and SCN5A using variant- and protein-specific inputs. Final models were applied to datasets of VUS identified from ClinVar and exome sequencing. Whole cell patch clamp and clinical correlation of selected VUS was performed. RESULTS: The GENe-specific EnSemble grId Search models outperformed alternative methods, with area under the receiver operating characteristics up to 0.87, average precisions up to 0.83, and calibration slopes as close to 1.0 (perfect) as 1.04. Blinded voltage-clamp analysis of HEK293T cells expressing 2 predicted pathogenic variants in KCNQ1 each revealed an ≈80% reduction of peak Kv7.1 current compared with WT. Normal Kv7.1 function was observed in KCNQ1-V241I HEK cells as predicted. Though predicted benign, loss of Kv7.1 function was observed for KCNQ1-V106D HEK cells. Clinical correlation of 9/10 variants supported model predictions. CONCLUSIONS: Gene-specific machine learning models may have a role in post-genetic testing diagnostic analyses by providing high performance prediction of variant pathogenicity.
BACKGROUND: Cardiac channelopathies such as catecholaminergic polymorphic tachycardia and long QT syndrome predispose patients to fatal arrhythmias and sudden cardiac death. As genetic testing has become common in clinical practice, variants of uncertain significance (VUS) in genes associated with catecholaminergic polymorphic ventricular tachycardia and long QT syndrome are frequently found. The objective of this study was to predict pathogenicity of catecholaminergic polymorphic ventricular tachycardia-associated RYR2 VUS and long QT syndrome-associated VUS in KCNQ1, KCNH2, and SCN5A by developing gene-specific machine learning models and assessing them using cross-validation, cellular electrophysiological data, and clinical correlation. METHODS: The GENe-specific EnSemble grId Search framework was developed to identify high-performing machine learning models for RYR2, KCNQ1, KCNH2, and SCN5A using variant- and protein-specific inputs. Final models were applied to datasets of VUS identified from ClinVar and exome sequencing. Whole cell patch clamp and clinical correlation of selected VUS was performed. RESULTS: The GENe-specific EnSemble grId Search models outperformed alternative methods, with area under the receiver operating characteristics up to 0.87, average precisions up to 0.83, and calibration slopes as close to 1.0 (perfect) as 1.04. Blinded voltage-clamp analysis of HEK293T cells expressing 2 predicted pathogenic variants in KCNQ1 each revealed an ≈80% reduction of peak Kv7.1 current compared with WT. Normal Kv7.1 function was observed in KCNQ1-V241I HEK cells as predicted. Though predicted benign, loss of Kv7.1 function was observed for KCNQ1-V106D HEK cells. Clinical correlation of 9/10 variants supported model predictions. CONCLUSIONS: Gene-specific machine learning models may have a role in post-genetic testing diagnostic analyses by providing high performance prediction of variant pathogenicity.
Authors: Andrew P Landstrom; Andrew L Dailey-Schwartz; Jill A Rosenfeld; Yaping Yang; Margaret J McLean; Christina Y Miyake; Santiago O Valdes; Yuxin Fan; Hugh D Allen; Daniel J Penny; Jeffrey J Kim Journal: Circ Arrhythm Electrophysiol Date: 2017-04
Authors: Michael J Ackerman; Silvia G Priori; Stephan Willems; Charles Berul; Ramon Brugada; Hugh Calkins; A John Camm; Patrick T Ellinor; Michael Gollob; Robert Hamilton; Ray E Hershberger; Daniel P Judge; Hervè Le Marec; William J McKenna; Eric Schulze-Bahr; Chris Semsarian; Jeffrey A Towbin; Hugh Watkins; Arthur Wilde; Christian Wolpert; Douglas P Zipes Journal: Heart Rhythm Date: 2011-08 Impact factor: 6.343
Authors: Karthik A Jagadeesh; Aaron M Wenger; Mark J Berger; Harendra Guturu; Peter D Stenson; David N Cooper; Jonathan A Bernstein; Gill Bejerano Journal: Nat Genet Date: 2016-10-24 Impact factor: 38.330
Authors: Andrew P Landstrom; Ernesto Fernandez; Jill A Rosenfeld; Yaping Yang; Andrew L Dailey-Schwartz; Christina Y Miyake; Hugh D Allen; Daniel J Penny; Jeffrey J Kim Journal: Heart Rhythm Date: 2018-03-02 Impact factor: 6.343
Authors: Argelia Medeiros-Domingo; Zahurul A Bhuiyan; David J Tester; Nynke Hofman; Hennie Bikker; J Peter van Tintelen; Marcel M A M Mannens; Arthur A M Wilde; Michael J Ackerman Journal: J Am Coll Cardiol Date: 2009-11-24 Impact factor: 24.094
Authors: Ngak-Leng Sim; Prateek Kumar; Jing Hu; Steven Henikoff; Georg Schneider; Pauline C Ng Journal: Nucleic Acids Res Date: 2012-06-11 Impact factor: 16.971
Authors: Sue Richards; Nazneen Aziz; Sherri Bale; David Bick; Soma Das; Julie Gastier-Foster; Wayne W Grody; Madhuri Hegde; Elaine Lyon; Elaine Spector; Karl Voelkerding; Heidi L Rehm Journal: Genet Med Date: 2015-03-05 Impact factor: 8.822
Authors: Fiona Cunningham; Premanand Achuthan; Wasiu Akanni; James Allen; M Ridwan Amode; Irina M Armean; Ruth Bennett; Jyothish Bhai; Konstantinos Billis; Sanjay Boddu; Carla Cummins; Claire Davidson; Kamalkumar Jayantilal Dodiya; Astrid Gall; Carlos García Girón; Laurent Gil; Tiago Grego; Leanne Haggerty; Erin Haskell; Thibaut Hourlier; Osagie G Izuogu; Sophie H Janacek; Thomas Juettemann; Mike Kay; Matthew R Laird; Ilias Lavidas; Zhicheng Liu; Jane E Loveland; José C Marugán; Thomas Maurel; Aoife C McMahon; Benjamin Moore; Joannella Morales; Jonathan M Mudge; Michael Nuhn; Denye Ogeh; Anne Parker; Andrew Parton; Mateus Patricio; Ahamed Imran Abdul Salam; Bianca M Schmitt; Helen Schuilenburg; Dan Sheppard; Helen Sparrow; Eloise Stapleton; Marek Szuba; Kieron Taylor; Glen Threadgold; Anja Thormann; Alessandro Vullo; Brandon Walts; Andrea Winterbottom; Amonida Zadissa; Marc Chakiachvili; Adam Frankish; Sarah E Hunt; Myrto Kostadima; Nick Langridge; Fergal J Martin; Matthieu Muffato; Emily Perry; Magali Ruffier; Daniel M Staines; Stephen J Trevanion; Bronwen L Aken; Andrew D Yates; Daniel R Zerbino; Paul Flicek Journal: Nucleic Acids Res Date: 2019-01-08 Impact factor: 16.971
Authors: Peter M Kekenes-Huskey; Don E Burgess; Bin Sun; Daniel C Bartos; Ezekiel R Rozmus; Corey L Anderson; Craig T January; Lee L Eckhardt; Brian P Delisle Journal: Int J Mol Sci Date: 2022-07-02 Impact factor: 6.208