Alexander J M Dingemans1, Max Hinne2, Sandra Jansen3, Jeroen van Reeuwijk3, Nicole de Leeuw3, Rolph Pfundt3, Bregje W van Bon3, Anneke T Vulto-van Silfhout3, Tjitske Kleefstra3, David A Koolen3, Marcel A J van Gerven2, Lisenka E L M Vissers3, Bert B A de Vries4. 1. Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Radboud University, Nijmegen, The Netherlands; Department of Artificial Intelligence, Faculty of Social Sciences, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Radboud University, Nijmegen, The Netherlands. 2. Department of Artificial Intelligence, Faculty of Social Sciences, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Radboud University, Nijmegen, The Netherlands. 3. Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Radboud University, Nijmegen, The Netherlands. 4. Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Radboud University, Nijmegen, The Netherlands. Electronic address: Bert.deVries@radboudumc.nl.
Abstract
PURPOSE: Although the introduction of exome sequencing (ES) has led to the diagnosis of a significant portion of patients with neurodevelopmental disorders (NDDs), the diagnostic yield in actual clinical practice has remained stable at approximately 30%. We hypothesized that improving the selection of patients to test on the basis of their phenotypic presentation will increase diagnostic yield and therefore reduce unnecessary genetic testing. METHODS: We tested 4 machine learning methods and developed PredWES from these: a statistical model predicting the probability of a positive ES result solely on the basis of the phenotype of the patient. RESULTS: We first trained the tool on 1663 patients with NDDs and subsequently showed that diagnostic ES on the top 10% of patients with the highest probability of a positive ES result would provide a diagnostic yield of 56%, leading to a notable 114% increase. Inspection of our model revealed that for patients with NDDs, comorbid abnormal (lower) muscle tone and microcephaly positively correlated with a conclusive ES diagnosis, whereas autism was negatively associated with a molecular diagnosis. CONCLUSION: In conclusion, PredWES allows prioritizing patients with NDDs eligible for diagnostic ES on the basis of their phenotypic presentation to increase the diagnostic yield, making a more efficient use of health care resources.
PURPOSE: Although the introduction of exome sequencing (ES) has led to the diagnosis of a significant portion of patients with neurodevelopmental disorders (NDDs), the diagnostic yield in actual clinical practice has remained stable at approximately 30%. We hypothesized that improving the selection of patients to test on the basis of their phenotypic presentation will increase diagnostic yield and therefore reduce unnecessary genetic testing. METHODS: We tested 4 machine learning methods and developed PredWES from these: a statistical model predicting the probability of a positive ES result solely on the basis of the phenotype of the patient. RESULTS: We first trained the tool on 1663 patients with NDDs and subsequently showed that diagnostic ES on the top 10% of patients with the highest probability of a positive ES result would provide a diagnostic yield of 56%, leading to a notable 114% increase. Inspection of our model revealed that for patients with NDDs, comorbid abnormal (lower) muscle tone and microcephaly positively correlated with a conclusive ES diagnosis, whereas autism was negatively associated with a molecular diagnosis. CONCLUSION: In conclusion, PredWES allows prioritizing patients with NDDs eligible for diagnostic ES on the basis of their phenotypic presentation to increase the diagnostic yield, making a more efficient use of health care resources.