Literature DB >> 31599443

Analysis of machine learning algorithms as integrative tools for validation of next generation sequencing data.

G Marceddu1, T Dallavilla, G Guerri, A Zulian, C Marinelli, M Bertelli.   

Abstract

OBJECTIVE: While next generation sequencing (NGS) has become the technology of choice for clinical diagnostics, most genetic laboratories still use Sanger sequencing for orthogonal confirmation of NGS results. Previous studies have shown that when the quality of NGS data is high, most calls are indicated by Sanger sequencing, making confirmation redundant. We aimed at establishing a set of criteria that make it possible to distinguish NGS calls that need orthogonal confirmation from those that do not would significantly decrease the amount of work necessary to reach a diagnosis.
MATERIALS AND METHODS: A data set of 7976 NGS calls confirmed as true or false positive by Sanger sequencing was used to train and test different machine learning (ML) approaches. By varying the size and class balance of the training dataset, we measured the performance of the different algorithms to determine the conditions under which ML is a valid approach for confirming NGS calls in a diagnostic environment.
RESULTS: Our results indicate that machine learning is a valid approach to find variant calls that need more investigation, but in order to reach the high accuracy required in a clinical environment, the training data set must include enough observations and these observations must be well-balanced between true/false positive NGS calls.
CONCLUSIONS: Our results show that it is possible to integrate the diagnostic NGS validation workflow with a machine learning approach to reduce the number of Sanger confirmations of high- quality NGS calls, reducing the time and costs of diagnosis.

Year:  2019        PMID: 31599443     DOI: 10.26355/eurrev_201909_19034

Source DB:  PubMed          Journal:  Eur Rev Med Pharmacol Sci        ISSN: 1128-3602            Impact factor:   3.507


  3 in total

1.  Male Infertility Diagnosis: Improvement of Genetic Analysis Performance by the Introduction of Pre-Diagnostic Genes in a Next-Generation Sequencing Custom-Made Panel.

Authors:  Vincenza Precone; Rossella Cannarella; Stefano Paolacci; Gian Maria Busetto; Tommaso Beccari; Liborio Stuppia; Gerolamo Tonini; Alessandra Zulian; Giuseppe Marceddu; Aldo E Calogero; Matteo Bertelli
Journal:  Front Endocrinol (Lausanne)       Date:  2021-01-26       Impact factor: 5.555

2.  Machine learning random forest for predicting oncosomatic variant NGS analysis.

Authors:  Eric Pellegrino; Coralie Jacques; Nathalie Beaufils; Isabelle Nanni; Antoine Carlioz; Philippe Metellus; L'Houcine Ouafik
Journal:  Sci Rep       Date:  2021-11-08       Impact factor: 4.379

3.  appMAGI: A complete laboratory information management system for clinical diagnostics.

Authors:  Giuseppe Marceddu; Tiziano Dallavilla; Aleksander Xhuvani; Muharrem Daja; Luca De Antoni; Arianna Casadei; Matteo Bertelli
Journal:  Acta Biomed       Date:  2020-11-09
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.