Jonathan K Vis1, Martijn Vermaat2, Peter E M Taschner3, Joost N Kok1, Jeroen F J Laros4. 1. Department of Molecular Epidemiology, Leiden University Medical Center, Leiden, The Netherlands, Leiden Institute of Advanced Computer Science, Leiden University, Leiden, The Netherlands. 2. Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands. 3. Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands, Generade Center of Expertise Genomics, University of Applied Sciences Leiden, Leiden, The Netherlands and. 4. Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands, Leiden Genome Technology Center, Leiden University Medical Center, Leiden, The Netherlands.
Abstract
MOTIVATION: Unambiguous sequence variant descriptions are important in reporting the outcome of clinical diagnostic DNA tests. The standard nomenclature of the Human Genome Variation Society (HGVS) describes the observed variant sequence relative to a given reference sequence. We propose an efficient algorithm for the extraction of HGVS descriptions from two sequences with three main requirements in mind: minimizing the length of the resulting descriptions, minimizing the computation time and keeping the unambiguous descriptions biologically meaningful. RESULTS: Our algorithm is able to compute the HGVS descriptions of complete chromosomes or other large DNA strings in a reasonable amount of computation time and its resulting descriptions are relatively small. Additional applications include updating of gene variant database contents and reference sequence liftovers. AVAILABILITY: The algorithm is accessible as an experimental service in the Mutalyzer program suite (https://mutalyzer.nl). The C++ source code and Python interface are accessible at: https://github.com/mutalyzer/description-extractor. CONTACT: j.k.vis@lumc.nl.
MOTIVATION: Unambiguous sequence variant descriptions are important in reporting the outcome of clinical diagnostic DNA tests. The standard nomenclature of the Human Genome Variation Society (HGVS) describes the observed variant sequence relative to a given reference sequence. We propose an efficient algorithm for the extraction of HGVS descriptions from two sequences with three main requirements in mind: minimizing the length of the resulting descriptions, minimizing the computation time and keeping the unambiguous descriptions biologically meaningful. RESULTS: Our algorithm is able to compute the HGVS descriptions of complete chromosomes or other large DNA strings in a reasonable amount of computation time and its resulting descriptions are relatively small. Additional applications include updating of gene variant database contents and reference sequence liftovers. AVAILABILITY: The algorithm is accessible as an experimental service in the Mutalyzer program suite (https://mutalyzer.nl). The C++ source code and Python interface are accessible at: https://github.com/mutalyzer/description-extractor. CONTACT: j.k.vis@lumc.nl.
Authors: Katarzyna Klonowska; Joannes M Grevelink; Krinio Giannikou; Barbara A Ogorek; Zachary T Herbert; Aaron R Thorner; Thomas N Darling; Joel Moss; David J Kwiatkowski Journal: J Clin Invest Date: 2022-05-16 Impact factor: 19.456
Authors: Michael D Nyquist; Alexandra Corella; Ilsa Coleman; Navonil De Sarkar; Arja Kaipainen; Gavin Ha; Roman Gulati; Lisa Ang; Payel Chatterjee; Jared Lucas; Colin Pritchard; Gail Risbridger; John Isaacs; Bruce Montgomery; Colm Morrissey; Eva Corey; Peter S Nelson Journal: Cell Rep Date: 2020-05-26 Impact factor: 9.423
Authors: Daniel M Borràs; Rolf H A M Vossen; Michael Liem; Henk P J Buermans; Hans Dauwerse; Dave van Heusden; Ron T Gansevoort; Johan T den Dunnen; Bart Janssen; Dorien J M Peters; Monique Losekoot; Seyed Yahya Anvar Journal: Hum Mutat Date: 2017-05-29 Impact factor: 4.878
Authors: Maaike van der Lee; William G Allard; Sander Bollen; Gijs W E Santen; Claudia A L Ruivenkamp; Mariëtte J V Hoffer; Marjolein Kriek; Henk-Jan Guchelaar; Seyed Y Anvar; Jesse J Swen Journal: Clin Pharmacol Ther Date: 2019-11-21 Impact factor: 6.875
Authors: Lolita G Mathew; Jeyakumar Ponnuraj; Bheemanna Mallappa; Lingutla R Chowdary; Jianwei Zhang; Wee Tek Tay; Thomas K Walsh; Karl H J Gordon; David G Heckel; Sharon Downes; Yves Carrière; Xianchun Li; Bruce E Tabashnik; Jeffrey A Fabrick Journal: Sci Rep Date: 2018-09-10 Impact factor: 4.379