Literature DB >> 31738401

SPDI: data model for variants and applications at NCBI.

J Bradley Holmes1, Eric Moyer1, Lon Phan1, Donna Maglott1, Brandi Kattman1.   

Abstract

MOTIVATION: Normalizing sequence variants on a reference, projecting them across congruent sequences and aggregating their diverse representations are critical to the elucidation of the genetic basis of disease and biological function. Inconsistent representation of variants among variant callers, local databases and tools result in discrepancies that complicate analysis. NCBI's genetic variation resources, dbSNP and ClinVar, require a robust, scalable set of principles to manage asserted sequence variants.
RESULTS: The SPDI data model defines variants as a sequence of four attributes: sequence, position, deletion and insertion, and can be applied to nucleotide and protein variants. NCBI web services convert representations among HGVS, VCF and SPDI and provide two functions to aggregate variants. One, based on the NCBI Variant Overprecision Correction Algorithm, returns a unique, normalized representation termed the 'Contextual Allele'. The SPDI data model, with its four operations, defines exactly the reference subsequence affected by the variant, even in repeat regions, such as homopolymer and other sequence repeats. The second function projects variants across congruent sequences and depends on an alignment dataset of non-assembly NCBI RefSeq sequences (prefixed NM, NR and NG), as well as inter- and intra-assembly-associated genomic sequences (NCs, NTs and NWs), supporting robust projection of variants across congruent sequences and assembly versions. The variant is projected to all congruent Contextual Alleles. One of these Contextual Alleles, typically the allele based on the latest assembly version, represents the entire set, is designated the unique 'Canonical Allele' and is used directly to aggregate variants across congruent sequences.
AVAILABILITY AND IMPLEMENTATION: The SPDI services are available for open access at: https://api.ncbi.nlm.nih.gov/variation/v0. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2019. This work is written by US Government employees and is in the public domain in the US.

Year:  2020        PMID: 31738401     DOI: 10.1093/bioinformatics/btz856

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

1.  ClinVar: improvements to accessing data.

Authors:  Melissa J Landrum; Shanmuga Chitipiralla; Garth R Brown; Chao Chen; Baoshan Gu; Jennifer Hart; Douglas Hoffman; Wonhee Jang; Kuljeet Kaur; Chunlei Liu; Vitaly Lyoshin; Zenith Maddipatla; Rama Maiti; Joseph Mitchell; Nuala O'Leary; George R Riley; Wenyao Shi; George Zhou; Valerie Schneider; Donna Maglott; J Bradley Holmes; Brandi L Kattman
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

2.  Database resources of the National Center for Biotechnology Information.

Authors:  Eric W Sayers; Jeff Beck; J Rodney Brister; Evan E Bolton; Kathi Canese; Donald C Comeau; Kathryn Funk; Anne Ketter; Sunghwan Kim; Avi Kimchi; Paul A Kitts; Anatoliy Kuznetsov; Stacy Lathrop; Zhiyong Lu; Kelly McGarvey; Thomas L Madden; Terence D Murphy; Nuala O'Leary; Lon Phan; Valerie A Schneider; Françoise Thibaud-Nissen; Bart W Trawick; Kim D Pruitt; James Ostell
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

3.  Database resources of the national center for biotechnology information.

Authors:  Eric W Sayers; Evan E Bolton; J Rodney Brister; Kathi Canese; Jessica Chan; Donald C Comeau; Ryan Connor; Kathryn Funk; Chris Kelly; Sunghwan Kim; Tom Madej; Aron Marchler-Bauer; Christopher Lanczycki; Stacy Lathrop; Zhiyong Lu; Francoise Thibaud-Nissen; Terence Murphy; Lon Phan; Yuri Skripchenko; Tony Tse; Jiyao Wang; Rebecca Williams; Barton W Trawick; Kim D Pruitt; Stephen T Sherry
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 19.160

Review 4.  Progress, Challenges, and Surprises in Annotating the Human Genome.

Authors:  Daniel R Zerbino; Adam Frankish; Paul Flicek
Journal:  Annu Rev Genomics Hum Genet       Date:  2020-05-18       Impact factor: 8.929

5.  Ensembl 2020.

Authors:  Andrew D Yates; Premanand Achuthan; Wasiu Akanni; James Allen; Jamie Allen; Jorge Alvarez-Jarreta; M Ridwan Amode; Irina M Armean; Andrey G Azov; Ruth Bennett; Jyothish Bhai; Konstantinos Billis; Sanjay Boddu; José Carlos Marugán; Carla Cummins; Claire Davidson; Kamalkumar Dodiya; Reham Fatima; Astrid Gall; Carlos Garcia Giron; Laurent Gil; Tiago Grego; Leanne Haggerty; Erin Haskell; Thibaut Hourlier; Osagie G Izuogu; Sophie H Janacek; Thomas Juettemann; Mike Kay; Ilias Lavidas; Tuan Le; Diana Lemos; Jose Gonzalez Martinez; Thomas Maurel; Mark McDowall; Aoife McMahon; Shamika Mohanan; Benjamin Moore; Michael Nuhn; Denye N Oheh; Anne Parker; Andrew Parton; Mateus Patricio; Manoj Pandian Sakthivel; Ahamed Imran Abdul Salam; Bianca M Schmitt; Helen Schuilenburg; Dan Sheppard; Mira Sycheva; Marek Szuba; Kieron Taylor; Anja Thormann; Glen Threadgold; Alessandro Vullo; Brandon Walts; Andrea Winterbottom; Amonida Zadissa; Marc Chakiachvili; Bethany Flint; Adam Frankish; Sarah E Hunt; Garth IIsley; Myrto Kostadima; Nick Langridge; Jane E Loveland; Fergal J Martin; Joannella Morales; Jonathan M Mudge; Matthieu Muffato; Emily Perry; Magali Ruffier; Stephen J Trevanion; Fiona Cunningham; Kevin L Howe; Daniel R Zerbino; Paul Flicek
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

6.  Ensembl 2022.

Authors:  Fiona Cunningham; James E Allen; Jamie Allen; Jorge Alvarez-Jarreta; M Ridwan Amode; Irina M Armean; Olanrewaju Austine-Orimoloye; Andrey G Azov; If Barnes; Ruth Bennett; Andrew Berry; Jyothish Bhai; Alexandra Bignell; Konstantinos Billis; Sanjay Boddu; Lucy Brooks; Mehrnaz Charkhchi; Carla Cummins; Luca Da Rin Fioretto; Claire Davidson; Kamalkumar Dodiya; Sarah Donaldson; Bilal El Houdaigui; Tamara El Naboulsi; Reham Fatima; Carlos Garcia Giron; Thiago Genez; Jose Gonzalez Martinez; Cristina Guijarro-Clarke; Arthur Gymer; Matthew Hardy; Zoe Hollis; Thibaut Hourlier; Toby Hunt; Thomas Juettemann; Vinay Kaikala; Mike Kay; Ilias Lavidas; Tuan Le; Diana Lemos; José Carlos Marugán; Shamika Mohanan; Aleena Mushtaq; Marc Naven; Denye N Ogeh; Anne Parker; Andrew Parton; Malcolm Perry; Ivana Piližota; Irina Prosovetskaia; Manoj Pandian Sakthivel; Ahamed Imran Abdul Salam; Bianca M Schmitt; Helen Schuilenburg; Dan Sheppard; José G Pérez-Silva; William Stark; Emily Steed; Kyösti Sutinen; Ranjit Sukumaran; Dulika Sumathipala; Marie-Marthe Suner; Michal Szpak; Anja Thormann; Francesca Floriana Tricomi; David Urbina-Gómez; Andres Veidenberg; Thomas A Walsh; Brandon Walts; Natalie Willhoft; Andrea Winterbottom; Elizabeth Wass; Marc Chakiachvili; Bethany Flint; Adam Frankish; Stefano Giorgetti; Leanne Haggerty; Sarah E Hunt; Garth R IIsley; Jane E Loveland; Fergal J Martin; Benjamin Moore; Jonathan M Mudge; Matthieu Muffato; Emily Perry; Magali Ruffier; John Tate; David Thybert; Stephen J Trevanion; Sarah Dyer; Peter W Harrison; Kevin L Howe; Andrew D Yates; Daniel R Zerbino; Paul Flicek
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

7.  Deletion of Sphingosine 1-Phosphate receptor 1 in cardiomyocytes during development leads to abnormal ventricular conduction and fibrosis.

Authors:  Ryan Jorgensen; Meghna Katta; Jayne Wolfe; Desiree F Leach; Bianca Lavelle; Jerold Chun; Lisa D Wilsbacher
Journal:  Physiol Rep       Date:  2021-10

8.  Mutation-specific reporter for optimization and enrichment of prime editing.

Authors:  I F Schene; I P Joore; J H L Baijens; R Stevelink; G Kok; S Shehata; E F Ilcken; E C M Nieuwenhuis; D P Bolhuis; R C M van Rees; S A Spelier; H P J van der Doef; J M Beekman; R H J Houwen; E E S Nieuwenhuis; S A Fuchs
Journal:  Nat Commun       Date:  2022-03-01       Impact factor: 17.694

9.  The GA4GH Variation Representation Specification: A computational framework for variation representation and federated identification.

Authors:  Alex H Wagner; Lawrence Babb; Gil Alterovitz; Michael Baudis; Matthew Brush; Daniel L Cameron; Melissa Cline; Malachi Griffith; Obi L Griffith; Sarah E Hunt; David Kreda; Jennifer M Lee; Stephanie Li; Javier Lopez; Eric Moyer; Tristan Nelson; Ronak Y Patel; Kevin Riehle; Peter N Robinson; Shawn Rynearson; Helen Schuilenburg; Kirill Tsukanov; Brian Walsh; Melissa Konopko; Heidi L Rehm; Andrew D Yates; Robert R Freimuth; Reece K Hart
Journal:  Cell Genom       Date:  2021-11-10

10.  Eukaryotic tRNA sequences present conserved and amino acid-specific structural signatures.

Authors:  Eric Westhof; Bryan Thornlow; Patricia P Chan; Todd M Lowe
Journal:  Nucleic Acids Res       Date:  2022-04-22       Impact factor: 19.160

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.