| Literature DB >> 24285302 |
Jacqueline A L MacArthur1, Joannella Morales, Ray E Tully, Alex Astashyn, Laurent Gil, Elspeth A Bruford, Pontus Larsson, Paul Flicek, Raymond Dalgleish, Donna R Maglott, Fiona Cunningham.
Abstract
Locus Reference Genomic (LRG; http://www.lrg-sequence.org/) records contain internationally recognized stable reference sequences designed specifically for reporting clinically relevant sequence variants. Each LRG is contained within a single file consisting of a stable 'fixed' section and a regularly updated 'updatable' section. The fixed section contains stable genomic DNA sequence for a genomic region, essential transcripts and proteins for variant reporting and an exon numbering system. The updatable section contains mapping information, annotation of all transcripts and overlapping genes in the region and legacy exon and amino acid numbering systems. LRGs provide a stable framework that is vital for reporting variants, according to Human Genome Variation Society (HGVS) conventions, in genomic DNA, transcript or protein coordinates. To enable translation of information between LRG and genomic coordinates, LRGs include mapping to the human genome assembly. LRGs are compiled and maintained by the National Center for Biotechnology Information (NCBI) and European Bioinformatics Institute (EBI). LRG reference sequences are selected in collaboration with the diagnostic and research communities, locus-specific database curators and mutation consortia. Currently >700 LRGs have been created, of which >400 are publicly available. The aim is to create an LRG for every locus with clinical implications.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24285302 PMCID: PMC3965024 DOI: 10.1093/nar/gkt1198
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.LRG creation process. Steps involved in the creation of each LRG, from initial request to the LRG being made public. All steps are carried out jointly by EBI and NCBI in close collaboration with requesters and collaborators.
Figure 2.Content and structure of LRG file. The information in each LRG is divided into two sections, fixed and updatable, each with specific content.
Figure 3.LRG-specific exon numbering scheme. Each LRG contains a fixed LRG-specific exon numbering scheme. Exons of all transcripts included in the fixed section are numbered sequentially 5′–3′ according to their genomic location. This numbering is then applied to individual transcripts. If a transcript is added to the LRG after it has been made publicly available, the existing exon numbering will not change, with any additional exons being assigned numbers in collaboration with the community.