| Literature DB >> 34718705 |
Brian T Lee1, Galt P Barber1, Anna Benet-Pagès1,2, Jonathan Casper1, Hiram Clawson1, Mark Diekhans1, Clay Fischer1, Jairo Navarro Gonzalez1, Angie S Hinrichs1, Christopher M Lee1, Pranav Muthuraman1, Luis R Nassar1, Beagan Nguy1, Tiana Pereira1, Gerardo Perez1, Brian J Raney1, Kate R Rosenbloom1, Daniel Schmelter1, Matthew L Speir1, Brittney D Wick1, Ann S Zweig1, David Haussler1, Robert M Kuhn1, Maximilian Haeussler1, W James Kent1.
Abstract
The UCSC Genome Browser, https://genome.ucsc.edu, is a graphical viewer for exploring genome annotations. The website provides integrated tools for visualizing, comparing, analyzing, and sharing both publicly available and user-generated genomic datasets. Data highlights this year include a collection of easily accessible public hub assemblies on new organisms, now featuring BLAT alignment and PCR capabilities, and new and updated clinical tracks (gnomAD, DECIPHER, CADD, REVEL). We introduced a new Track Sets feature and enhanced variant displays to aid in the interpretation of clinical data. We also added a tool to rapidly place new SARS-CoV-2 genomes in a global phylogenetic tree enabling researchers to view the context of emerging mutations in our SARS-CoV-2 Genome Browser. Other new software focuses on usability features, including more informative mouseover displays and new fonts.Entities:
Mesh:
Year: 2022 PMID: 34718705 PMCID: PMC8728131 DOI: 10.1093/nar/gkab959
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.On the Genome Browser Gateway page, a variety of genome assemblies are available. Organisms can be located using a phylogenetic tree below the search box on the left side of the page or via text search. The gateway search box also includes results for public assembly hubs, including from the new GenArk Genome Archive. Here, a search for ‘dog’ returns an internal Canis lupus familiaris result as well as results for several specific dog breeds in external public assembly hubs with labrador (GCF_014441545.1) selected.
New public hubs, 2020–2021
| Track Hub Name | Provider | Assemblies |
|---|---|---|
| ALFA (Allele Frequency Aggregator) | NCBI, dbGaP | hg19, hg38 |
| Digital genomic footprinting | Altius Institute (Jeff Vierstra) | hg38 |
| ENCODE DNA, RNA, and Integrative cCRE Hubs | UMass Medical School (Zhiping Weng) | hg19, hg38, mm10 |
| Exp/Meth VNTR | Icahn School of Medicine at Mount Sinai (Andrew Sharp) | hg19, hg38 |
| GENCODE Updates | GENCODE | hg38, mm10, mm39 |
| Genome Archive (GenArk) | NCBI/VGP | 1,331 assemblies |
| PhyloCSF++ | Johns Hopkins University (Steven Salzberg and Christopher Pockrandt) and the Seoul National University (Martin Steinegger) | rn6, fr3, gasAcu1, tarSyr2, sacCer3 |
| T-cell ChIP and ATAC seq | Scripps Research (Matthew Pipkin) | mm10 |
| UniBind 2021 | University of Oslo (Rafael Riudavets Puig) | hg38, mm10, ce11, dm6, danRer11, sacCer3, rn6, araTha1 |
Figure 2.New recommended track sets feature on the human hg19 assembly. Selection in the top blue menubar under ‘Genome Browser’ launches a dialog box offering three pre-configured track sets. These collections support clinical scenarios for interpreting variants, where a user can swap from one view to another view without changing position.
Figure 3.The human hg38 ClinVar Interpretations and related ClinVar Short Variants track showing part of the FOXG1 gene. ClinVar Interpretations is the first track to use our bead graph display, which is a variation of our existing lollipop display. The size of the bead on the line represents the number of submissions at that genomic position. The color of the beads further aids to distinguish the categories (red = pathogenic or likely pathogenic, blue = variant of unknown significance, green = benign or likely benign). The image also demonstrates another improved feature in the ClinVar Short Variants track: expanded support for complex mouseover text on individual items, also available for track hubs.
Figure 4.Sequences uploaded to UShER are highlighted in a blue horizontal background alongside their most closely related public sequences, at the top of the tree in this image. This allows researchers to investigate relationships among samples, and evaluate their mutations using information from other tracks such as the antibody escape track, second from the bottom. The figure displays two spike protein mutations that overlap with items in the antibody escape track where mutations are colored red for non-synonymous changes.
Figure 5.The mouseover display in a blue box (∼0.65342) gives instant access to data values contained in a signal track. The tilde indicates that multiple values are combined at the location due to data compression at this visibility level. Here the value indicates H3K4me1 scores from human fetal lung fibroblasts, from data in the Roadmap Epigenomics Integrative Analysis public hub, one of the most accessed public hubs. This feature is now available on all signal tracks.
Figure 6.Comparison of the display quality between the new fonts (vector-based with anti-aliasing) on the left showing a stop codon for the PITX2 gene and the old (bitmap-based) fonts on the right showing the same location.
Figure 7.The new location of the multi-region button adjacent to the position text box (red rectangle). The button is highlighted when multi-region is activated, and the configure screen now includes a prominent help link at the top. The image in the background shows data from the rare harmful variants track associated with severe COVID outcomes, displaying eight genes across five different chromosomes, highlighted in alternating white or blue backgrounds.