| Literature DB >> 26590259 |
Matthew L Speir1, Ann S Zweig2, Kate R Rosenbloom2, Brian J Raney2, Benedict Paten2, Parisa Nejad2, Brian T Lee2, Katrina Learned2, Donna Karolchik2, Angie S Hinrichs2, Steve Heitner2, Rachel A Harte3, Maximilian Haeussler2, Luvina Guruvadoo2, Pauline A Fujita4, Christopher Eisenhart2, Mark Diekhans2, Hiram Clawson2, Jonathan Casper2, Galt P Barber2, David Haussler5, Robert M Kuhn2, W James Kent2.
Abstract
For the past 15 years, the UCSC Genome Browser (http://genome.ucsc.edu/) has served the international research community by offering an integrated platform for viewing and analyzing information from a large database of genome assemblies and their associated annotations. The UCSC Genome Browser has been under continuous development since its inception with new data sets and software features added frequently. Some release highlights of this year include new and updated genome browsers for various assemblies, including bonobo and zebrafish; new gene annotation sets; improvements to track and assembly hub support; and a new interactive tool, the "Data Integrator", for intersecting data from multiple tracks. We have greatly expanded the data sets available on the most recent human assembly, hg38/GRCh38, to include updated gene prediction sets from GENCODE, more phenotype- and disease-associated variants from ClinVar and ClinGen, more genomic regulatory data, and a new multiple genome alignment.Entities:
Mesh:
Year: 2015 PMID: 26590259 PMCID: PMC4702902 DOI: 10.1093/nar/gkv1275
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
New and updated genome browsers
| Common name | Scientific name | Sequencing center | UCSC ID | Sequencing center ID |
|---|---|---|---|---|
| Bonobo | Max Planck Institute for Evolutionary Anthropology | panPan1 | Max-Planck Institute panpan1 | |
| Fruit fly | FlyBase Consortium, Berkeley Drosophila Genome Project, Celera Genomics | dm6 | BDGP Release 6 + ISO1 MT | |
| Tarsier | Washington University | tarSyr2 | Tarsius_syrichta-2.0.1 | |
| Zebrafish | Genome Reference Consortium | danRer10 | GRCz10 | |
New and updated annotation tracks in the Genome Browser
| Annotation track name | Track group | Assembly |
|---|---|---|
| Clone Ends | Mapping | hg38 |
| GRC Patch Releases p2, p3 | Mapping | hg38 |
| STS Markers | Mapping | hg38 |
| Augustus Genes | Genes | hg19, hg38 |
| GENCODE Genes V20, V22 | Genes | hg38 |
| IKMC Genes Mapped | Genes | hg38 |
| Retroposed Genes V9 | Genes | hg38 |
| SIB Genes | Genes | hg38 |
| SGP Genes | Genes | hg38 |
| UniProt Protein Annotations | Genes | hg19 |
| UniProt Protein Primary/Secondary Structure Annotations | Genes | hg19 |
| ClinGen CNVs | Phenotype | hg38 |
| ClinVar Variants | Phenotype | hg38 |
| GWAS Catalog | Phenotype | hg38 |
| HGMD Variants (update) | Phenotype | hg19 |
| SIB Alt-splicing | mRNA and EST | hg38 |
| Affy GNF1H | Expression | hg38 |
| Affy U133 | Expression | hg38 |
| Affy U95 | Expression | hg38 |
| GNF Atlas 2 | Expression | hg38 |
| GWIPS-viz Riboseq | Expression | hg19 |
| PeptideAtlas | Expression | hg19 |
| ENCODE DNase Clusters (update) | Regulation | hg19 |
| ENCODE DNase Hypersensitivity | Regulation | hg19 |
| ENCODE Regulation – Transcription | Regulation | hg38 |
| ENCODE Regulation – Layered H3K4Me1 | Regulation | hg38 |
| ENCODE Regulation – Layered H3K4Me3 | Regulation | hg38 |
| ENCODE Regulation – Layered H3K27Ac | Regulation | hg38 |
| ENCODE Regulation – DNase Hypersensitivity Clusters | Regulation | hg38 |
| ENCODE Regulation – DNase Hypersensitivity Signal | Regulation | hg38 |
| ENCODE Regulation – DNase Hypersensitivity | Regulation | hg38 |
| Multiz Alignment and Conservation (20 species) | Comparative | hg38 |
| Dog, Mouse, Rhesus Chain/Net tracks (update) | Comparative | hg38 |
| Bonobo, Cow, Tarsier, Zebrafish Chain/Net tracks | Comparative | hg38 |
| Database of Genomic Variants: Structural Variation (update) | Variation | hg18, hg19, hg38 |
| Exome Aggregation Consortium (ExAC) Variants and Calling Regions | Variation | hg19 |
| Simple Nucleotide Polymorphisms (dbSNP 141/142) | Variation | hg19, hg38 |
| Segmental Duplications | Repeats | hg38 |
| Microsatellites | Repeats | hg16, hg38 |
| GenBank-derived (including RefSeq Genes, CCDS, mRNAs, ESTs, etc.) (update) | Genes, mRNA and EST | hg16, hg17, hg18, hg19, hg38 |
| Augustus Genes | Genes | mm7, mm9, mm10 |
| GENCODE Genes VM3, VM4 | Genes | mm10 |
| Retroposed Genes V6 | Genes | mm10 |
| SGP Genes | Genes | mm10 |
| UCSC Genes | Genes | mm10 |
| Human Chain/Net (update) | Comparative | mm10 |
| Cow Chain/Net track | Comparative | mm10 |
| GenBank-derived (including RefSeq Genes, CCDS, mRNAs, ESTs, etc.) (update) | Genes, mRNA and EST | mm7, mm8, mm9, mm10 |
| Horse QTL | Mapping | equCab2 |
| Augustus Genes | Genes | Many |
| Ensembl Genes 76, 78, 79, 81 | Genes | Many |
| muPIT Protein Map | Genes | eboVir3 |
| SGP Genes | Genes | bosTau8, eriEur2, rn6 |
| Immune Epitope Database (IEDB) HLA Binding Predictions | Immunology | eboVir3 |
| Multiz Alignment and Conservation (20 species) | Comparative | tarSyr2 |
| Multiz Alignment and Conservation (27 species) | Comparative | dm6 |
| New Sequences | Comparative | eboVir3 |
| Microsatellites | Repeats | Many |
| GenBank-derived (including RefSeq Genes, CCDS, mRNAs, ESTs, etc.) (update) | Genes, mRNA and EST | Many |
New track and assembly data hubs
| Hub Name | Source | Assemblies |
|---|---|---|
| CPTAC Hub v1 | Clinical Proteomic Tumor Analysis Consortium (CPTAC) | hg19 |
| LIBD Human DLPFC Development | Lieber Institute for Brain Development (LIBD) | hg19 |
| PhyloCSF | Broad Institute | hg19, hg38, mm10 |
| Porcine DNA methylation and gene transcription | Laboratory of Comparative Genomics, University of Illinois and Animal Breeding and Genomics Centre, Wageningen University | susSCr3 |
| rfam12_ncRNA | Rfam at the Wellcome Trust Sanger Institute | hg38, mm10, galGal4, danRer7, ci2, dm6, ce10, sacCer3 |
| Vista Enhancers | VISTA Enhancer Group, Lawrence Berkeley Laboratories | hg19, mm9, mm10 |
| C_elegans_isolates | Bob Waterston Lab, University of Washington | A C. elegans environmental isolate genome |
Figure 1.Genome Browser screenshot focused on a region of the LHX6 gene that highlights a selection of the new tracks added in the previous year for the hg38/GRCh38 human assembly. The tracks shown in this display (from top to bottom) include GENCODE Genes V22, transcription levels assayed across 9 ENCODE cell lines, DNase hypersensitive regions based on data from 95 ENCODE cell lines, genome-wide conservation scores calculated using phastCons, a multiple genome alignment created using Lastz and Multiz, and pathogenic CNVs from the ClinGen database. This screenshot demonstrates how one can use the ENCODE transcription levels and DNase hypersensitive regions in the Genome Browser to identify expression of different transcripts in different cells. The multiple alignment and conservation scores can be used to look for other potential regions of importance. Lastly, one can use the ClinGen CNVs and other pathogenicity data in the browser to determine what role, if any, a particular gene plays in disease.