| Literature DB >> 24270787 |
Donna Karolchik1, Galt P Barber, Jonathan Casper, Hiram Clawson, Melissa S Cline, Mark Diekhans, Timothy R Dreszer, Pauline A Fujita, Luvina Guruvadoo, Maximilian Haeussler, Rachel A Harte, Steve Heitner, Angie S Hinrichs, Katrina Learned, Brian T Lee, Chin H Li, Brian J Raney, Brooke Rhead, Kate R Rosenbloom, Cricket A Sloan, Matthew L Speir, Ann S Zweig, David Haussler, Robert M Kuhn, W James Kent.
Abstract
The University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a large collection of organisms, primarily vertebrates, with an emphasis on the human and mouse genomes. The Browser's web-based tools provide an integrated environment for visualizing, comparing, analysing and sharing both publicly available and user-generated genomic data sets. As of September 2013, the database contained genomic sequence and a basic set of annotation 'tracks' for ∼90 organisms. Significant new annotations include a 60-species multiple alignment conservation track on the mouse, updated UCSC Genes tracks for human and mouse, and several new sets of variation and ENCODE data. New software tools include a Variant Annotation Integrator that returns predicted functional effects of a set of variants uploaded as a custom track, an extension to UCSC Genes that displays haplotype alleles for protein-coding genes and an expansion of data hubs that includes the capability to display remotely hosted user-provided assembly sequence in addition to annotation data. To improve European access, we have added a Genome Browser mirror (http://genome-euro.ucsc.edu) hosted at Bielefeld University in Germany.Entities:
Mesh:
Year: 2013 PMID: 24270787 PMCID: PMC3964947 DOI: 10.1093/nar/gkt1168
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
New and updated genome assemblies added to the Genome Browser since September 2012
| Common name | Scientific name | Sequencing center | UCSC ID | Seq. ctr ID |
|---|---|---|---|---|
| Primates | ||||
| Baboon | Baylor College of Medicine HGSC | papHam1 | Pham_1.0 | |
| Baboon | Baylor College of Medicine HGSC | papAnu2 | Panu_2.0 | |
| Bushbaby | Broad Institute | otoGar3 | OtoGar3 | |
| Chimpanzee | Chimpanzee Sequencing and Analysis Consortium | panTro4 | Build 2.1.4 | |
| Gibbon | Gibbon Genome Sequencing Consortium | nomLeu2 | Nleu1.1 | |
| nomLeu3 | Nleu3.0 | |||
| Mouse lemur | Broad Institute | micMur1 | MicMur1.0 | |
| Rhesus macaque | Beijing Genomics Institute | rheMac3 | CR_1.0 | |
| Squirrel monkey | Broad Institute | saiBol1 | SaiBol1.0 | |
| Tarsier | Broad Institute | tarSyr1 | Tarsyr1.0 | |
| Other mammals | ||||
| Alpaca | Broad Institute | vicPac1 | VicPac1.0 | |
| vicPac2 | VicPac2.0 | |||
| Armadillo | Baylor College of Medicine HGSC | dasNov3 | DasNov3 | |
| Cat | International Cat Genome Sequencing Consortium | felCat5 | Felis_catus-6.2 | |
| Dolphin | Baylor College of Medicine HGSC | turTru2 | Ttru_1.4 | |
| Ferret | Ferret Genome Sequencing Consortium | musFur1 | MusPutFur1.0 | |
| Hedgehog | Broad Institute | eriEur1 | Draft_v1 | |
| Kangaroo rat | Baylor College of Medicine HGSC, Broad Institute | dipOrd1 | DipOrd1.0 | |
| Manatee | Broad Institute | triMan1 | TriManLat1.0 | |
| Megabat | Broad Institute | pteVam1 | PteVap1.0 | |
| Naked mole rat | Broad Institute | hetGla2 | HetGla_female_1.0 | |
| Pig | Swine Genome Sequencing Consortium | susScr3 | Sscrofa10.2 | |
| Pika | Broad Institute | ochPri2 | OchPri2 | |
| Rock hyrax | Baylor College of Medicine HGSC | proCap1 | Procap1.0 | |
| Shrew | Broad Institute | sorAra1 | SorAra1.0 | |
| Sloth | Broad Institute | choHof1 | ChoHof1.0 | |
| Southern white rhinoceros | Broad Institute | cerSim1 | cerSimSim1.0 | |
| Squirrel | Broad Institute | speTri2 | SpeTri2.0 | |
| Tree shrew | Broad Institute | tupBel1 | Tupbel1.0 | |
| Other vertebrates | ||||
| American alligator | Int’l Crocodilian Genomes Working Group | allMis1 | allMis0.2 | |
| Atlantic cod | Genofisk | gadMor1 | GadMor_May2010 | |
| Budgerigar | Genome Institute at Wash. Univ. St. Louis | melUnd1 | v6.3 | |
| Coelacanth | Broad Institute | latCha1 | LatCha1 | |
| Lamprey | Genome Institute at Wash. Univ. St. Louis | petMar2 | WUGSC 7.0 | |
| Nile tilapia | Broad Institute | oreNil2 | OreNil1.1 | |
The ‘UCSC ID’ column shows the Genome Browser database designation for the genome assembly.
New and updated annotation data sets added to the Genome Browser between September 2012 and September 2013
| Annotation track | Assembly |
|---|---|
| Human genome | |
| 1000 Genomes Phase 1 Integrated Variant Calls | hg19 |
| 1000 Genomes Phase 1 Paired-end Accessible Regions | hg19 |
| Affymetrix CytoScan HD Array | hg19 |
| Coriell Cell Line Copy Number Variants | hg19 |
| Denisova: Modern Human Derived, Sequence Reads, Variant Calls, Variant Calls from 11 Modern Human Genome Sequences | hg19 |
| DGV: Structural Variation | hg18-19 |
| DNaseI Hypersensitivity Uniform Peaks— ENCODE/Analysis | hg19 |
| ENCODE Regulation: DNaseI HS Clusters, Transaction Factor ChiP-seq Clusters | hg19 |
| GENCODE Genes v14, v17 | hg19 |
| GeneReviews | hg18-19 |
| GRCh37 Patch 10 | hg19 |
| GWAS Catalog of Published Genome-Wide Association Studies | hg18-19 |
| Human Gene Mutation Database (HGMD) | hg19 |
| Leiden Open Variation Database (LOVD) | hg19 |
| Pfam domains in UCSC Genes | hg19 |
| Proteogenomics and GENCODE Mapping—ENCODE | hg19 |
| qPCR Primers | hg19 |
| Reactome v41 | hg17-19 |
| Retroposed Genes | hg19 |
| SNPs (Build 137): All SNPs, Common SNPs, Flagged SNPs, Mult. SNPs | hg19 |
| SNPs (Build 138): All SNPs, Common SNPs, Flagged SNPs, Mult. SNPs | hg19 |
| Transcription Factor ChIP-seq Uniform Peaks— ENCODE/Analysis | hg19 |
| UCSC Genes | hg19 |
| UniProt Mutations | hg19 |
| Mouse genome | |
| 60-species Conservation | mm10 |
| GRC Incident Database | mm10 |
| GRCm38 Patch Release 1 | mm10 |
| Mouse strain variants | mm10 |
| qPCR Primers | mm10 |
| Reactome v.41 | mm8-9 |
| SNPs (Build 137) | mm10 |
| UCSC Genes | mm10 |
| Cow genome | |
| NumtS Nuclear Mitochondrial Sequences | bosTau6 |
| Pig genome | |
| NumtS Nuclear Mitochondrial Sequences | susScr2 |
| Multiple genomes | |
| Ensembl Genes | Many |
| Human proteins | Many |
| Publications track | Many |
Figure 1.The haplotype alleles display for the ABO gene, which encodes proteins related to the ABO blood group system. A large portion of the ‘Predicted full sequence’ section is truncated in the upper image for display purposes, and is shown in greater detail in the lower image. The leftmost columns of the top image indicate the frequency of each allele haplotype within the 1000 Genomes sample and the occurrence of homozygosity for each allele. In this instance the haplotype alleles display has been expanded to show the distribution of the haplotypes across the major 1000 Genomes population groups. The ‘Variant Sites’ columns summarize the non-synonymous variant sites that occur in at least 1% of the subject chromosomes, with the value from the reference genome (in this case GRCh37/hg19) indicated at the top of each variant column. In all but one case, the ‘O’ phenotype results from a common insertion (indicated by ‘-’ in the reference) causing a frameshift (indicated by ‘[≫]’) that results in a downstream premature stop codon, thus truncating the protein. Note that although certain haplotyes are more frequently found within one population, the insertion that gives rise to the majority of ‘O’ phenotypes is found across all populations, which may indicate that the insertion predates the most recent migration out of Africa. On the other hand, the haplotype in which the SNP variant introduces a stop codon at the variant site may have arisen in the Americas. The zoomed-in view of the ‘Predicted full sequence’ section in the bottom image shows the reference sequence (top row) and sequences incorporating the common non-synonymous variants. The residues corresponding to the variant sites are highlighted by green vertical bars, the site corresponding to the frameshift-causing insertion is highlighted by a blue bar and changes to the reference amino acid sequence are shown in red.