| Literature DB >> 25428374 |
Kate R Rosenbloom1, Joel Armstrong2, Galt P Barber2, Jonathan Casper2, Hiram Clawson2, Mark Diekhans2, Timothy R Dreszer2, Pauline A Fujita2, Luvina Guruvadoo2, Maximilian Haeussler2, Rachel A Harte2, Steve Heitner2, Glenn Hickey2, Angie S Hinrichs2, Robert Hubley3, Donna Karolchik2, Katrina Learned2, Brian T Lee2, Chin H Li2, Karen H Miga2, Ngan Nguyen2, Benedict Paten2, Brian J Raney2, Arian F A Smit3, Matthew L Speir2, Ann S Zweig2, David Haussler4, Robert M Kuhn2, W James Kent2.
Abstract
Launched in 2001 to showcase the draft human genome assembly, the UCSC Genome Browser database (http://genome.ucsc.edu) and associated tools continue to grow, providing a comprehensive resource of genome assemblies and annotations to scientists and students worldwide. Highlights of the past year include the release of a browser for the first new human genome reference assembly in 4 years in December 2013 (GRCh38, UCSC hg38), a watershed comparative genomics annotation (100-species multiple alignment and conservation) and a novel distribution mechanism for the browser (GBiB: Genome Browser in a Box). We created browsers for new species (Chinese hamster, elephant shark, minke whale), 'mined the web' for DNA sequences and expanded the browser display with stacked color graphs and region highlighting. As our user community increasingly adopts the UCSC track hub and assembly hub representations for sharing large-scale genomic annotation data sets and genome sequencing projects, our menu of public data hubs has tripled.Entities:
Mesh:
Year: 2014 PMID: 25428374 PMCID: PMC4383971 DOI: 10.1093/nar/gku1177
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
New genome browsers
| Common Name | Scientific name | Sequencing Center | Center Date and Assembly ID | UCSC ID |
|---|---|---|---|---|
| Chinese hamster | Beijing Genomics Institute-Shenzen | July 2013 C_griseus_v1.0 | criGri1 | |
| Elephant shark | Institute of Molecular & Cell Biology, Singapore | December 2013 Callorhinchus_milii-6.1.3 | calMil1 | |
| Minke whale | Korea Ocean Research & Development Institute | October 2013 BalAcu1.0 | balAcu1 | |
| Ebola virus | Broad Institute | Jun 2014 West Africa 01 EBOV/G3683/KM034562.1 | eboVir3 | |
| aBald eagle | The Bald Eagle Consortium | August 2014 Haliaeetus_leucocephalus-4.0 | halLeu1 | |
| aChinese pangolin | Washington University | August 2014 m_pentadactyla-1.1.1 | manPen1 | |
| aHooded crow | Uppsala University | August 2014 Hooded_crow_genome | corCor1 | |
| Ebola virus | Broad Institute | Jun 2014 West Africa 01 EBOV/G3683/KM034562.1 | eboVir3 | |
| Alpaca | Broad Institute | March 2013 Vicugna_pacos-2.0.1 | vicPac2 | |
| Hedgehog | Broad Institute | May 2012 EriEur2.0 | eriEur2 | |
| Human | Genome Reference Consortium | December 2013 GRCh38 | hg38 | |
| Pika | Broad Institute | May 2012 OchPri3.0 | ochPri3 | |
| Rat | Rat Genome Sequencing Consortium | July 2014 Rnor_6.0 | rn6 | |
| Sheep | International Sheep Genome Consortium | August 2012 Oar_v3.1 | oviAri3 | |
| Shrew | Broad Institute | August 2008 sorAra2 | sorAra2 | |
| Tenrec | Broad Institute | November 2012 echTel2 | echTel2 | |
| Zebra Finch | Washington University | February 2013 taeGut2 | taeGut2 | |
| bCow | University of Maryland | June 2014 Bos_taurus_UMD_3.1.1 | bosTau8 | |
aIn progress.
bExpected release by January 1, 2015.
New annotation tracks in the Genome Browser
| Annotation track name | Track group | Assembly |
|---|---|---|
| Human genome | ||
| Centromere locations | Mapping | hg38 |
| Chromosome Bands Localized by FISH Mapping Clones | Mapping | hg38 |
| GRC Incident Database | Mapping | hg38 |
| Locus Reference Genome (LRG) Regions | Mapping | hg19, hg38 |
| Gene Annotations from GENCODE Version 19 (Ensembl 76) | Genes | hg19, hg38 |
| Ensembl Genes 74, 75 | Genes | hg19 |
| UCSC Genes | Genes | hg38 |
| ClinVar Variants | Phenotype | hg19 |
| DNA Sequences in Web Pages Indexed by Bing.com/Microsoft Research | Phenotype | hg19 |
| Genetic Association Studies of Complex Diseases and Disorders (update) | Phenotype | hg19 |
| GeneReviews | Phenotype | hg38 |
| OMIM Genes | Phenotype | hg38 |
| Genome Segmentations from ENCODE | Regulation | hg19 |
| Transcription Factor ChIP-seq (161 factors) from ENCODE with Factorbook Motifs | Regulation | hg19 |
| UCSF Brain DNA Methylation | Regulation | hg19 |
| Database of Genomic Variants: Structural Variation (update) | Variation | hg18, hg19 |
| NHLBI GO Exome Sequencing Project (ESP)—Variants from 6503 Exomes | Variation | hg19 |
| Simple Nucleotide Polymorphisms (dbSNP 141) | Variation | hg19 |
| SNP 135/137 (corrections) | Variation | hg19 |
| Alpaca, American alligator, Cat, Chinese hamster, Hedgehog, Pika, Sheep, Shrew, Tenrec, Zebra finch Chain/Net tracks | Comparative | hg19 |
| Multiz Alignment & Conservation (seven species) | Comparative | hg38 |
| Vertebrate Multiz Alignment & Conservation (100 Species) | Comparative | hg19 |
| Multiz Alignment & Conservation (seven species) | Comparative | hg38 |
| phastBias gBGC predictions | Comparative | hg18, hg19 |
| Mouse genome | ||
| INSDC Chromosome Names | Mapping | mm10 |
| MGI Quantitative Trait Loci | Mapping | mm10 |
| Ensembl Genes v74, v75 | Genes | mm10 |
| Ensembl Genes v73 | Genes | mm9 |
| GENCODE Genes vM2 | Genes | mm10 |
| UCSC Genes | Genes | mm10 |
| DNA Sequences in Web Pages Indexed by Bing.com/Microsoft Research | Literature | mm9 |
| FaceBase 24 Sample Types | Expression | mm10 |
| SNP 138 | Variation | mm10 |
| Alpaca, Human, Shrew Chain/Net tracks | Comparative | mm10 |
| Vertebrate Multiz Alignment and Conservation (60 species) | Comparative | mm10 |
| Other genomes | ||
| Accession at International Nucleotide Sequence Database Collaboration (INSDC) | Mapping | 46 species |
| Ensembl Genes v74 | Genes | 17 species |
| Ensembl Genes v75 | Genes | 41 species |
Public track hubs in the Genome Browser
| Hub | Source | Assemblies |
|---|---|---|
| In progress | ||
| Ensembl Regulatory Build | Ensembl | hg19, hg38 |
| FaceBase Hub | FaceBase Consortium | mm9, mm10, hg18, hg19 |
| FANTOM5 CAGE RECLU Analysis | RIKEN Institute | hg18, hg19 |
| Released September 2013–August 2014 | ||
| FANTOM5 CAGE | RIKEN Institute | hg19 |
| Zebrafish Genomics | Shawn Burgess Lab, NHGRI | danRer7 |
| Broad Improved Canine Annotation v1 | Vertebrate Biology Group, Broad Inst | canFam3 |
| Ultraconserved Elements | David Haussler Lab, UCSC | hg19 |
| Roadmap Epigenomics Integrative Analysis Hub | Roadmap Epigenomics Project, WUSTL | hg19 |
| McGill Epigenomics | McGill Epigenomics Mapping Centre | hg19 |
| 454K562 and HelaS3 RNA-seq | Mike Snyder Lab, Stanford | hg19 |
| BCGSC Epigenomics | Centre for Epigenomic Mapping Technologies, Vancouver | hg19 |
| Older | ||
| Blueprint Epigenomics | European Bioinformatics Institute | hg19 |
| Cancer Genome polyA Site & Usage | University of Pittsburgh and Helicos Biosciences | hg19 |
| Brain H3K4me3 ChIP-seq | Weng Lab, UMass Medical School | hg19 |
| Roadmap Epigenomics Data Complete Collection at Wash U VizHub | Roadmap Epigenomics Project, WUSTL | hg19 |
| miRcode microRNA Sites | Larsson Lab, Sahlgrenska Academy, University of Gothenburg, Sweden | hg19 |
| Translation Initiation Sites | Hampe Lab, University Hospital Schleswig-Holstein, Kiel, Germany | hg19 |
| SDSU Exon Array Gene Expression | Bioinformatics group at South Dakota State University | hg19, mm9, rn4 |
| DNA Methylation | Andrew Smith Lab, USC | hg18, hg19, mm9, rheMac3 |
Public assembly hubs in the Genome Browser
| Hub name | Source | Assemblies |
|---|---|---|
| In progress | ||
| Croc and Bird Hub | David Haussler Lab, UCSC | 24 bird, turtle, and crocodilian genomes, plus 22 reconstructed ancestral genomes |
| Released this year | ||
| EcoliCompHub | David Haussler Lab, UCSC | 57 |
| EcoliCompHubWtDups | David Haussler Lab, UCSC | 57 |
| Plants | UCSC Genome Bioinformatics Group | araTha1 ( |
| ThorntonHub | Kevin Thornton lab, UC Irvine | |
| Released last year | ||
| DNA Methylation | Andrew Smith Lab, USC | tair10 ( |
Figure 1.Genome Browser screenshot of the transcription start site of the ZFX gene illustrating the new stacked color graph display. The enrichment of histone marks H3K4me1, H3K4me3 and H3K27ac is shown in three browser tracks, where contribution to the sum of the signal by each of the six different cell types is indicated by color. Note that the K3K4me1 signal is low in all cell types across the promoter and at the beginning of the transcript, but high on either side of the promoter. The other two marks show the opposite tendency, being highest at the promoter and declining with distance from the promoter. In general the signals are strongest in the turquoise cell line (HUVEC).
Figure 2.Genome Browser image of interspersed repeat (IR) annotation on a short region of the human assembly hg38 using the new visualization track. This region contains several layers of nested IRs, created when transposable elements insert into and break up older IR sequences. The solid lines join aligned fragments of an IR and can often reveal interesting features of a locus such as target site duplications. The dashed lines represent the unaligned portions of the transposable element sequence and allow one to visually decipher the relative consensus position of the fragments. When it is not practical to draw the unaligned sequence to scale, the length is displayed between two hash marks. Each annotation box encodes the class of the repeat using the colored edge, the divergence of the repeat with the grayscale interior fill color (lighter levels of gray indicate higher divergence) and the orientation of the repeat using arrowheads.