| Literature DB >> 25352552 |
Fiona Cunningham1, M Ridwan Amode1, Daniel Barrell2, Kathryn Beal1, Konstantinos Billis1, Simon Brent3, Denise Carvalho-Silva1, Peter Clapham3, Guy Coates3, Stephen Fitzgerald1, Laurent Gil1, Carlos García Girón1, Leo Gordon1, Thibaut Hourlier1, Sarah E Hunt1, Sophie H Janacek1, Nathan Johnson1, Thomas Juettemann1, Andreas K Kähäri3, Stephen Keenan1, Fergal J Martin1, Thomas Maurel1, William McLaren1, Daniel N Murphy2, Rishi Nag1, Bert Overduin1, Anne Parker1, Mateus Patricio1, Emily Perry1, Miguel Pignatelli1, Harpreet Singh Riat1, Daniel Sheppard1, Kieron Taylor1, Anja Thormann1, Alessandro Vullo1, Steven P Wilder1, Amonida Zadissa1, Bronwen L Aken1, Ewan Birney1, Jennifer Harrow3, Rhoda Kinsella1, Matthieu Muffato1, Magali Ruffier1, Stephen M J Searle3, Giulietta Spudich1, Stephen J Trevanion1, Andy Yates1, Daniel R Zerbino1, Paul Flicek4.
Abstract
Ensembl (http://www.ensembl.org) is a genomic interpretation system providing the most up-to-date annotations, querying tools and access methods for chordates and key model organisms. This year we released updated annotation (gene models, comparative genomics, regulatory regions and variation) on the new human assembly, GRCh38, although we continue to support researchers using the GRCh37.p13 assembly through a dedicated site (http://grch37.ensembl.org). Our Regulatory Build has been revamped to identify regulatory regions of interest and to efficiently highlight their activity across disparate epigenetic data sets. A number of new interfaces allow users to perform large-scale comparisons of their data against our annotations. The REST server (http://rest.ensembl.org), which allows programs written in any language to query our databases, has moved to a full service alongside our upgraded website tools. Our online Variant Effect Predictor tool has been updated to process more variants and calculate summary statistics. Lastly, the WiggleTools package enables users to summarize large collections of data sets and view them as single tracks in Ensembl. The Ensembl code base itself is more accessible: it is now hosted on our GitHub organization page (https://github.com/Ensembl) under an Apache 2.0 open source license.Entities:
Mesh:
Year: 2014 PMID: 25352552 PMCID: PMC4383879 DOI: 10.1093/nar/gku1010
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Ensembl ‘Region in Detail’ view showing the improved annotation of SLC38A3 in GRCh38, regulatory region information, default variation track and Age of Base track. (a) The GRCh38 assembly has resulted in an improved annotation of many genes compared to GRCh37.p13. Here we show the SLC38A3 gene as an example, where updates to the genome sequence now allow an open reading frame to be annotated. SLC38A3 is viewable in the GENCODE Basic track which shows only selected transcripts per gene. (b) The following regulatory tracks are shown from our new Ensembl Regulatory Build: MultiCell regulatory features (regions that are assigned a function that is independent of the cell type); cell type-specific regulatory data (for 4 selected cell types, out of 18 available - a regulatory feature is shaded in grey if it is inactive in the corresponding cell type, below each cell type is the cell type-specific segmentation track); H1ESC cell line Whole Genome Bisulphite Sequencing; Fantom5 (enhancers and promoters defined by the FANTOM5 project). (c) A track for variants genotyped by the 1000 Genomes project (phase 1) with frequency of at least 1% across any population is now on by default. The variants are colour coded by most severe consequence type (see Variation Legend in the lower part of the image). (d) Age of Base track: Each base pair that differs by a substitution in the human genome is classified as an event according to when it occurred: before the primate evolutionary branch (grey), in the primate-specific branch (blue), in the human-specific branch and is now fixed (red) or in the human-specific branch as a segregating variant (yellow).
Figure 2.Improved VEP web tool and interface: The output is summarized using pie charts and statistics. Additionally, the results preview can be filtered on any output column. The output data can be filtered and downloaded for further analysis.