| Literature DB >> 27899611 |
Konrad J Karczewski1,2, Ben Weisburd3,2, Brett Thomas3,2, Matthew Solomonson3,2, Douglas M Ruderfer4, David Kavanagh4, Tymor Hamamsy4, Monkol Lek3,2, Kaitlin E Samocha3,2, Beryl B Cummings3,2, Daniel Birnbaum3,2, Mark J Daly3,2, Daniel G MacArthur3,2.
Abstract
Worldwide, hundreds of thousands of humans have had their genomes or exomes sequenced, and access to the resulting data sets can provide valuable information for variant interpretation and understanding gene function. Here, we present a lightweight, flexible browser framework to display large population datasets of genetic variation. We demonstrate its use for exome sequence data from 60 706 individuals in the Exome Aggregation Consortium (ExAC). The ExAC browser provides gene- and transcript-centric displays of variation, a critical view for clinical applications. Additionally, we provide a variant display, which includes population frequency and functional annotation data as well as short read support for the called variant. This browser is open-source, freely available at http://exac.broadinstitute.org, and has already been used extensively by clinical laboratories worldwide.Entities:
Mesh:
Year: 2016 PMID: 27899611 PMCID: PMC5210650 DOI: 10.1093/nar/gkw971
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Gene page. (A) Gene information is summarized, including links to various external resources, as well as constraint information as described in (3). For all exons in the canonical transcript, we display (B) base-level coverage for a number of metrics (mean coverage by default), as well as (C) position and frequency information for all variants, including CNVs. (D) A table of all variants is provided with additional annotation information and links to variant pages.
Figure 2.Variant page. (A) Variant metadata is displayed, including links to dbSNP, UCSC and Clinvar. (B) Users can browse quality metrics based on genotypes (genotype quality and depth) as well as site-level quality metrics from GATK. (C) Annotations for each transcript are provided—if a variant overlaps multiple transcripts with the same functional annotation, a dropdown box provides additional details for the annotations. (D) Allele frequency information is displayed for each continental group. (E) Short read data is provided for more technical users to assess validity of the variant call.
Top genes and variants viewed in the ExAC browser
| Gene/Variant | Associated syndromes | Pageviews |
|---|---|---|
| PCSK9 | (Linked on front page of browser) Hypercholesterolemia | 13 540 |
| BRCA1 | Breast cancer susceptibility | 8251 |
| BRCA2 | Breast cancer susceptibility | 7408 |
| CFTR | Cystic Fibrosis | 5179 |
| FBN1 | Marfan Syndrome | 4886 |
| TP53 | Cancer susceptibility | 3712 |
| TTN | Gene that encodes for the largest protein, cardiomyopathy | 3528 |
| MYH7 | Cardiomyopathy | 3497 |
| MYBPC3 | Cardiomyopathy | 3398 |
| SCN5A | Brugada syndrome, long QT, cardiomyopathy | 3175 |
| rs113993960 | Cystic fibrosis (CFTR deltaF508) | 203 |
| rs1799966 | Breast cancer (BRCA1 missense variant) | 157 |
| rs11571833 | Breast cancer (BRCA2 stop-gained variant) | 120 |