| Literature DB >> 31280677 |
Rick A A van der Spek1, Wouter van Rheenen1, Sara L Pulit2, Kevin P Kenna1, Leonard H van den Berg1, Jan H Veldink1.
Abstract
Amyotrophic lateral sclerosis (ALS) is a rapidly progressive fatal neurodegenerative disease affecting one in 350 people. The aim of Project MinE is to elucidate the pathophysiology of ALS through whole-genome sequencing at least 15,000 ALS patients and 7500 controls at 30× coverage. Here, we present the Project MinE data browser ( databrowser.projectmine.com ), a unique and intuitive one-stop, open-access server that provides detailed information on genetic variation analyzed in a new and still growing set of 4366 ALS cases and 1832 matched controls. Through its visual components and interactive design, the browser specifically aims to be a resource to those without a biostatistics background and allow clinicians and preclinical researchers to integrate Project MinE data into their own research. The browser allows users to query a transcript and immediately access a unique combination of detailed (meta)data, annotations and association statistics that would otherwise require analytic expertise and visits to scattered resources.Entities:
Keywords:
Mesh:
Year: 2019 PMID: 31280677 PMCID: PMC7893599 DOI: 10.1080/21678421.2019.1606244
Source DB: PubMed Journal: Amyotroph Lateral Scler Frontotemporal Degener ISSN: 2167-8421 Impact factor: 4.092
Figure 1.Schematic representation of the databrowser. Whole genomes generated by Project MinE are openly available for research and the public. The databrowser does not have a login requirement. It integrates multiple public resources and provides a wide range of robust statistical analyses.
Figure 2.Results are shown for genic (canonical transcripts only) firth logistic regression including variants with a MAF < 1% and categorized as disruptive and damaging. λGC = 0.907, λ1000 = 0.964.
Figure 3.After entering the gene name (HGNC, Ensembl gene (ENSG), or transcript (ENST) identifier) in the search box on the homepage, you will be directed to the gene-specific page. (A) Averaged depth of coverage in the Project MinE dataset, compared to public data and indicating quality of coverage in the region. (B) Firth logistic regression-based genic burden tests. Triangles indicate variant locations. Red triangles reach nominal significance in the single variants association test. Hovering over the triangles to obtain more information about that variant. (C) Firth logistic regression-based geneset burden test. Tests are based on pathways, gene families or druggable gene categories. To elucidate the gene or genes driving a signal in the geneset, a Manhattan plot indicates the genic burden results for each of the genes included in the geneset. Hovering over individual genes will reveal more information about that gene. (D) Gene expression profiles extracted from GTEx. (E) Variant table. By default, a subset of variant information is shown; columns of interest can be selected from the dropdown menu. Minor allele frequency is based on all unrelated and QC passing samples in the Project MinE dataset (6198 genomes). Frequency information is also stratified by phenotypic status and compared to public exome and whole genome data. For comparison, we have indicated the allele frequency on a log scale with orange bars; the longer the bar, the higher the allele frequency. Variant filtering can be customized using the search boxes below the header of each column. All data, including case/control frequencies, are available for download in a tab-delimited file. For a more detailed view of the databrowser, see Supplementary Fig. 11.