| Literature DB >> 25078076 |
Margherita Mutarelli, Veer Marwah, Rossella Rispoli, Diego Carrella, Gopuraja Dharmalingam, Gennaro Oliva, Diego di Bernardo.
Abstract
BACKGROUND: Mendelian disorders are mostly caused by single mutations in the DNA sequence of a gene, leading to a phenotype with pathologic consequences. Whole Exome Sequencing of patients can be a cost-effective alternative to standard genetic screenings to find causative mutations of genetic diseases, especially when the number of cases is limited. Analyzing exome sequencing data requires specific expertise, high computational resources and a reference variant database to identify pathogenic variants.Entities:
Mesh:
Year: 2014 PMID: 25078076 PMCID: PMC4083405 DOI: 10.1186/1471-2164-15-S3-S5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Website interface.Screenshots of the relevant sections of the web interface and outline of the user experience. The analysis is submitted through the Create New section, where the user submit the required analysis details. The user can choose to submit a single sample or multiple samples of related individuals and an auto-completion feature helps the user in assigning the correct disease id to the analysis. The user can follow the progress of the analysis in the Analysis section where he will also find a link to the Results page after the analysis is completed.
Figure 2Sample analysis report. The analysis report produced by the analysis pipeline includes these fields for each variant called. In addition to the standard predictions of the variant on protein function, the report includes variant frequencies across patients grouped by disease according to the MEDIC hierarchical disease ontology. For a description of each field please refer to the Additional file 4.
Variation Classification
| Variation Class | |||
|---|---|---|---|
| I | + | + | + |
| II | + | + | - |
| III | + | - | + |
| IV | + | - | - |
| V | - | +/- | +/- |
Variants are automatically classified by the pipeline to help the user in detecting causative mutations (also refer to Additional file 4). A "+" sign means that the criterion indicated in the column is satisfied. Frequency criterion: the frequency in 1000 Genomes Project, Exome Variant Server and the TIGEM Variant database (in unrelated disease groups) must be < 1%; Quality criterion: the GATK variant calling tool must assign a "PASS" value to the "filter" field and score > 50 to the Genotype Quality field. Moreover, if the variant is homozygous, the percentage of reads supporting the call must be > 80%. If the variant is heterozygous, the percentage of reads supporting the call must be > 30% and < 80%); Impact criterion: the mutation causes a gain or loss of a stop codon, a gain or loss of a splicing signal, or a frame-shift in the Open Reading Frame. Alternatively, the phylogenetic conservation must be significant (i.e. the fields LJB PhyloP Pred="C" AND LJB Gerp++>2 AND Conserved>170) and 4 out of 5 prediction algorithms indicate a damaging effect (i.e. Avsift, LJB SIFT, LJB PolyPhen2, LJB LRT, LJB Mutation Taster).