| Literature DB >> 28454514 |
Loris Bertoldi1, Claudio Forcato1, Nicola Vitulo1,2, Giovanni Birolo1, Fabio De Pascale1, Erika Feltrin1, Riccardo Schiavon3, Franca Anglani4, Susanna Negrisolo5, Alessandra Zanetti5, Francesca D'Avanzo5, Rosella Tomanin5, Georgine Faulkner3, Alessandro Vezzi3, Giorgio Valle6,7.
Abstract
BACKGROUND: Whole genome and exome sequencing are contributing to the extraordinary progress in the study of human genetic variants. In this fast developing field, appropriate and easily accessible tools are required to facilitate data analysis.Entities:
Keywords: Data integration; Exome sequencing; Variant annotation; Variant prioritization
Mesh:
Year: 2017 PMID: 28454514 PMCID: PMC5410040 DOI: 10.1186/s12859-017-1654-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The three main steps of QueryOR analysis. Step 1 and step 3 require interaction with the user, whereas step 2, data processing, is automatically performed by the system after uploading VCF files
Fig. 2A typical route for a QueryOR investigation starts with the selection of criteria (Q1); a ranked list of genes and variants is returned in Q2. The selection of a gene, for instance FUCA2, leads to page Q3 where variants and affected transcripts in their genomic context are shown. The black track at the top of Q3 shows the target regions of exome capturing. The blue tracks just below show that the analysis was done on two samples named APN and APM, that share a heterozygous variant (white circles). The yellow boxes report the positions specified in the optionally uploaded BED file, indicating for instance low coverage regions. The bottom part of Q3 shows two alternative transcripts where the same variant in one case is located in an exon, generating a missense substitution (dark blue circle) while in the other case is located in an intron (gray circle). By clicking on a transcript of Q3, the system returns Q4, where several transcript features are directly linked to external resources, as well as to the variant overview page (Q5). For a full list of symbols used in Q3, see Fig. 3. A more detailed description of the entire process is given in the main text
Fig. 3Trio analysis. In the Q3 section, the arrow points to a variant that is heterozygous in both parents and homozygous in the child (full green bar). At the end of the next exon, the child displays a heterozygous variant, shown as a small green bar, which was directly inherited from the father. A detailed description of the variants is given in the Q4 section where the user can also find a link to the IGV viewer, that will be conveniently opened on the appropriate genomic position
Fig. 4Comparison of QueryOR with other platforms for variant prioritization. The platforms were tested using a VCF input file. The indicated number of available criteria is approximate due to different ways of implementation
Fig. 5Usage of the criteria for “shared” and “homozygous” variants in a trio case. Diamonds indicate different hypotheses that can be made. For instance, if we hypothesize a recessive homozygous variant in the child we should set two criteria: 1) shared variants by child and both parents and 2) homozygous variants only in the child. Whereas, for a compound heterozygosity we would expect that the child shares the variants, but we do not know which variant is in which parent; furthermore, the variant should not be homozygous in the parents. Compound heterozygosities are generally difficult to find and criteria based only on sharing and homozygosity would not be selective enough. In this case the “Gene Analysis tool” described in the text could help in the selection of genes carrying different mutations. Sometimes it may be useful to set criteria that may appear useless, like homozygosity on a X chromosome; however this may help to reduce false positives
Fig. 6Searching for de novo mutations in a trio. Q1: to set the criteria the user should select “Shared variants” and click the box beside each patient, selecting green, red or gray respectively for present, absent and ignore, while the sentence underneath will report in plain English the meaning of the settings; more criteria can be set by clicking the + symbol. Q3: the results include haplotype phasing. The yellow bar indicated by the arrow is a possible de novo mutation. For the meaning of other symbols see Fig. 3