| Literature DB >> 30700791 |
Fangning Gu1, Anchi Wu1, M Grace Gordon1, Lukas Vlahos1, Shane Macnamara1, Elizabeth Burke1, May C Malicdan1, David R Adams1, Cynthia J Tifft1, Camilo Toro1, William A Gahl1, Thomas C Markello2.
Abstract
PURPOSE: Develop an automated exome analysis workflow that can produce a very small number of candidate variants yet still detect different numbers of deleterious variants between probands and unaffected siblings.Entities:
Keywords: Undiagnosed Diseases Network; agnostic exome analysis; diagnosis; exome; rare diseases
Mesh:
Year: 2019 PMID: 30700791 PMCID: PMC6669106 DOI: 10.1038/s41436-019-0434-0
Source DB: PubMed Journal: Genet Med ISSN: 1098-3600 Impact factor: 8.822
Figure 1:Workflow overview of the agnostic exome analysis pipeline.
Figure 2.VMM scoring function.
The x- and y-axes represent the two CADD (Phred) scores and the z-axis is the VMM score. The longer wavelength colors denote regions more likely to be truly deleterious for the pair of variants.
Figure 3:Difference of deleterious variants (DVs) between probands (N=97) and matched siblings (N=113) at a range of deleterious score thresholds.
Deleterious score thresholds represent the exonic CADD thresholds used for each single hypothesis testing. Bars show the average number of DVs per individual, calculated from the combined total compound heterozygous, de novo, homozygous recessive, hemizygous, X-linked, and extreme novel exon deletion variants for each individual. Values above the bars are significance values from a single-tailed t-test with heteroskedastic distributions for the null hypothesis that there is no difference in DV counts between a group of probands and their unaffected siblings. Asterisks mark deleterious score thresholds at which the difference between groups had a p-value less than 0.05.
Figure 4:Comparison of deleterious variants (DVs) between probands (N=97) and matched siblings (N=113) at each deleterious score threshold.
A. Bars show the average number of DVs per individual, calculated from the combined total compound heterozygous, de novo, homozygous recessive, hemizygous, X-linked, and extreme novel exon deletion variants for each individual at CADD20. * = p<0.05 for a single-tailed t-test with heteroskedastic distributions for the null hypothesis that there is no difference in DV count between a group of probands and a group of their unaffected siblings. B. Histogram of proband DVs at an exonic CADD threshold of 20. C. Histogram of sibling DVs at an exonic CADD threshold of 20. D. Comparison of distribution between probands and their unaffected siblings of average DV counts within six Mendelian states at an exonic CADD threshold of 20, as filtered from the variant exclusion pipeline. : 4.6 DVs and 4.2 DVs (for probands and unaffected sibs, respectively); : 1.0, 0.7; : 0.2, 0.2; : 0.1, 0.1; = 1.4, 1.0; : 0.0, 0.1.