| Literature DB >> 33748785 |
Nicholas S Diab1, Spencer King2,3, Weilai Dong1,4, Garrett Allington1, Amar Sheth5, Samuel T Peters2, Kristopher T Kahle5,6,7, Sheng Chih Jin2.
Abstract
Here, we present a protocol to analyze de novo genetic variants derived from the whole-exome sequencing (WES) of proband-parent trios. We provide stepwise instructions for using existing pipelines to call de novo mutations (DNMs) and determine whether the observed number of such mutations is enriched relative to the expected number. This protocol may be extended to any human disease trio-based cohort. Cohort size is a limiting determinant to the discovery of high-confidence pathogenic DNMs. For complete details on the use and execution of this protocol, please refer to Dong et al. (2020).Entities:
Keywords: Bioinformatics; Genetics; Genomics; High-throughput screening; Sequence analysis; Sequencing
Mesh:
Year: 2021 PMID: 33748785 PMCID: PMC7960548 DOI: 10.1016/j.xpro.2021.100383
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
De novo enrichment analysis among cases
| Class | Observed | rate_obs_cases | Expected | rate_exp_cases | Enrichment | p value |
|---|---|---|---|---|---|---|
| all | 74 | 1.057142857 | 79.5 | 1.135714286 | 0.931 | 0.747 |
| syn | 22 | 0.314285714 | 22.5 | 0.321428571 | 0.977 | 0.571 |
| misT | 27 | 0.385714286 | 37.4 | 0.534799393 | 0.721 | 0.968 |
| misD | 17 | 0.242857143 | 12.7 | 0.181428571 | 1.34 | 0.141 |
| lof | 8 | 0.114285714 | 6.9 | 0.098571429 | 1.16 | 0.387 |
| protD | 25 | 0.357142857 | 19.6 | 0.28 | 1.28 | 0.134 |
| prot | 52 | 0.742857143 | 57 | 0.814285714 | 0.912 | 0.764 |
Results of the enrichment analysis for DNMs across all genes in cases. Corresponds to supplemental table 2b in Dong et al. (2020).
Filtering criteria for putative DNMs
| Cases | Controls |
|---|---|
Minor allele frequency ≤ 4 × 10−4 across all samples in 1000 Genomes, EVS, and ExAC, Bravo | in-cohort allele frequency ≤ 4 × 10−4 |
probands must have at least 10 total reads and 5 alternative allele reads probands must have a minimum of 20% alternate allele ratio for alternate allele reads ≥10 or a minimum 28% alternate ratio for alternate allele reads <10 parents must have a minimum depth of 10 reference reads and alternate allele ratio <3.5% exonic or canonical splice-site variants | |
Example filtering criteria to analyze DNMs in cases and controls. The criteria are the same as those used by Dong et al. (2020).
Figure 1Example IGV plots for two DNM calls
(A) A true-positive DNM in the gene GABRG1.
(B) A false-positive DNM in the gene HTT. Color scheme: A is green, C is blue, G is brown, and T is red.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Published paper and data | dbGap accession number: phs000744 | |
| 1,000 genomes | ||
| ExAC database | ||
| Exome variant server (EVS) | Exome Variant Server, NHLBI GO Exome Sequencing Project (ESP), Seattle, WA | |
| BRAVO | ||
| TrioDeNovo | ||
| GATK | ||
| DeNovo protocol code | Scripts, intermediate files, and data generated for this protocol | |