| Literature DB >> 32655614 |
Ai-Ru Hsieh1, Jia Jyun Sie2, Chien Ching Chang3, Jurg Ott4, Ie-Bin Lian2, Cathy S J Fann3.
Abstract
BACKGROUND: Due to the affordability of whole-genome sequencing, the genetic association design can now address rare diseases. However, some common statistical association methods only consider homozygosity mapping and need several criteria, such as sliding windows of a given size and statistical significance threshold setting, such as P-value < 0.05 to achieve good power in rare disease association detection.Entities:
Keywords: ALSPAC; autosomal recessive disease; maximal segmental score; rare disease; whole-genome sequencing
Year: 2020 PMID: 32655614 PMCID: PMC7325894 DOI: 10.3389/fgene.2020.00555
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1An example of partial sum scores for 20 SNPs. Ladder points in solid dots (2nd, 4th to 8th, 18th, and 19th SNP) forming eight segmental regions are the markers with new record lows of the partial sum scores, which are lower than all scores on their left, respectively. The ladder points partition the curve into the up-climbing intervals (e.g., [0 to 2nd], [2nd to 4th], and [8th to 18th]), which indicate the cluster of neighboring significant SNPs and the declining regions (the rest insignificant intervals). Segmental score (Sv, green stars) is the vertical height of each interval, and the highest Sv is assumed to be the most susceptive region. In eMSS, some minor number of insignificant SNPs within the susceptive region are tolerable, e.g., markers 12 and 13.
Power and type I error comparison with differing scenarios in our eMSS calculations under the two-sample t-test.
| Extreme1 | 76.7% (1.5%) | 76% (7%) | 77.8% (1.2%) | 75% (8%) | 80.7% (2.5%) | 79.7% (1.6%) | 96.2% (1.9%) | 96.2% (0.6%) |
| Non-extreme2 | 63.5% (3.5%) | 60% (7%) | 65.1% (3.4%) | 64% (6%) | 68.3% (3.5%) | 65.3% (3.0%) | 91.4% (4.9%) | 93.4% (0.2%) |
Ranking of known pathogenic variants of the Québec data set used in our eMSS method.
| OI | OI | 8 | 5,628 | 20,884,737–24,359,279 | 294 | 2 | 0.015 | 22,022,749–22,052,894 (30.145 kb) | 10 | 0.015 | |
| F1 | MIA | 2 | 8,465 | 45,205,547–71,635,931 | 744 | 2 | 0.015 | 47,251,634–47,256,618 (4.984 kb) | 4 | 0.030 | |
| F4 | MIA | 2 | 7,380 | 39,108,773–55,528,514 | 378 | 1 | 0.015 | 47,133,330–47,273,668 (140.358 kb) | 11 | 0.015 | |
| F6 | MIA | 2 | 7,562 | 45,171,842–85,059,227 | 872 | 1 | 0.015 | 47,133,330–47,273,668 (140.358 kb) | 8 | 0.015 | |
| F1 + F4 + F6 | MIA | 2 | 3,484 | 38,916,906–68,676,008 | 386 | 1 | 0.002 | 47,133,330–47,277,043 (143.713 kb) | 9 | 0.002 | |
Ranking of known pathogenic variants of the Pakistani data set to the OI disease used in our eMSS method.
| III-5 | 12 | 2,229 | 31,545,381–52,470,979 | 196 | 2 | 0.015 | 48,272,895–48,883,020 (610.125 kb) | 8 | 0.015 | |
| III-15 | 12 | 2,022 | 47,178,307–52,713,088 | 146 | 2 | 0.015 | 48,272,895–48,884,535 (611.640 kb) | 9 | 0.015 | |