| Literature DB >> 28957403 |
Sugi Lee1,2, Minah Jung1,2, Jaeeun Jung1,2, Kunhyang Park3, Jea-Woon Ryu2, Jeongkil Kim1,2, Dae-Soo Kim1,2.
Abstract
Whole-exome sequencing (WES) can identify causative mutations in hereditary diseases. However, WES data might have a large candidate variant list, including false positives. Moreover, in families, it is more difficult to select disease-associated variants because many variants are shared among members. To reduce false positives and extract accurate candidates, we used a multilocus variant instead of a single-locus variant (SNV). We set up a specific window to analyze the multilocus variant and devised a sliding-window approach to observe all variants. We developed the gene selection tool (GST) based on proportion tests for linkage analysis using WES data. This tool is R program coded and has high sensitivity. We tested our code to find the gene for hereditary spastic paraplegia using SNVs from a specific family and identified the gene known to cause the disease in a significant gene list. The list identified other genes that might be associated with the disease.Entities:
Mesh:
Year: 2017 PMID: 28957403 PMCID: PMC5619773 DOI: 10.1371/journal.pone.0185514
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
A list of significant genes deduced by GST analysis.
| Rank | Chromosome | Locus | Gene | Score |
|---|---|---|---|---|
| 1 | chr12 | 6575044 | 20.610176 | |
| 2 | chr5 | 131705949 | 15.517678 | |
| 3 | chr1 | 13183015 | 13.602401 | |
| 4 | chr12 | 6562285 | 10.389608 | |
| 5 | chr2 | 32341282 | 8.452735 | |
| 6 | chr19 | 56283297 | 7.497583 | |
| 7 | chr2 | 132236963 | 5.867672 | |
| 8 | chr7 | 100638484 | 5.788482 | |
| 9 | chr12 | 50749294 | 5.750429 | |
| 10 | chr12 | 113319600 | 4.951603 | |
| 11 | chr1 | 248084909 | 4.101806 | |
| 12 | chr13 | 25671700 | 4.087220 | |
| 13 | chr2 | 10059844 | 3.596389 | |
| 14 | chr15 | 23261850 | 3.281563 |
Fig 1A plot of the significant gene list of all chromosomes except X and Y chromosomes.
To distinguish the chromosomes, they are expressed in red and blue. The candidate genes of Table 1 are represented by gene names and their trimmed score values. Details of the trimmed score option have been provided in the S1 File (GST User guide).