| Literature DB >> 32560373 |
Kumiko V Nishiyama1, Yoko Satta1, Jun Gojobori1.
Abstract
Dyslexia, or reading disability, is found to have a genetic basis, and several related genes have been reported. We investigated whether natural selection has acted on single nucleotide polymorphisms (SNPs) that were reported to be associated with risk/non-risk for the reading disability of Chinese characters. We applied recently developed 2D SFS-based statistics to SNP data of East Asian populations to examine whether there is any sign of selective sweep. While neutrality was not rejected for most SNPs, significant signs of selection were detected for two linkage disequilibrium (LD) regions containing the reported SNPs of GNPTAB and DCDC2. Furthermore, we searched for a selection target site among the SNPs in these LD regions, because a causal site is not necessarily a reported SNP but could instead be a tightly linked site. In both LD regions, we found candidate target sites, which may have an effect on expression regulation and have been selected, although which genes these SNPs affect remains unknown. Because most people were not engaged in reading until recently, it is unlikely that there has been selective pressure on reading ability itself. Consistent with this, our results suggest a possibility of genetic hitchhiking, whereby alleles of the reported SNPs may have increased in frequency together with the selected target, which could have functions for other genes and traits apart from reading ability.Entities:
Keywords: 2D SFS-based statistics; East Asian populations; dyslexia; genetic hitchhiking; nSL; population genetics; selective sweep
Mesh:
Substances:
Year: 2020 PMID: 32560373 PMCID: PMC7349701 DOI: 10.3390/genes11060658
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
The single nucleotide polymorphisms (SNPs) associated with dyslexia of Chinese characters in previous studies.
| Gene | Core SNP | Chr. | Position | Risk Allele | Derived Allele Frequency | References | |
|---|---|---|---|---|---|---|---|
| (GRCh37/hg19) | EAS | (EAS and KPGP) | |||||
|
| rs28366021 | 1 | 36,022,859 | Ancestral | 0.234 | (0.227) | [ |
|
| rs4535189 | 3 | 79,489,971 | Derived | 0.366 | (0.373) | [ |
|
| rs807724 | 6 | 24,278,869 | Ancestral | 0.957 | (0.956) | [ |
|
| rs1091047 | 6 | 24,295,256 | Ancestral | 0.817 | (0.823) | [ |
|
| rs2760157 | 6 | 24,578,272 | Ancestral | 0.456 | (0.470) | [ |
|
| rs807507 | 6 | 24,579,867 | Derived | 0.188 | (0.187) | [ |
|
| rs4504469 | 6 | 24,588,884 | Derived | 0.112 | (0.122) | [ |
|
| rs2074130 | 7 | 111,487,098 | Derived | 0.101 | (0.115) | [ |
|
| rs1079727 | 11 | 113,289,182 | Derived | 0.416 | (0.420) | [ |
|
| rs17031962 | 12 | 102,146,558 | Ancestral | 0.294 | (0.297) | [ |
|
| rs11629841 | 15 | 55,777,638 | Derived | 0.058 | (0.056) | [ |
|
| rs3743205 | 15 | 55,790,530 | Derived | 0.035 | (0.037) | [ |
| intergenic region | rs8049367 | 16 | 3,980,445 | Derived | 0.339 | (0.340) | [ |
|
| rs882294 | 16 | 5,092,118 | Derived | 0.189 | (0.188) | [ |
|
| rs2255526 | 21 | 47,971,539 | Derived | 0.264 | (0.262) | [ |
The results of nSL for the core SNPs.
| Gene | Core SNP | Normalized | |
|---|---|---|---|
|
| rs28366021 a | 0.0771 | 0.469 |
|
| rs4535189 | −0.1882 | 0.575 |
|
| rs807724 | 1.1328 | 0.129 |
|
| rs1091047 | −0.5967 | 0.725 |
|
| rs2760157 | −2.1853 | 0.986 |
|
| rs807507 | 0.7329 | 0.232 |
|
| rs4504469 | 0.7098 | 0.239 |
|
| rs2074130 | 0.3068 | 0.379 |
|
| rs1079727 | −0.1744 | 0.569 |
|
| rs17031962 | 1.2369 | 0.108 |
|
| rs11629841 | −0.0922 | 0.537 |
|
| rs3743205 | −0.1939 | 0.577 |
| intergenic region | rs8049367 | −0.4421 | 0.671 |
|
| rs882294 | 0.2399 | 0.405 |
|
| rs2255526 | - | - |
a Representative for rs28366021.
statistic results for the core SNPs and three younger SNPs.
| Gene | Core SNP | Number of Derived Alleles | Length of the Core Region | Number of Segregating Sites |
| |
|---|---|---|---|---|---|---|
|
| rs28366021 | 236 | 330,223 | 2204 | 0.1476 | 0.718 |
|
| rs4535189 | 369 | 124,626 | 866 | 0.1287 | 0.316 |
|
| rs73129039 a | 363 | 124,626 | 866 | 0.1232 | 0.303 |
|
| rs807724 | 965 | 5910 | 53 | 0.6742 | 0.159 |
|
| rs1091047 | 824 | 41,134 | 334 | 0.3044 | 0.111 |
|
| rs3789228 b | 782 | 41,134 | 334 | 0.2020 | 0.068 * |
|
| rs2760157 | 460 | 7387 | 53 | 0.7765 | 0.939 |
|
| rs807507 | 189 | 11,475 | 81 | 0.0220 | 0.111 |
|
| rs4504469 | 113 | 32,025 | 241 | 0.0736 | 0.529 |
|
| rs2074130 | 102 | - | - | - | - |
|
| rs1079727 | 419 | 38,525 | 372 | 0.1370 | 0.260 |
|
| rs17031962 | 296 | 136,804 | 868 | 0.0400 | 0.038 * |
|
| rs11629841 | 58 | 130,280 | 1113 | 0.0589 | 0.769 |
|
| rs3743205 | 35 | 242,254 | 2024 | 0.0680 | 0.963 |
|
| rs79024225 c | 31 | 242,254 | 2024 | 0.0308 | 0.758 |
| intergenic region | rs8049367 | 342 | 14,513 | 177 | 0.1486 | 0.428 |
|
| rs882294 | 191 | 34,706 | 339 | 0.2875 | 0.905 |
|
| rs2255526 | 266 | 67,101 | 661 | 0.0899 | 0.361 |
* p 0.1; a the younger SNP of rs4535189 on ROBO1; b the younger SNP of rs1091047 on DCDC2; c the younger SNP of rs3743205 on DYX1C1.
Figure 1(A) Top: p-values of for 48 candidate SNPs in the core region of rs17031962 on GNPTAB. Each dot represents a candidate SNP. The core SNP is indicated by “#”. Colored dots other than black indicate the 12 SNPs with p 0.01. SNPs with the same p-value and smallest region are indicated in the same color. Positions of the three genes in the core region are illustrated as thick lines underneath. Bottom: The same plot showing only the SNPs with p 0.01. The possible target site is indicated by a black arrow. (B) The lengths and positions of the smallest regions of the SNPs with p 0.01. The regions are numbered according to the ascending order of the p-value. The color of the regions corresponds to the dot color in (A).
Figure 2(A) Top: p-values of for 21 candidate SNPs in the core region of rs3789228 on DCDC2. Each dot represents a candidate SNP. Colored dots other than black indicate SNPs with p 0.01. SNPs with the same p-value and smallest region are indicated in the same color. Bottom: The same plot showing only the SNPs with p 0.01. The target site is indicated by a black arrow. The younger SNP is indicated by “#”. (B) The lengths and the positions of the smallest regions of the SNPs with p 0.01. The regions are numbered according to the ascending order of p-value. “1st-2nd region” indicates the smallest region containing both the top and the second SNP, shown as red and orange dots in (A), respectively, and overlapped in the same smallest region. The colors of the other regions correspond to the dot colors in (A).