| Literature DB >> 16904005 |
Valeria Mattiangeli1, Anthony W Ryan, Ross McManus, Daniel G Bradley.
Abstract
BACKGROUND: In this study we present a single population test (Ewens-Waterson) applied in a genomic context to investigate the presence of recent positive selection in the Irish population. The Irish population is an interesting focus for the investigation of recent selection since several lines of evidence suggest that it may have a relatively undisturbed genetic heritage.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16904005 PMCID: PMC1779589 DOI: 10.1186/gb-2006-7-8-r74
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Schematic illustrating the use of pairwise FST scores to generate a locus-specific branch length (as described in [18]). Branch lengths (lEU, lAF, lAS) were calculated from single locus pairwise FST distances. lEU = (European:Asian FST + European-African FST - Asian-African FST)/2; lAS = (European-Asian FST + Asian-African FST - European-African FST)/2; lAF = (European-African FST + Asian-African FST - European-Asian FST)/2.
Figure 2Absolute European locus-specific branch length (LSBL) plotted versus the relative LSBL for each SNP with significant FST (> 0.45); n = 334. Data were from Akey and colleagues [12]. The 23 loci circled in the plot have an EU Absolute LSBL value = 0.8 and were considered in this study as outliers. EU, European-Americans; AF, African-Americans; AS, East Asian.
SNPs, genes and microsatellites analysed in this study
| SNP identity | Chromosome | Gene | Microsatellite name | Microsatellite distance from SNP (kb) | H-W | DH/sd | PS | PE |
| rs1009127 | 1 | LRRC7 (leucine rich repeat 7) | +34.6 | NS | -3.6 | 0.007 | NS | |
| rs718830 | 7 | CFTR (cystic fibrosis transmembrane conductance regulator) | IVS8CA* | +14.5 | NS | 0.003 | 0.03 | |
| IVS17B* | +49.2 | NS | -1.1 | NS | NS | |||
| -10.2 | NS | -1.3 | NS | NS | ||||
| rs997929 | 8 | TOX (thymus high mobility group box protein) | -20.4 | - | - | - | - | |
| +22.2 | NS | -1.2 | NS | NS | ||||
| rs726733 | 11 | SYT9 (synaptotagmin IX) | -7 | NS | -2.8 | 0.019 | NS | |
| +40.9 | NS | -1.4 | NS | NS | ||||
| rs1111108 | 14 | PKCη (protein kinase C, eta) | +8.7 | NS | 0.0001 | 0.016 | ||
| -70.4 | NS | 0.0001 | 0.013 | |||||
| rs761057 | 16 | TPSG1 (γ-triptase 1) | -74.3 | NS | 0.001 | 0.003 | ||
| +41.4 | NS | -0.2 | NS | NS | ||||
| rs998262 | 5 | No gene | -4.6 | 0.0001 | 0.022 | |||
| +31.7 | NS | -3.6 | 0.008 | NS | ||||
| rs902336 | 1 | 32 kb upstream from the | -15.7 | NS | -3.2 | 0.015 | NS | |
| gene ABCD3 | -2 | - | - | - | - |
'SNP identity' indicates the identification number of each SNP as annotated in dbSNP NCBI (National Center for Biotechnology Information). 'Chromosome' indicates on which chromosome the SNP is located. 'Gene' indicates the gene in which the SNP is located or the nearest gene in the case of the last SNP. 'Microsatellite name' is the name given to the microsatellites flanking the SNP; the same name can be found in Figure 3, where the microsatellites are placed in the genome-wide distribution. 'Microsatellite distance from the SNP' is the distance (in kilobase) upstream (+) or downstream (-) between the microsatellite and the SNP. These statistics are reported from analyses carried out on the microsatellite data. 'H-W' is the p value from the Hardy-Weinberg equilibrium test. 'DH/sd' is the observed gene diversity minus the expected heterozygosity (DH) according the Ewens-Watterson's statistic divided by the standard deviation (sd) of the gene diversity (see the text for details). 'PS' is the significance of the difference between observed gene diversity and expected heterozygosity resulting from the simulation carried out by the program BOTTLENECK. 'PE' is the significance calculated using the empirical distribution; only values < 0.05 are quoted. NS, not significant. The two microsatellites marked with an asterisk have been described previously [19,20] and no data for the two markers that were not scorable are denoted by hyphens.
Figure 3Empirical distribution of Ewens-Waterson's statistic. (a) Genome-wide distribution of Ewens-Watterson's analysis statistic (DH/sd) generated from 372 dinuclotide microsatellites, distributed through the whole genome (ABI PRISM Linkage mapping Set-MD10). (b) Genome-wide distribution of Ewens-Watterson's statistic generated from microsatellites located within genes. The number of loci is indicated by the Y-axis. The arrows show the Ewens-Watterson score for each microsatellite associated with the genes included in the preliminary test. The microsatellites in bold have significant divergence (PE, Table 1) between observed gene diversity and expected heterozygosity according to the Ewens-Watterson test.