| Literature DB >> 23151206 |
Leonardo Arbiza1, Elaine Zhong, Alon Keinan.
Abstract
BACKGROUND: Analyzing regions of the genome where genetic variation is free from the confounding effects of natural selection is essential for many population genetic studies. Several recent studies in humans have stressed the large effect of natural selection at linked neutral sites and have shown that the choice of putatively neutral regions can have a marked effect on estimates of demographic history.Entities:
Mesh:
Year: 2012 PMID: 23151206 PMCID: PMC3543337 DOI: 10.1186/1471-2105-13-301
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Effect of neutral region choice in estimating the X-to-autosome effective population size ratio. Nx/Na (y-axis) is estimated by means of the X-to-autosome ratio of nucleotide diversity (π) normalized by human-macaque divergence to correct for variation in mutation rates. Bars are as described in text. Error bars are standard errors estimated by bootstrapping 10,000 data sets. YRI, Yoruba (Ibadan, Nigeria); CEU, CEPH (from Utah with Northern and Western European ancestry).
Megabases remaining after each filtering and masking step
| A | 1921.62 (65.0%) | 676.70 (35.2%) | 522.88 (77.3%) | 267.28 (51.1%) | 120.17 (23.0%) | 54.59 (10.4%) | 395.32 (75.6%) |
| X | 97.99 (63.3%) | 43.30 (44.2%) | 20.56 (47.5%) | 14.63 (71.1%) | 3.08 (15.0%) | 2.15 (10.5%) | 10.31 (50.2%) |
The first three filters, starting with the leftmost column were sequentially applied, resulting in the “genome-wide” set on which all additional analyses are based for both the X-chromosome (X) and the autosomes (A). Subsequent filters are all subsets of this set. Indicated percentages are out of the previous filtering step, i.e. previous column and the third column for all following columns.
Genome-wide macaque-normalized diversity estimates and ratios of chromosome X to autosomes
| CEU | 20.6 | 0.00797 (0.0015) | 522.9 | 0.01596 (0.0003) | 0.4992 (0.083) |
| YRI | | 0.01245 (0.0008) | | 0.02023 (0.0001) | 0.6154 (0.064) |
| CEU/YRI | 0.63980 (0.0897) | 0.78890 (0.0162) | 0.8113 (0.115) |
Estimates for each region (and standard errors) together with the total number of bases analyzed after filtering (Mb). Note that the genome-wide data summarized in this table correspond to the leftmost bars of Figure 1 and Additional file 1: Figure S3 prior to the application of the more stringent set of filters for neutrality which have a large effect on results.