| Literature DB >> 26335686 |
Reuben J Pengelly1, William Tapper2, Jane Gibson3, Marcin Knut4, Rick Tearle5, Andrew Collins6, Sarah Ennis7.
Abstract
BACKGROUND: An understanding of linkage disequilibrium (LD) structures in the human genome underpins much of medical genetics and provides a basis for disease gene mapping and investigating biological mechanisms such as recombination and selection. Whole genome sequencing (WGS) provides the opportunity to determine LD structures at maximal resolution.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26335686 PMCID: PMC4558963 DOI: 10.1186/s12864-015-1854-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Number of individuals, component marker counts and LD map length and using ABG and WGS data
| Individuals | Markers | Map length (LDU) | ||
|---|---|---|---|---|
| ABG | CEU | 112 | 15359 | 850.07 |
| YRI | 147 | 16083 | 993.80 | |
| WGS | CEU | 96 | 66704 (4.34) | 1021.07 (1.20) |
| YRI | 80 | 91320 (5.68) | 1569.46 (1.56) |
Fold change vs. ABG data in parentheses
Fig. 1Comparison of LD maps from ABG and WGS, and linkage map. Comparison of WGS (red) and ABG (blue) CEU LD maps (left ordinate axis scale) and linkage map (black; right ordinate axis scale) for chromosome 22. Linkage map shown is from the June 2012 release of the Rutgers Map v3, interpolated using the Kosambi function (available at http://compgen.rutgers.edu/download_maps.shtml) [12]
Fig. 2Comparison of LD decline intensity in WGS derived LD maps between populations. Comparison of regional rates of LD breakdown for CEU (green) and YRI (purple) populations using the WGS dataset for chromosome 22 for 100 kb windows. A very strong correlation between the LDU/kb for the two populations can be seen (p = 0.91, p < 2.2x10−16)
Fig. 3Expanded comparison of LD maps for a small region. Fine detail comparison of WGS (red) and ABG (blue) LD maps for a 250 kb region of YRI chromosome 22. All markers are plotted individually; hotspots are highlighted in grey. Whilst 13 hotspots are identified within the WGS map for this region, the ABG map shows no hotspots
Fig. 4Distribution of allele frequencies between data sources. Histogram showing MAF distributions within ABG (left panel) and WGS (right panel) datasets for CEU (green) and YRI (purple) populations. A MAF bin width of 0.05 has been used. The median MAF for CEU is 0.25 and 0.21 for the ABG and WGS data respectively; the same metrics for the YRI are 0.23 and 0.15 respectively
Fig. 5Concordance between identified hotspots. Euler diagram showing overlap between hotspots identified in each dataset. The area of all regions is proportional to the number of hotspots which are present in those sets; total area represents 629 independent hotspots across all datasets