| Literature DB >> 25052506 |
Yamin Ma1, Jian Zhao2, Jian-Syuan Wong3, Li Ma3, Wenzhi Li3, Guoxing Fu3, Wei Xu3, Kui Zhang4, Rick A Kittles5, Yun Li6, Qing Song3.
Abstract
Population stratification is a growing concern in genetic-association studies. Averaged ancestry at the genome level (global ancestry) is insufficient for detecting the population substructures and correcting population stratifications in association studies. Local and phase stratification are needed for human genetic studies, but current technologies cannot be applied on the entire genome data due to various technical caveats. Here we developed a novel approach (aMAP, ancestry of Modern Admixed Populations) for inferring local phased ancestry. It took about 3 seconds on a desktop computer to finish a local ancestry analysis for each human genome with 1.4-million SNPs. This method also exhibits the scalability to larger datasets with respect to the number of SNPs, the number of samples, and the size of reference panels. It can detect the lack of the proxy of reference panels. The accuracy was 99.4%. The aMAP software has a capacity for analyzing 6-way admixed individuals. As the biomedical community continues to expand its efforts to increase the representation of diverse populations, and as the number of large whole-genome sequence datasets continues to grow rapidly, there is an increasing demand on rapid and accurate local ancestry analysis in genetics, pharmacogenomics, population genetics, and clinical diagnosis.Entities:
Mesh:
Year: 2014 PMID: 25052506 PMCID: PMC4107375 DOI: 10.1038/srep05800
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The results of aMAP on ASW with proxy reference panels and imperfect reference panels.
ASW individuals (chromosome-1) were analyzed by aMAP with and without the YRI reference panel. The results of 5 personal haplotypes are shown. When YRI was missing in the reference panel, those African-originated segments (green) could be detected and reported as “others” (yellow).
Figure 2The computing time of aMAP and LAMP.
The whole-genome of 20 HapMap ASW individuals (African-Americans) were analyzed with three references (CEU, YRI, and CHBCHD). The computing speeds of aMAP and LAMP-HAP are compared, and both are linear to the total number of SNPs; the speed of aMAP is about 923 times faster than the speed of LAMP-HAP.
Figure 3The aMAP algorithm.
(a) Reference analysis and pretreatment prior to use. (b) Parallel window scan using a set of parallel sliding windows and horizontal data integration. (c) Vertical data integration that integrates local ancestral calls from adjacent windows. Common calls are indicated by brackets.