Literature DB >> 30472416

AncestryPainter: A Graphic Program for Displaying Ancestry Composition of Populations and Individuals.

Qidi Feng1, Dongsheng Lu2, Shuhua Xu3.   

Abstract

Ancestry composition of populations and individuals has been extensively investigated in recent years due to advances in the genotyping and sequencing technologies. As the number of populations and individuals used for ancestry inference increases remarkably, say more than 100 populations or 1000 individuals, it is usually challenging to present the ancestry composition in a traditional way using a rectangular graph. To address this issue, we developed a program, AncestryPainter, which can illustrate the ancestry composition of populations and individuals with a rounded and nice-looking graph to save space. Individuals are depicted as length-fixed bars partitioned into colored segments representing different ancestries, and the population of interest can be highlighted as a pie chart in the center of the circle plot. In addition, AncestryPainter can also be applied to display personal ancestry in a way similar to that for displaying population ancestry. AncestryPainter is publicly available at http://www.picb.ac.cn/PGG/resource.php.
Copyright © 2018 The Authors. Production and hosting by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Admixed populations; Admixture proportions; Ancestry composition; AncestryPainter; Graphic program

Mesh:

Year:  2018        PMID: 30472416      PMCID: PMC6364040          DOI: 10.1016/j.gpb.2018.05.002

Source DB:  PubMed          Journal:  Genomics Proteomics Bioinformatics        ISSN: 1672-0229            Impact factor:   7.691


Introduction

Ancestry composition of individuals indicates the average proportion of contributing ancestries to their entire genomes [1]. Global ancestry for individuals is frequently inferred in population genetics to understand the genetic relationship and history of gene flow of populations [2], [3], figure out ancestry composition of admixed populations [4], [5], [6], or detect population stratification [7], [8]. Ancestry of a population or an individual can be estimated in different ways, through simple calculation based on allele frequency of the ancestral source populations with K-mean clustering, or using model-based methods such as ADMIXTURE [1], structure [9], and frappe [2], [10]. To use these methods, it is assumed that the observed individuals are drawn from a population with contributions from K ancestries. Then the programs estimate the ancestry proportions contributed by K ancestral source populations, where K is a potential number of ancestral source populations. In theory, the sum of the ancestry proportions of each individual attributed to K ancestral source populations is expected to be 100%. However, it does not necessarily mean that all of the ancestral source populations truly contribute ancestry to the target individual or population and statistical tests can be applied to estimate the significance of ancestry contributions. Several computational tools are available to graphically display the estimated ancestry proportions of individuals or populations [9], [11]. These programs display the ancestry of an individual as a fixed-length line segment partitioned into K colored components, with the length of each colored component corresponding to the proportion of each ancestry. However, all these programs align individuals in a rectangular graph, which is not able to accommodate a large number of individuals or populations in a single print page, thus making it difficult to publish these results. To solve this problem, we developed AncestryPainter, a computational program for displaying the ancestry composition of individuals and populations in a circular graph, and highlighting the ancestry composition of individuals that are of particular interest to the users in the center of the circle as a pie chart (Figure 1). Moreover, AncestryPainter automatically aligns populations based on the proportions of their representative ancestries, i.e., the ancestry accounting for the largest proportion in the population. Therefore, the graph generated by AncestryPainter is not only nice-looking, but also efficient in ancestry visualization.
Figure 1

An example graph taken from the output of

The input of this figure is produced by running ADMIXTURE assuming eight ancestral source populations (K = 8) based on the Human Origins datasets [12] including Africans, Americans, Oceanians, West Eurasians, South Asians, Central Asians/Siberians, and East Asians. The Uyghur population is highlighted in the center of the graph. The command line used for this result is “perl AncestryPainter.pl -i data.ind -q data.Q -t Uygur”. This example is also included in the AncestryPainter program package, which can be downloaded at http://www.picb.ac.cn/PGG/resource.php.

An example graph taken from the output of The input of this figure is produced by running ADMIXTURE assuming eight ancestral source populations (K = 8) based on the Human Origins datasets [12] including Africans, Americans, Oceanians, West Eurasians, South Asians, Central Asians/Siberians, and East Asians. The Uyghur population is highlighted in the center of the graph. The command line used for this result is “perl AncestryPainter.pl -i data.ind -q data.Q -t Uygur”. This example is also included in the AncestryPainter program package, which can be downloaded at http://www.picb.ac.cn/PGG/resource.php.

Method

The graphic program of AncestryPainter is written in Perl. By running the Perl program, an R script is produced based on the input ancestry composition of each population and then automatically run to plot the figure. The final figure is a circle with fixed perimeter and radius by automatically adjusting the relative space of each input population. It is of note that in AncestryPainter, for the purpose of presentation, same space is allocated to each population, regardless of the number of individuals included in each population. AncestryPainter automatically sorts the populations/individuals according to ancestry proportions when displaying the figure. Firstly, the ancestry that accounts for the largest proportion in the population is determined (defined as the representative ancestry of the population) and the input populations are categorized into K groups accordingly. Then, populations in each of the K groups are plotted in a descending order of the representative ancestry proportions. Individuals within each population are also sorted based on a descending order of its representative ancestry proportion.

Implementation

AncestryPainter requires users to provide a matrix of ancestry proportions of each individual as input data, which can be obtained from any analysis of personal or population ancestry. The program also requires a predefined group identifier of each individual to classify and label the individuals in the graph. The predefined group identifier could be population label, sampling location, or phenotypic classification. For instance, ADMIXTURE output file [1] that includes a matrix of ancestry proportions (.Q file) can be used directly as the input data of AncestryPainter. In theory, ancestry proportions obtained using any approach could be provided as the input data for AncestryPainter as long as the proportions across ancestral populations sum to one, and the input format is modified to meet software requirements. Users can run AncestryPainter with a command line under Linux, Mac, or Windows operating systems. It produces and automatically runs an R script to plot a figure file for displaying personal or population ancestry composition. The output of AncestryPainter includes a figure (pdf or png), the R script, the color scheme used for plotting the figure, and a file containing the ancestry proportions as well as population label of each individual.

Display options and features

Figure 1 exemplifies the display for the ancestry composition of global populations using AncestryPainter with the Uyghur population highlighted in the center. Dataset used to generate Figure 1 included 2345 individuals representing 203 populations from the Human Origins dataset [12]. The ancestry proportions of each population were obtained using ADMIXTURE program, assuming 8 pseudo ancestries (K = 8). Default color scheme was used in the figure and populations were automatically sorted according to admixture proportions. As shown in Figure 1, Uyghurs are composed of four major ancestries, with representative ancestral source populations from West Eurasia, South Asia, East Asia, and Siberia, respectively. AncestryPainter allows users to freely modify the default settings such as the color scheme, the order of populations/individuals, and what populations/individuals to be included with command options. Users can also specify a certain population or individual to be highlighted in the center of the graph, or plot without any population or individual highlighted in the center. If there are too many populations to present, users can remove the black lines between populations for a better display. In addition to displaying ancestry composition of individuals, AncestryPainter could be applied more extensively to illustrate any data that can be presented in a bar plot. As long as the assigned proportions for all items sum to 100%, the data can be displayed in a circular graph in the same way as shown for the personal or population ancestry composition.

Future developments

In this study, we developed AncestryPainter, a computational program that can be used to illustrate the ancestry compositions of populations and individuals with a rounded and nice-looking graph in a spatially-efficient way. In the future, we will implement more functions, for instance, to allow displaying ancestry compositions of populations assuming different number of ancestral populations (multiple Ks) in one image using multiple concentric circles, so that ancestry compositions at different levels could be presented in an explicit and efficient way.

Authors’ contributions

SX conceived and designed the study. DL and QF developed the program and wrote the computer code. QF drafted the manuscript and SX revised the manuscript. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.
  11 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Mapping human genetic diversity in Asia.

Authors:  Mahmood Ameen Abdulla; Ikhlak Ahmed; Anunchai Assawamakin; Jong Bhak; Samir K Brahmachari; Gayvelline C Calacal; Amit Chaurasia; Chien-Hsiun Chen; Jieming Chen; Yuan-Tsong Chen; Jiayou Chu; Eva Maria C Cutiongco-de la Paz; Maria Corazon A De Ungria; Frederick C Delfin; Juli Edo; Suthat Fuchareon; Ho Ghang; Takashi Gojobori; Junsong Han; Sheng-Feng Ho; Boon Peng Hoh; Wei Huang; Hidetoshi Inoko; Pankaj Jha; Timothy A Jinam; Li Jin; Jongsun Jung; Daoroong Kangwanpong; Jatupol Kampuansai; Giulia C Kennedy; Preeti Khurana; Hyung-Lae Kim; Kwangjoong Kim; Sangsoo Kim; Woo-Yeon Kim; Kuchan Kimm; Ryosuke Kimura; Tomohiro Koike; Supasak Kulawonganunchai; Vikrant Kumar; Poh San Lai; Jong-Young Lee; Sunghoon Lee; Edison T Liu; Partha P Majumder; Kiran Kumar Mandapati; Sangkot Marzuki; Wayne Mitchell; Mitali Mukerji; Kenji Naritomi; Chumpol Ngamphiw; Norio Niikawa; Nao Nishida; Bermseok Oh; Sangho Oh; Jun Ohashi; Akira Oka; Rick Ong; Carmencita D Padilla; Prasit Palittapongarnpim; Henry B Perdigon; Maude Elvira Phipps; Eileen Png; Yoshiyuki Sakaki; Jazelyn M Salvador; Yuliana Sandraling; Vinod Scaria; Mark Seielstad; Mohd Ros Sidek; Amit Sinha; Metawee Srikummool; Herawati Sudoyo; Sumio Sugano; Helena Suryadi; Yoshiyuki Suzuki; Kristina A Tabbada; Adrian Tan; Katsushi Tokunaga; Sissades Tongsima; Lilian P Villamor; Eric Wang; Ying Wang; Haifeng Wang; Jer-Yuarn Wu; Huasheng Xiao; Shuhua Xu; Jin Ok Yang; Yin Yao Shugart; Hyang-Sook Yoo; Wentao Yuan; Guoping Zhao; Bin Alwi Zilfalil
Journal:  Science       Date:  2009-12-11       Impact factor: 47.728

3.  Genomic dissection of population substructure of Han Chinese and its implication in association studies.

Authors:  Shuhua Xu; Xianyong Yin; Shilin Li; Wenfei Jin; Haiyi Lou; Ling Yang; Xiaohong Gong; Hongyan Wang; Yiping Shen; Xuedong Pan; Yungang He; Yajun Yang; Yi Wang; Wenqing Fu; Yu An; Jiucun Wang; Jingze Tan; Ji Qian; Xiaoli Chen; Xin Zhang; Yangfei Sun; Xuejun Zhang; Bailin Wu; Li Jin
Journal:  Am J Hum Genet       Date:  2009-12       Impact factor: 11.025

4.  Genetic structure of the Han Chinese population revealed by genome-wide SNP variation.

Authors:  Jieming Chen; Houfeng Zheng; Jin-Xin Bei; Liangdan Sun; Wei-hua Jia; Tao Li; Furen Zhang; Mark Seielstad; Yi-Xin Zeng; Xuejun Zhang; Jianjun Liu
Journal:  Am J Hum Genet       Date:  2009-12       Impact factor: 11.025

5.  Genetic History of Xinjiang's Uyghurs Suggests Bronze Age Multiple-Way Contacts in Eurasia.

Authors:  Qidi Feng; Yan Lu; Xumin Ni; Kai Yuan; Yajun Yang; Xiong Yang; Chang Liu; Haiyi Lou; Zhilin Ning; Yuchen Wang; Dongsheng Lu; Chao Zhang; Ying Zhou; Meng Shi; Lei Tian; Xiaoji Wang; Xi Zhang; Jing Li; Asifullah Khan; Yaqun Guan; Kun Tang; Sijia Wang; Shuhua Xu
Journal:  Mol Biol Evol       Date:  2017-10-01       Impact factor: 16.240

6.  Ancestral Origins and Genetic History of Tibetan Highlanders.

Authors:  Dongsheng Lu; Haiyi Lou; Kai Yuan; Xiaoji Wang; Yuchen Wang; Chao Zhang; Yan Lu; Xiong Yang; Lian Deng; Ying Zhou; Qidi Feng; Ya Hu; Qiliang Ding; Yajun Yang; Shilin Li; Li Jin; Yaqun Guan; Bing Su; Longli Kang; Shuhua Xu
Journal:  Am J Hum Genet       Date:  2016-08-25       Impact factor: 11.025

7.  Worldwide human relationships inferred from genome-wide patterns of variation.

Authors:  Jun Z Li; Devin M Absher; Hua Tang; Audrey M Southwick; Amanda M Casto; Sohini Ramachandran; Howard M Cann; Gregory S Barsh; Marcus Feldman; Luigi L Cavalli-Sforza; Richard M Myers
Journal:  Science       Date:  2008-02-22       Impact factor: 47.728

8.  Ancient human genomes suggest three ancestral populations for present-day Europeans.

Authors:  Iosif Lazaridis; Nick Patterson; Alissa Mittnik; Gabriel Renaud; Swapan Mallick; Karola Kirsanow; Peter H Sudmant; Joshua G Schraiber; Sergi Castellano; Mark Lipson; Bonnie Berger; Christos Economou; Ruth Bollongino; Qiaomei Fu; Kirsten I Bos; Susanne Nordenfelt; Heng Li; Cesare de Filippo; Kay Prüfer; Susanna Sawyer; Cosimo Posth; Wolfgang Haak; Fredrik Hallgren; Elin Fornander; Nadin Rohland; Dominique Delsate; Michael Francken; Jean-Michel Guinet; Joachim Wahl; George Ayodo; Hamza A Babiker; Graciela Bailliet; Elena Balanovska; Oleg Balanovsky; Ramiro Barrantes; Gabriel Bedoya; Haim Ben-Ami; Judit Bene; Fouad Berrada; Claudio M Bravi; Francesca Brisighelli; George B J Busby; Francesco Cali; Mikhail Churnosov; David E C Cole; Daniel Corach; Larissa Damba; George van Driem; Stanislav Dryomov; Jean-Michel Dugoujon; Sardana A Fedorova; Irene Gallego Romero; Marina Gubina; Michael Hammer; Brenna M Henn; Tor Hervig; Ugur Hodoglugil; Aashish R Jha; Sena Karachanak-Yankova; Rita Khusainova; Elza Khusnutdinova; Rick Kittles; Toomas Kivisild; William Klitz; Vaidutis Kučinskas; Alena Kushniarevich; Leila Laredj; Sergey Litvinov; Theologos Loukidis; Robert W Mahley; Béla Melegh; Ene Metspalu; Julio Molina; Joanna Mountain; Klemetti Näkkäläjärvi; Desislava Nesheva; Thomas Nyambo; Ludmila Osipova; Jüri Parik; Fedor Platonov; Olga Posukh; Valentino Romano; Francisco Rothhammer; Igor Rudan; Ruslan Ruizbakiev; Hovhannes Sahakyan; Antti Sajantila; Antonio Salas; Elena B Starikovskaya; Ayele Tarekegn; Draga Toncheva; Shahlo Turdikulova; Ingrida Uktveryte; Olga Utevska; René Vasquez; Mercedes Villena; Mikhail Voevoda; Cheryl A Winkler; Levon Yepiskoposyan; Pierre Zalloua; Tatijana Zemunik; Alan Cooper; Cristian Capelli; Mark G Thomas; Andres Ruiz-Linares; Sarah A Tishkoff; Lalji Singh; Kumarasamy Thangaraj; Richard Villems; David Comas; Rem Sukernik; Mait Metspalu; Matthias Meyer; Evan E Eichler; Joachim Burger; Montgomery Slatkin; Svante Pääbo; Janet Kelso; David Reich; Johannes Krause
Journal:  Nature       Date:  2014-09-18       Impact factor: 49.962

9.  Whole genome sequencing of Turkish genomes reveals functional private alleles and impact of genetic interactions with Europe, Asia and Africa.

Authors:  Can Alkan; Pinar Kavak; Mehmet Somel; Omer Gokcumen; Serkan Ugurlu; Ceren Saygi; Elif Dal; Kuyas Bugra; Tunga Güngör; S Cenk Sahinalp; Nesrin Özören; Cemalettin Bekpen
Journal:  BMC Genomics       Date:  2014-11-07       Impact factor: 3.969

10.  Differentiated demographic histories and local adaptations between Sherpas and Tibetans.

Authors:  Chao Zhang; Yan Lu; Qidi Feng; Xiaoji Wang; Haiyi Lou; Jiaojiao Liu; Zhilin Ning; Kai Yuan; Yuchen Wang; Ying Zhou; Lian Deng; Lijun Liu; Yajun Yang; Shilin Li; Lifeng Ma; Zhiying Zhang; Li Jin; Bing Su; Longli Kang; Shuhua Xu
Journal:  Genome Biol       Date:  2017-06-15       Impact factor: 13.583

View more
  5 in total

1.  Cross-continental admixture in the Kho population from northwest Pakistan.

Authors:  Asifullah Khan; Leonardo Vallini; Shahid Aziz; Luca Pagani; Shao-Qing Wen; Hizbullah Khan; Komal Zaib; Kiran Nigar; Qasim Ayub; Ling-Xiang Wang
Journal:  Eur J Hum Genet       Date:  2022-02-25       Impact factor: 5.351

2.  Whole mitochondrial genome analysis of the Daur ethnic minority from Hulunbuir in the Inner Mongolia Autonomous Region of China.

Authors:  Chi-Zao Wang; Xue-Er Yu; Mei-Sen Shi; Hui Li; Shu-Hua Ma
Journal:  BMC Ecol Evol       Date:  2022-05-18

3.  Genome-Wide Landscape of North-Eastern European Populations: A View from Lithuania.

Authors:  Alina Urnikyte; Alma Molyte; Vaidutis Kučinskas
Journal:  Genes (Basel)       Date:  2021-10-28       Impact factor: 4.096

4.  Age Estimate of GJB2-p.(Arg143Trp) Founder Variant in Hearing Impairment in Ghana, Suggests Multiple Independent Origins across Populations.

Authors:  Elvis Twumasi Aboagye; Samuel Mawuli Adadey; Kevin Esoh; Mario Jonas; Carmen de Kock; Lucas Amenga-Etego; Gordon A Awandare; Ambroise Wonkam
Journal:  Biology (Basel)       Date:  2022-03-21

5.  Ancient viral genomes reveal introduction of human pathogenic viruses into Mexico during the transatlantic slave trade.

Authors:  Axel A Guzmán-Solís; Viridiana Villa-Islas; Miriam J Bravo-López; Marcela Sandoval-Velasco; Julie K Wesp; Jorge A Gómez-Valdés; María de la Luz Moreno-Cabrera; Alejandro Meraz; Gabriela Solís-Pichardo; Peter Schaaf; Benjamin R TenOever; Daniel Blanco-Melo; María C Ávila Arcos
Journal:  Elife       Date:  2021-08-05       Impact factor: 8.140

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.