Literature DB >> 32405166

GenPop-An Online Tool to Analyze Human Population Genetic Data.

B Arundhati Mahesh1, E Kannan2, G Dicky John Davis1, P Venkatesan1, P K Ragunath1.   

Abstract

GenPop is a web based online cross platform tool developed to help Geneticist and Epidemiologist to deal with association studies in analyzing human population genetic data. The tool features include descriptive analysis such as Hardy-Weinberg equilibrium test, chi-square p-value and analysis of single nucleotide polymorphisms (SNPs) with multiple inheritance models such as dominant, recessive, allelic, genotype, odd's ratio and relative risk at 95% confidence interval and analysis of multiple SNPs including haplotype frequencies and linkage disequilibrium for a pair of biallelic markers. This is a user-driven human population genetic data analysis tool that is easily scalable and acceptable with multiple implementations of different algorithms. GenPop has been developed using PHP, JavaScript and with PHPExcel library to analyse the genetic data for case control studies.
© 2020 Biomedical Informatics.

Entities:  

Keywords:  Geneticist; Hardy-Weinberg equilibrium; PHP; epidemiologist

Year:  2020        PMID: 32405166      PMCID: PMC7196168          DOI: 10.6026/97320630016149

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Genetic epidemiology deals with the study the role of genetic factors involved in determining health and diseases in families and as well in populations which seeks to derive statistical and quantitative analysis of how genetics work in larger groups. There are various software packages developed for analyzing human genetic data which rely on computer-based algorithm that is not passable in certain instances, few packages provide a single function and are difficult to install and use. Statistical packages can be used to perform these study analysis, but an assistance of computational tool is mainly needed by researcher to perform specific analysis like HWE, haplotype estimation, and at times difficulty in integrating results from different packages at a shorter time span [1]. Thereby as a constraint of trend in the use of worldwide Web technology and Web design that aims at enhancing creativity, information sharing and communication among users in analysing the case control data.

Materials and Methods:

GenPop is developed using JavaScript, which is a dynamic, integrated, and prototype-based language that makes it easy to use and flexible [2]. PHP is a server-side scripting language with PHPExcel library to analyse. The example used in help menu is taken from elsewhere [3]. The workflow of the tool is shown in Figure 1 flowchart.
Figure 1

Flowchart for GenPop Tool

Results

Descriptive and association analysis of SNPs:

Testing Hardy-Weinberg equilibrium is commonly performed for analyzing genetic marker data such as SNPs in population studies. The chi-square test determines if a sample data matches a population. The p and q allelic frequencies for the observed phenotype or genotype are calculated to get chi-square p value (Figure 3). The tool also provides ODD's ratio and Risk Ratio with 95% confidence interval for phenotype or genotype using logistic regression analysis (Figure 2).
Figure 3

P and q allele frequency with chi-square p value for the observed Genotype

Figure 2

ODD's Ratio, Standard Error and 95% confidence interval for the observed Phenotype

Linkage disequilibrium analysis:

Linkage disequilibrium (LD) refers to the dependence of alleles from neighboring loci and can provide information on population histories and disease mapping. A widely used statistic measuring pairwise LD between single nucleotide polymorphisms (SNPs) and or multi allelic markers is Hedrick's D',r2 and χ2 which is based on two-locus haplotype frequencies [4] as shown in Figure 4.
Figure 4

Haplotype Frequencies, LD Statistics, χ2 of 3x3

Discussion:

There are many computer programs in population genetics that have been successful in hiding the complexity of the computations from the user but they often rely on assumptions that are crucial for a correct interpretation of the results [5]. The research community uses the R statistical and computing language since all R code is open source.The language allows functions to be evaluated and modified by the user [6].GenPop is a tool developed that gives integrated results for a single input based on the user's choice without much time consumption and is free and easily available on web as an online tool.

Conclusions:

GenPop is an online cross platform tool that is useful in performing analysis of association studies based on single nucleotide polymorphisms (SNPs) or biallelic markers.
  5 in total

1.  SNPStats: a web tool for the analysis of association studies.

Authors:  Xavier Solé; Elisabet Guinó; Joan Valls; Raquel Iniesta; Víctor Moreno
Journal:  Bioinformatics       Date:  2006-05-23       Impact factor: 6.937

Review 2.  Computer programs for population genetics data analysis: a survival guide.

Authors:  Laurent Excoffier; Gerald Heckel
Journal:  Nat Rev Genet       Date:  2006-08-22       Impact factor: 53.242

3.  TCF7L2 rs7903146 polymorphism and diabetic nephropathy association is not independent of type 2 diabetes--a study in a south Indian population and meta-analysis.

Authors:  Hajarah Hussain; Vinu Ramachandran; Samathmika Ravi; Teena Sajan; Kiruthiha Ehambaram; Venkatesh Babu Gurramkonda; Gnanasambandan Ramanathan; Lakkakula Venkata Bhaskar
Journal:  Endokrynol Pol       Date:  2014       Impact factor: 1.582

4.  Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality.

Authors:  Zhian N Kamvar; Jonah C Brooks; Niklaus J Grünwald
Journal:  Front Genet       Date:  2015-06-10       Impact factor: 4.599

5.  'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools.

Authors:  Yao Qing Shen; Gertraud Burger
Journal:  BMC Bioinformatics       Date:  2007-10-29       Impact factor: 3.169

  5 in total
  2 in total

1.  Dispersal patterns and population genetic structure of Aedes albopictus (Diptera: Culicidae) in three different climatic regions of China.

Authors:  Jian Gao; Heng-Duan Zhang; Xiao-Xia Guo; Dan Xing; Yan-De Dong; Ce-Jie Lan; Ge Wang; Chao-Jie Li; Chun-Xiao Li; Tong-Yan Zhao
Journal:  Parasit Vectors       Date:  2021-01-06       Impact factor: 3.876

2.  High genetic diversity and low population differentiation of a medical plant Ficus hirta Vahl., uncovered by microsatellite loci: implications for conservation and breeding.

Authors:  Yi Lu; Jianling Chen; Bing Chen; Qianqian Liu; Hanlin Zhang; Liyuan Yang; Zhi Chao; Enwei Tian
Journal:  BMC Plant Biol       Date:  2022-07-12       Impact factor: 5.260

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.