Literature DB >> 12407212

Comparisons of likelihood and machine learning methods of individual classification.

B Guinand1, A Topchy, K S Page, M K Burnham-Curtis, W F Punch, K T Scribner.   

Abstract

Classification methods used in machine learning (e.g., artificial neural networks, decision trees, and k-nearest neighbor clustering) are rarely used with population genetic data. We compare different nonparametric machine learning techniques with parametric likelihood estimations commonly employed in population genetics for purposes of assigning individuals to their population of origin ("assignment tests"). Classifier accuracy was compared across simulated data sets representing different levels of population differentiation (low and high F(ST)), number of loci surveyed (5 and 10), and allelic diversity (average of three or eight alleles per locus). Empirical data for the lake trout (Salvelinus namaycush) exhibiting levels of population differentiation comparable to those used in simulations were examined to further evaluate and compare classification methods. Classification error rates associated with artificial neural networks and likelihood estimators were lower for simulated data sets compared to k-nearest neighbor and decision tree classifiers over the entire range of parameters considered. Artificial neural networks only marginally outperformed the likelihood method for simulated data (0-2.8% lower error rates). The relative performance of each machine learning classifier improved relative likelihood estimators for empirical data sets, suggesting an ability to "learn" and utilize properties of empirical genotypic arrays intrinsic to each population. Likelihood-based estimation methods provide a more accessible option for reliable assignment of individuals to the population of origin due to the intricacies in development and evaluation of artificial neural networks.

Entities:  

Mesh:

Year:  2002        PMID: 12407212     DOI: 10.1093/jhered/93.4.260

Source DB:  PubMed          Journal:  J Hered        ISSN: 0022-1503            Impact factor:   2.645


  6 in total

1.  Informativeness of genetic markers for inference of ancestry.

Authors:  Noah A Rosenberg; Lei M Li; Ryk Ward; Jonathan K Pritchard
Journal:  Am J Hum Genet       Date:  2003-11-20       Impact factor: 11.025

2.  KLFDAPC: a supervised machine learning approach for spatial genetic structure analysis.

Authors:  Xinghu Qin; Charleston W K Chiang; Oscar E Gaggiotti
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

3.  Characterization of a likelihood based method and effects of markers informativeness in evaluation of admixture and population group assignment.

Authors:  Bao-Zhu Yang; Hongyu Zhao; Henry R Kranzler; Joel Gelernter
Journal:  BMC Genet       Date:  2005-10-14       Impact factor: 2.797

4.  Identification of Target Chicken Populations by Machine Learning Models Using the Minimum Number of SNPs.

Authors:  Dongwon Seo; Sunghyun Cho; Prabuddha Manjula; Nuri Choi; Young-Kuk Kim; Yeong Jun Koh; Seung Hwan Lee; Hyung-Yong Kim; Jun Heon Lee
Journal:  Animals (Basel)       Date:  2021-01-19       Impact factor: 2.752

5.  Authoritative subspecies diagnosis tool for European honey bees based on ancestry informative SNPs.

Authors:  Jamal Momeni; Melanie Parejo; Rikke Vingborg; Maria Bouga; Per Kryger; Marina D Meixner; Andone Estonba; Rasmus O Nielsen; Jorge Langa; Iratxe Montes; Laetitia Papoutsis; Leila Farajzadeh; Christian Bendixen; Eliza Căuia; Jean-Daniel Charrière; Mary F Coffey; Cecilia Costa; Raffaele Dall'Olio; Pilar De la Rúa; M Maja Drazic; Janja Filipi; Thomas Galea; Miroljub Golubovski; Ales Gregorc; Karina Grigoryan; Fani Hatjina; Rustem Ilyasov; Evgeniya Ivanova; Irakli Janashia; Irfan Kandemir; Aikaterini Karatasou; Meral Kekecoglu; Nikola Kezic; Enikö Sz Matray; David Mifsud; Rudolf Moosbeckhofer; Alexei G Nikolenko; Alexandros Papachristoforou; Plamen Petrov; M Alice Pinto; Aleksandr V Poskryakov; Aglyam Y Sharipov; Adrian Siceanu; M Ihsan Soysal; Aleksandar Uzunov; Marion Zammit-Mangion
Journal:  BMC Genomics       Date:  2021-02-03       Impact factor: 3.969

6.  Machine Learning Based Classification of Microsatellite Variation: An Effective Approach for Phylogeographic Characterization of Olive Populations.

Authors:  Bahareh Torkzaban; Amir Hossein Kayvanjoo; Arman Ardalan; Soraya Mousavi; Roberto Mariotti; Luciana Baldoni; Esmaeil Ebrahimie; Mansour Ebrahimi; Mehdi Hosseini-Mazinani
Journal:  PLoS One       Date:  2015-11-24       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.