Literature DB >> 25591889

Precision assessment of some supervised and unsupervised algorithms for genotype discrimination in the genus Pisum using SSR molecular data.

Jaber Nasiri1, Mohammad Reza Naghavi2, Amir Hossein Kayvanjoo3, Mojtaba Nasiri4, Mansour Ebrahimi5.   

Abstract

For the first time, prediction accuracies of some supervised and unsupervised algorithms were evaluated in an SSR-based DNA fingerprinting study of a pea collection containing 20 cultivars and 57 wild samples. In general, according to the 10 attribute weighting models, the SSR alleles of PEAPHTAP-2 and PSBLOX13.2-1 were the two most important attributes to generate discrimination among eight different species and subspecies of genus Pisum. In addition, K-Medoids unsupervised clustering run on Chi squared dataset exhibited the best prediction accuracy (83.12%), while the lowest accuracy (25.97%) gained as K-Means model ran on FCdb database. Irrespective of some fluctuations, the overall accuracies of tree induction models were significantly high for many algorithms, and the attributes PSBLOX13.2-3 and PEAPHTAP could successfully detach Pisum fulvum accessions and cultivars from the others when two selected decision trees were taken into account. Meanwhile, the other used supervised algorithms exhibited overall reliable accuracies, even though in some rare cases, they gave us low amounts of accuracies. Our results, altogether, demonstrate promising applications of both supervised and unsupervised algorithms to provide suitable data mining tools regarding accurate fingerprinting of different species and subspecies of genus Pisum, as a fundamental priority task in breeding programs of the crop.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  DNA fingerprinting; Data mining; Genus Pisum; Machine learning; SSR markers

Mesh:

Substances:

Year:  2015        PMID: 25591889     DOI: 10.1016/j.jtbi.2015.01.001

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  2 in total

1.  Computational approaches for classification and prediction of P-type ATPase substrate specificity in Arabidopsis.

Authors:  Zahra Zinati; Abbas Alemzadeh; Amir Hossein KayvanJoo
Journal:  Physiol Mol Biol Plants       Date:  2016-04-07

2.  Machine Learning Based Classification of Microsatellite Variation: An Effective Approach for Phylogeographic Characterization of Olive Populations.

Authors:  Bahareh Torkzaban; Amir Hossein Kayvanjoo; Arman Ardalan; Soraya Mousavi; Roberto Mariotti; Luciana Baldoni; Esmaeil Ebrahimie; Mansour Ebrahimi; Mehdi Hosseini-Mazinani
Journal:  PLoS One       Date:  2015-11-24       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.