Literature DB >> 25781205

Combined use of principal component analysis and random forests identify population-informative single nucleotide polymorphisms: application in cattle breeds.

F Bertolini1, G Galimberti2, D G Calò2, G Schiavo1, D Matassino3, L Fontanesi1.   

Abstract

The genetic identification of the population of origin of individuals, including animals, has several practical applications in forensics, evolution, conservation genetics, breeding and authentication of animal products. Commercial high-density single nucleotide polymorphism (SNP) genotyping tools that have been recently developed in many species provide information from a large number of polymorphic sites that can be used to identify population-/breed-informative markers. In this study, starting from Illumina BovineSNP50 v1 BeadChip array genotyping data available from 3711 cattle of four breeds (2091 Italian Holstein, 738 Italian Brown, 475 Italian Simmental and 407 Marchigiana), principal component analysis (PCA) and random forests (RFs) were combined to identify informative SNP panels useful for cattle breed identification. From a PCA preselected list of 580 SNPs, RFs were computed using ranking methods (Mean Decrease in the Gini Index and Mean Accuracy Decrease) to identify the most informative 48 and 96 SNPs for breed assignment. The out-of-bag (OOB) error rate for both ranking methods and SNP densities ranged from 0.0 to 0.1% in the reference population. Application of this approach in a test population (10% of individuals pre-extracted from the whole data set) achieved 100% of correct assignment with both classifiers. Linkage disequilibrium between selected SNPs was relevant (r(2) > 0.6) only in few pairs of markers indicating that most of the selected SNPs captured different fractions of variance. Several informative SNPs were in genes/QTL regions that affect or are associated with phenotypes or production traits that might differentiate the investigated breeds. The combination of PCA and RF to perform SNP selection and breed assignment can be easily implemented and is able to identify subsets of informative SNPs useful for population assignment starting from a large number of markers derived by high-throughput genotyping platforms.
© 2015 Blackwell Verlag GmbH.

Entities:  

Keywords:  Allocation; SNP panel; bovine genome; informative SNPs; population assignment

Mesh:

Year:  2015        PMID: 25781205     DOI: 10.1111/jbg.12155

Source DB:  PubMed          Journal:  J Anim Breed Genet        ISSN: 0931-2668            Impact factor:   2.380


  7 in total

1.  Signatures of selection reveal candidate genes involved in economic traits and cold acclimation in five Swedish cattle breeds.

Authors:  Seyed Mohammad Ghoreishifar; Susanne Eriksson; Anna M Johansson; Majid Khansefid; Sima Moghaddaszadeh-Ahrabi; Nahid Parna; Pourya Davoudi; Arash Javanmard
Journal:  Genet Sel Evol       Date:  2020-09-04       Impact factor: 4.297

2.  Identification of Target Chicken Populations by Machine Learning Models Using the Minimum Number of SNPs.

Authors:  Dongwon Seo; Sunghyun Cho; Prabuddha Manjula; Nuri Choi; Young-Kuk Kim; Yeong Jun Koh; Seung Hwan Lee; Hyung-Yong Kim; Jun Heon Lee
Journal:  Animals (Basel)       Date:  2021-01-19       Impact factor: 2.752

3.  Signatures of selection are present in the genome of two close autochthonous cattle breeds raised in the North of Italy and mainly distinguished for their coat colours.

Authors:  Francesca Bertolini; Giulia Moscatelli; Giuseppina Schiavo; Samuele Bovo; Anisa Ribani; Mohamad Ballan; Massimo Bonacini; Marco Prandi; Stefania Dall'Olio; Luca Fontanesi
Journal:  J Anim Breed Genet       Date:  2021-11-28       Impact factor: 3.271

4.  Identification of Ancestry Informative Markers in Mediterranean Trout Populations of Molise (Italy): A Multi-Methodological Approach with Machine Learning.

Authors:  Giovanna Salvatore; Valentino Palombo; Stefano Esposito; Nicolaia Iaffaldano; Mariasilvia D'Andrea
Journal:  Genes (Basel)       Date:  2022-07-28       Impact factor: 4.141

5.  Species-informative SNP markers for characterising freshwater prawns of genus Macrobrachium in Cameroon.

Authors:  Judith G Makombu; Evans K Cheruiyot; Francesca Stomeo; David N Thuo; Pius M Oben; Benedicta O Oben; Paul Zango; Eric Mialhe; Jules R Ngueguim; Fidalis D N Mujibi
Journal:  PLoS One       Date:  2022-10-03       Impact factor: 3.752

6.  The Bos taurus-Bos indicus balance in fertility and milk related genes.

Authors:  Parthan Kasarapu; Laercio R Porto-Neto; Marina R S Fortes; Sigrid A Lehnert; Mauricio A Mudadu; Luiz Coutinho; Luciana Regitano; Andrew George; Antonio Reverter
Journal:  PLoS One       Date:  2017-08-01       Impact factor: 3.240

7.  Identification of Ancestry Informative Marker (AIM) Panels to Assess Hybridisation between Feral and Domestic Sheep.

Authors:  Elisa Somenzi; Paolo Ajmone-Marsan; Mario Barbato
Journal:  Animals (Basel)       Date:  2020-03-30       Impact factor: 2.752

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.