Inferring population structure and relationship using minimal independent evolutionary markers in Y-chromosome: a hybrid approach of recursive feature selection for hierarchical clustering.

Amit Kumar Srivastava, Rupali Chopra, Shafat Ali, Shweta Aggarwal, Lovekesh Vig, Rameshwar Nath Koul Bamezai.

Abstract

The flood of evolutionary markers generated by the Human Genome Project and the 1000 Genomes Consortium has made it necessary to prune redundant and dependent variables. Various computational tools based on machine-learning and data-mining methods, such as feature selection and feature extraction, have been proposed to escape the curse of dimensionality in large datasets. Evolutionary studies, which rely primarily on sequentially evolved variations, have so far not benefited from these advances. Here, we present a novel approach of recursive feature selection for hierarchical clustering of Y-chromosomal SNPs/haplogroups that selects a minimal set of independent markers, sufficient to infer population structure as precisely as a larger number of evolutionary markers does. To validate the applicability of our approach, we designed an optimized MALDI-TOF mass spectrometry-based assay that accommodates the independent Y-chromosomal markers in a single multiplex, and genotyped two geographically distinct Indian populations. An analysis of 105 worldwide populations showed that 15 independent variations/markers were optimal for defining population structure parameters such as FST, molecular variance and correlation-based relationship. Subsequent addition of randomly selected markers had a negligible effect (close to zero, i.e. 1 × 10⁻³) on these parameters. The study proves efficient for tracing complex population structures and deriving relationships among worldwide populations in a cost-effective and expedient manner.
© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
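The abstract names recursive feature selection for hierarchical clustering but does not spell out the procedure. The following is only a minimal sketch of one way such a selection could work, not the authors' algorithm. It assumes a matrix of per-population marker frequencies (rows are populations, columns are markers), Euclidean distances, average linkage, and cophenetic correlation with the full-marker tree as the selection criterion. The function names `cophenetic_distances` and `select_markers` and the toy data are hypothetical.

```python
import numpy as np
from scipy.cluster.hierarchy import cophenet, linkage
from scipy.spatial.distance import pdist


def cophenetic_distances(freqs):
    """Condensed cophenetic distances from average-linkage clustering of populations."""
    tree = linkage(pdist(freqs, metric="euclidean"), method="average")
    return cophenet(tree)


def select_markers(freqs, n_keep):
    """Greedy backward elimination (an assumed strategy, not the paper's):
    repeatedly drop the marker whose removal changes the population tree
    built from all markers the least."""
    if n_keep < 1:
        raise ValueError("n_keep must be at least 1")
    reference = cophenetic_distances(freqs)
    kept = list(range(freqs.shape[1]))
    while len(kept) > n_keep:
        best_score, best_drop = -np.inf, None
        for marker in kept:
            trial = [k for k in kept if k != marker]
            score = np.corrcoef(reference, cophenetic_distances(freqs[:, trial]))[0, 1]
            if np.isnan(score):  # degenerate tree (all distances equal)
                score = -1.0
            if score > best_score:
                best_score, best_drop = score, marker
        kept.remove(best_drop)
    return kept


# Toy data (synthetic, for illustration only): 6 populations, 8 markers,
# where markers 4-7 duplicate markers 0-3 and are therefore redundant.
rng = np.random.default_rng(0)
base = rng.random((6, 4))
toy_freqs = np.hstack([base, base])
print(select_markers(toy_freqs, n_keep=4))
```

Backward elimination is used here because it compares every candidate subset against the tree built from all markers, which mirrors the abstract's goal of matching the structure inferred from the larger marker set. The stopping point (`n_keep`) is supplied by the caller in this sketch; the study reports 15 markers as optimal for its 105 populations.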

Year: 2014; PMID: 25030906; PMCID: PMC4150763; DOI: 10.1093/nar/gku585

Source DB: PubMed; Journal: Nucleic Acids Res; ISSN: 0305-1048; Impact factor: 16.971


