Literature DB >> 36124792

SALAI-Net: species-agnostic local ancestry inference network.

Benet Oriol Sabat1,2, Daniel Mas Montserrat2, Xavier Giro-I-Nieto1, Alexander G Ioannidis2,3.   

Abstract

MOTIVATION: Local ancestry inference (LAI) is the high resolution prediction of ancestry labels along a DNA sequence. LAI is important in the study of human history and migrations, and it is beginning to play a role in precision medicine applications including ancestry-adjusted genome-wide association studies (GWASs) and polygenic risk scores (PRSs). Existing LAI models do not generalize well between species, chromosomes or even ancestry groups, requiring re-training for each different setting. Furthermore, such methods can lack interpretability, which is an important element in each of these applications.
RESULTS: We present SALAI-Net, a portable statistical LAI method that can be applied on any set of species and ancestries (species-agnostic), requiring only haplotype data and no other biological parameters. Inspired by identity by descent methods, SALAI-Net estimates population labels for each segment of DNA by performing a reference matching approach, which leads to an interpretable and fast technique. We benchmark our models on whole-genome data of humans and we test these models' ability to generalize to dog breeds when trained on human data. SALAI-Net outperforms previous methods in terms of balanced accuracy, while generalizing between different settings, species and datasets. Moreover, it is up to two orders of magnitude faster and uses considerably less RAM memory than competing methods.
AVAILABILITY AND IMPLEMENTATION: We provide an open source implementation and links to publicly available data at github.com/AI-sandbox/SALAI-Net. Data is publicly available as follows: https://www.internationalgenome.org (1000 Genomes), https://www.simonsfoundation.org/simons-genome-diversity-project (Simons Genome Diversity Project), https://www.sanger.ac.uk/resources/downloads/human/hapmap3.html (HapMap), ftp://ngs.sanger.ac.uk/production/hgdp/hgdp_wgs.20190516 (Human Genome Diversity Project) and https://www.ncbi.nlm.nih.gov/bioproject/PRJNA448733 (Canid genomes). SUPPLEMENTARY INFORMATION: Supplementary data are available from Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2022        PMID: 36124792      PMCID: PMC9486591          DOI: 10.1093/bioinformatics/btac464

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


  33 in total

1.  Screening Human Embryos for Polygenic Traits Has Limited Utility.

Authors:  Ehud Karavani; Or Zuk; Danny Zeevi; Nir Barzilai; Nikos C Stefanis; Alex Hatzimanolis; Nikolaos Smyrnis; Dimitrios Avramopoulos; Leonid Kruglyak; Gil Atzmon; Max Lam; Todd Lencz; Shai Carmi
Journal:  Cell       Date:  2019-11-21       Impact factor: 41.582

2.  Effect of genetic divergence in identifying ancestral origin using HAPAA.

Authors:  Andreas Sundquist; Eugene Fratkin; Chuong B Do; Serafim Batzoglou
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

3.  Estimating local ancestry in admixed populations.

Authors:  Sriram Sankararaman; Srinath Sridhar; Gad Kimmel; Eran Halperin
Journal:  Am J Hum Genet       Date:  2008-02       Impact factor: 11.025

4.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

5.  Relatedness mapping and tracts of relatedness for genome-wide data in the presence of linkage disequilibrium.

Authors:  Anders Albrechtsen; Thorfinn Sand Korneliussen; Ida Moltke; Thomas van Overseem Hansen; Finn Cilius Nielsen; Rasmus Nielsen
Journal:  Genet Epidemiol       Date:  2009-04       Impact factor: 2.135

6.  POPULATION GENETICS. Genomic evidence for the Pleistocene and recent population history of Native Americans.

Authors:  Maanasa Raghavan; Matthias Steinrücken; Kelley Harris; Stephan Schiffels; Simon Rasmussen; Michael DeGiorgio; Anders Albrechtsen; Cristina Valdiosera; María C Ávila-Arcos; Anna-Sapfo Malaspinas; Anders Eriksson; Ida Moltke; Mait Metspalu; Julian R Homburger; Jeff Wall; Omar E Cornejo; J Víctor Moreno-Mayar; Thorfinn S Korneliussen; Tracey Pierre; Morten Rasmussen; Paula F Campos; Peter de Barros Damgaard; Morten E Allentoft; John Lindo; Ene Metspalu; Ricardo Rodríguez-Varela; Josefina Mansilla; Celeste Henrickson; Andaine Seguin-Orlando; Helena Malmström; Thomas Stafford; Suyash S Shringarpure; Andrés Moreno-Estrada; Monika Karmin; Kristiina Tambets; Anders Bergström; Yali Xue; Vera Warmuth; Andrew D Friend; Joy Singarayer; Paul Valdes; Francois Balloux; Ilán Leboreiro; Jose Luis Vera; Hector Rangel-Villalobos; Davide Pettener; Donata Luiselli; Loren G Davis; Evelyne Heyer; Christoph P E Zollikofer; Marcia S Ponce de León; Colin I Smith; Vaughan Grimes; Kelly-Anne Pike; Michael Deal; Benjamin T Fuller; Bernardo Arriaza; Vivien Standen; Maria F Luz; Francois Ricaut; Niede Guidon; Ludmila Osipova; Mikhail I Voevoda; Olga L Posukh; Oleg Balanovsky; Maria Lavryashina; Yuri Bogunov; Elza Khusnutdinova; Marina Gubina; Elena Balanovska; Sardana Fedorova; Sergey Litvinov; Boris Malyarchuk; Miroslava Derenko; M J Mosher; David Archer; Jerome Cybulski; Barbara Petzelt; Joycelynn Mitchell; Rosita Worl; Paul J Norman; Peter Parham; Brian M Kemp; Toomas Kivisild; Chris Tyler-Smith; Manjinder S Sandhu; Michael Crawford; Richard Villems; David Glenn Smith; Michael R Waters; Ted Goebel; John R Johnson; Ripan S Malhi; Mattias Jakobsson; David J Meltzer; Andrea Manica; Richard Durbin; Carlos D Bustamante; Yun S Song; Rasmus Nielsen; Eske Willerslev
Journal:  Science       Date:  2015-07-21       Impact factor: 47.728

7.  Ancestry, admixture and fitness in Colombian genomes.

Authors:  Lavanya Rishishwar; Andrew B Conley; Charles H Wigington; Lu Wang; Augusto Valderrama-Aguirre; I King Jordan
Journal:  Sci Rep       Date:  2015-07-21       Impact factor: 4.379

Review 8.  Genetic Ancestry Inference and Its Application for the Genetic Mapping of Human Diseases.

Authors:  Eva Suarez-Pajes; Ana Díaz-de Usera; Itahisa Marcelino-Rodríguez; Beatriz Guillen-Guio; Carlos Flores
Journal:  Int J Mol Sci       Date:  2021-06-28       Impact factor: 5.923

9.  Cross-species hybridization and the origin of North African date palms.

Authors:  Jonathan M Flowers; Khaled M Hazzouri; Muriel Gros-Balthazard; Ziyi Mo; Konstantina Koutroumpa; Andreas Perrakis; Sylvie Ferrand; Hussam S M Khierallah; Dorian Q Fuller; Frederique Aberlenc; Christini Fournaraki; Michael D Purugganan
Journal:  Proc Natl Acad Sci U S A       Date:  2019-01-14       Impact factor: 11.205

10.  Ancestry deconvolution and partial polygenic score can improve susceptibility predictions in recently admixed individuals.

Authors:  Davide Marnetto; Katri Pärna; Kristi Läll; Ludovica Molinaro; Francesco Montinaro; Toomas Haller; Mait Metspalu; Reedik Mägi; Krista Fischer; Luca Pagani
Journal:  Nat Commun       Date:  2020-04-02       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.