| Literature DB >> 29496957 |
Joanna Kaplanis1,2, Assaf Gordon1,2, Tal Shor3,4, Omer Weissbrod5, Dan Geiger4, Mary Wahl1,2,6, Michael Gershovits2, Barak Markus2, Mona Sheikh2, Melissa Gymrek1,2,7,8,9, Gaurav Bhatia10,11, Daniel G MacArthur7,9,10, Alkes L Price10,11,12, Yaniv Erlich13,2,3,14,15.
Abstract
Family trees have vast applications in fields as diverse as genetics, anthropology, and economics. However, the collection of extended family trees is tedious and usually relies on resources with limited geographical scope and complex data usage restrictions. We collected 86 million profiles from publicly available online data shared by genealogy enthusiasts. After extensive cleaning and validation, we obtained population-scale family trees, including a single pedigree of 13 million individuals. We leveraged the data to partition the genetic architecture of human longevity and to provide insights into the geographical dispersion of families. We also report a simple digital procedure to overlay other data sets with our resource.Entities:
Mesh:
Year: 2018 PMID: 29496957 PMCID: PMC6593158 DOI: 10.1126/science.aam9309
Source DB: PubMed Journal: Science ISSN: 0036-8075 Impact factor: 47.728