Literature DB >> 35771611

mapDATAge: a ShinyR package to chart ancient DNA data through space and time.

Xuexue Liu1, Ludovic Orlando1.   

Abstract

SUMMARY: Ancient DNA datasets are increasingly difficult to visualise for users lacking computational experience. Here, we describe mapDATAge, which aims to provide user-friendly automated modules for the interactive mapping of allele, haplogroup and/or ancestry distributions through space and time. mapDATAge enhances collaborative data sharing while assists the assessment and reporting of spatio-temporal patterns of genetic changes. AVAILABILITY: mapDATAge is a Shiny R application designed for exploring spatiotemporal patterns in ancient DNA data through a graphical user interface (GUI). It is freely available under GNU Public License in Github: https://github.com/xuefenfei712/mapDATAge. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Year:  2022        PMID: 35771611      PMCID: PMC9364369          DOI: 10.1093/bioinformatics/btac425

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


1 Introduction

Ancient DNA research focuses on the genetic characterization of archaeological assemblages and sediments within the last 1.5 million-year timescale (van der Valk ). With the ever-growing capacity of high throughput sequencing instruments and improved DNA manipulation techniques, it has become increasingly possible to chart patterns of genetic variation through space and time at the scale of uniparental markers, individual Single Nucleotide Polymorphism (SNP) or even the whole genome (Orlando ). The temporal stratification of allelic frequencies at individual loci has also provided improved resolution into the detection of selection signatures (Schraiber ). Furthermore, spatiotemporal changes in individual ancestry profiles have helped reconstruct the atlas of past population migrations on the planet, mostly in humans (Nielsen ), but increasingly across a range of other species, mainly domestic plants (Kistler ) and animals (Frantz ). While ancient DNA analysis typically involves the exploration of patterns of genetic variation through space and time, there are currently no user-friendly tools facilitating the underlying visualization steps. mapDATAge provides the first interactive platform to map spatiotemporal patterns in ancient genetic data. It helps users generate hypotheses by identifying regions and/or time periods characterized by important changes in their genetic composition. It also improves the collaborative experience by allowing all stakeholders and project partners to directly interact with the data.

2 Implementation

mapDATAge is designed to visualize and explore the presence of geographic and temporal patterns in ancient DNA data. It takes simple tabulated text files as input, providing samples as rows and those data types to be visualized as columns, including age, GPS coordinates, presence/absence of alleles, ancestry components, Principal Component Analysis (PCA) coordinates and more. It displays different modules to interactively: (i) map the spatiotemporal distribution of a given set of samples or alleles (AMAP); (ii) draw temporal trajectories of allele frequencies, estimating mean and confidence intervals assuming binomial sampling for genotype data or iteratively random sampling one read per sample if read counts are provided (TRAJECTORY, Fig. 1A and B); (iii) draw maps of individual ancestry profiles in two user-defined time slices (ANCESTRY, Fig. 1C and D); (iv) PCA and/or Mutidimensional Scaling (MDS) projections (PCA); (v) draw the spatial distribution of alleles at one or multiple loci (MULTISNPS) and (vi) map (sub) haplogroup distributions (HAPLO).
Fig. 1.

Two examples of data visualization with mapDATAge. (A) Configuration options; (B) Temporal trajectory for the T allele at the rs4988235 locus in Europeans (data from the Allen Ancient DNA Resources, https://reich.hms.harvard.edu/allen-ancient-dna-resource-aadr-downloadable-genotypes-present-day-and-ancient-dna-data). Colored dots represent the number of individuals considered in each time bin. (C) and (D) Horse ancestry profiles prior to and following 4200 years ago (data from Librado ). Six genetic ancestry components were considered

Two examples of data visualization with mapDATAge. (A) Configuration options; (B) Temporal trajectory for the T allele at the rs4988235 locus in Europeans (data from the Allen Ancient DNA Resources, https://reich.hms.harvard.edu/allen-ancient-dna-resource-aadr-downloadable-genotypes-present-day-and-ancient-dna-data). Colored dots represent the number of individuals considered in each time bin. (C) and (D) Horse ancestry profiles prior to and following 4200 years ago (data from Librado ). Six genetic ancestry components were considered Users can select the geographic and/or temporal range of interest, color palette options and the list of annotations displayed on each individual location. The final ONECLICK module allows users to automatically generate figures in html format, applying a preselected range of temporal and spatial parameters. This can prove useful to contrast data from different loci and/or species.

3 Application

To demonstrate the versatility of mapDATAge, we prepared three example files providing geolocated and time-stamped ancient DNA data. The first tabulates the frequency of T allele at rs4988235, responsible for lactose tolerance, in 2120 ancient and modern Europeans, together with sex and mitochondrial haplotype information. The second provides the ancestry profiles and PCA components of 271 horses from Librado . The last dataset tabulates allele counts for 427 previously published ancient horses (Fages ; Librado , 2017, 2021), at nine loci, causative for locomotory, stature and coat-coloration phenotypes. Figure 1A shows menus allowing users to select specific visualization parameters, such as the time and geographic range, etc. Figure 1B illustrates the AMAP panel, which shows the previously reported rise of the rs4988235 T frequency within the last ∼3000 years in Europe (Segurel ). Figure 1C and D were generated using the ANCESTRY module to illustrate the massive change in the horse genomic makeup that followed the expansion of the DOM2 bloodline approximately ∼4200 years ago (Librado ). Installation instructions, guidance for formatting input files and further illustrations of additional features are provided as Supplementary Information.

4 Conclusion

mapDATAge facilitates the interactive visualization of ancient DNA data through space and time. It provides a user-friendly platform for the discovery of spatiotemporal shifts in the genetic composition of populations of interest, which can serve as the basis for generating new hypotheses. It also enhances the collaborative experience by allowing all stakeholders, including those lacking genetic and/or bioinformatic expertise, to actively explore data content. Click here for additional data file.
  10 in total

1.  Bayesian Inference of Natural Selection from Allele Frequency Time Series.

Authors:  Joshua G Schraiber; Steven N Evans; Montgomery Slatkin
Journal:  Genetics       Date:  2016-03-23       Impact factor: 4.562

Review 2.  Ancient Plant Genomics in Archaeology, Herbaria, and the Environment.

Authors:  Logan Kistler; Vanessa C Bieker; Michael D Martin; Mikkel Winther Pedersen; Jazmín Ramos Madrigal; Nathan Wales
Journal:  Annu Rev Plant Biol       Date:  2020-03-02       Impact factor: 26.379

Review 3.  Tracing the peopling of the world through genomics.

Authors:  Rasmus Nielsen; Joshua M Akey; Mattias Jakobsson; Jonathan K Pritchard; Sarah Tishkoff; Eske Willerslev
Journal:  Nature       Date:  2017-01-18       Impact factor: 49.962

4.  Ancient genomic changes associated with domestication of the horse.

Authors:  Pablo Librado; Cristina Gamba; Charleen Gaunitz; Clio Der Sarkissian; Mélanie Pruvost; Anders Albrechtsen; Antoine Fages; Naveed Khan; Mikkel Schubert; Vidhya Jagannathan; Aitor Serres-Armero; Lukas F K Kuderna; Inna S Povolotskaya; Andaine Seguin-Orlando; Sébastien Lepetz; Markus Neuditschko; Catherine Thèves; Saleh Alquraishi; Ahmed H Alfarhan; Khaled Al-Rasheid; Stefan Rieder; Zainolla Samashev; Henri-Paul Francfort; Norbert Benecke; Michael Hofreiter; Arne Ludwig; Christine Keyser; Tomas Marques-Bonet; Bertrand Ludes; Eric Crubézy; Tosso Leeb; Eske Willerslev; Ludovic Orlando
Journal:  Science       Date:  2017-04-28       Impact factor: 47.728

5.  Tracking the origins of Yakutian horses and the genetic basis for their fast adaptation to subarctic environments.

Authors:  Pablo Librado; Clio Der Sarkissian; Luca Ermini; Mikkel Schubert; Hákon Jónsson; Anders Albrechtsen; Matteo Fumagalli; Melinda A Yang; Cristina Gamba; Andaine Seguin-Orlando; Cecilie D Mortensen; Bent Petersen; Cindi A Hoover; Belen Lorente-Galdos; Artem Nedoluzhko; Eugenia Boulygina; Svetlana Tsygankova; Markus Neuditschko; Vidhya Jagannathan; Catherine Thèves; Ahmed H Alfarhan; Saleh A Alquraishi; Khaled A S Al-Rasheid; Thomas Sicheritz-Ponten; Ruslan Popov; Semyon Grigoriev; Anatoly N Alekseev; Edward M Rubin; Molly McCue; Stefan Rieder; Tosso Leeb; Alexei Tikhonov; Eric Crubézy; Montgomery Slatkin; Tomas Marques-Bonet; Rasmus Nielsen; Eske Willerslev; Juha Kantanen; Egor Prokhortchouk; Ludovic Orlando
Journal:  Proc Natl Acad Sci U S A       Date:  2015-11-23       Impact factor: 11.205

6.  Million-year-old DNA sheds light on the genomic history of mammoths.

Authors:  Tom van der Valk; Patrícia Pečnerová; David Díez-Del-Molino; Anders Bergström; Jonas Oppenheimer; Stefanie Hartmann; Georgios Xenikoudakis; Jessica A Thomas; Marianne Dehasque; Ekin Sağlıcan; Fatma Rabia Fidan; Ian Barnes; Shanlin Liu; Mehmet Somel; Peter D Heintzman; Pavel Nikolskiy; Beth Shapiro; Pontus Skoglund; Michael Hofreiter; Adrian M Lister; Anders Götherström; Love Dalén
Journal:  Nature       Date:  2021-02-17       Impact factor: 49.962

7.  The origins and spread of domestic horses from the Western Eurasian steppes.

Authors:  Pablo Librado; Naveed Khan; Antoine Fages; Mariya A Kusliy; Tomasz Suchan; Laure Tonasso-Calvière; Stéphanie Schiavinato; Duha Alioglu; Aurore Fromentier; Aude Perdereau; Jean-Marc Aury; Charleen Gaunitz; Lorelei Chauvey; Andaine Seguin-Orlando; Clio Der Sarkissian; John Southon; Beth Shapiro; Alexey A Tishkin; Alexey A Kovalev; Saleh Alquraishi; Ahmed H Alfarhan; Khaled A S Al-Rasheid; Timo Seregély; Lutz Klassen; Rune Iversen; Olivier Bignon-Lau; Pierre Bodu; Monique Olive; Jean-Christophe Castel; Myriam Boudadi-Maligne; Nadir Alvarez; Mietje Germonpré; Magdalena Moskal-Del Hoyo; Jarosław Wilczyński; Sylwia Pospuła; Anna Lasota-Kuś; Krzysztof Tunia; Marek Nowak; Eve Rannamäe; Urmas Saarma; Gennady Boeskorov; Lembi Lōugas; René Kyselý; Lubomír Peške; Adrian Bălășescu; Valentin Dumitrașcu; Roxana Dobrescu; Daniel Gerber; Viktória Kiss; Anna Szécsényi-Nagy; Balázs G Mende; Zsolt Gallina; Krisztina Somogyi; Gabriella Kulcsár; Erika Gál; Robin Bendrey; Morten E Allentoft; Ghenadie Sirbu; Valentin Dergachev; Henry Shephard; Noémie Tomadini; Sandrine Grouard; Aleksei Kasparov; Alexander E Basilyan; Mikhail A Anisimov; Pavel A Nikolskiy; Elena Y Pavlova; Vladimir Pitulko; Gottfried Brem; Barbara Wallner; Christoph Schwall; Marcel Keller; Keiko Kitagawa; Alexander N Bessudnov; Alexander Bessudnov; William Taylor; Jérome Magail; Jamiyan-Ombo Gantulga; Jamsranjav Bayarsaikhan; Diimaajav Erdenebaatar; Kubatbeek Tabaldiev; Enkhbayar Mijiddorj; Bazartseren Boldgiv; Turbat Tsagaan; Mélanie Pruvost; Sandra Olsen; Cheryl A Makarewicz; Silvia Valenzuela Lamas; Silvia Albizuri Canadell; Ariadna Nieto Espinet; Ma Pilar Iborra; Jaime Lira Garrido; Esther Rodríguez González; Sebastián Celestino; Carmen Olària; Juan Luis Arsuaga; Nadiia Kotova; Alexander Pryor; Pam Crabtree; Rinat Zhumatayev; Abdesh Toleubaev; Nina L Morgunova; Tatiana Kuznetsova; David Lordkipanize; Matilde Marzullo; Ornella Prato; Giovanna Bagnasco Gianni; Umberto Tecchiati; Benoit Clavel; Sébastien Lepetz; Hossein Davoudi; Marjan Mashkour; Natalia Ya Berezina; Philipp W Stockhammer; Johannes Krause; Wolfgang Haak; Arturo Morales-Muñiz; Norbert Benecke; Michael Hofreiter; Arne Ludwig; Alexander S Graphodatsky; Joris Peters; Kirill Yu Kiryushin; Tumur-Ochir Iderkhangai; Nikolay A Bokovenko; Sergey K Vasiliev; Nikolai N Seregin; Konstantin V Chugunov; Natalya A Plasteeva; Gennady F Baryshnikov; Ekaterina Petrova; Mikhail Sablin; Elina Ananyevskaya; Andrey Logvin; Irina Shevnina; Victor Logvin; Saule Kalieva; Valeriy Loman; Igor Kukushkin; Ilya Merz; Victor Merz; Sergazy Sakenov; Victor Varfolomeyev; Emma Usmanova; Viktor Zaibert; Benjamin Arbuckle; Andrey B Belinskiy; Alexej Kalmykov; Sabine Reinhold; Svend Hansen; Aleksandr I Yudin; Alekandr A Vybornov; Andrey Epimakhov; Natalia S Berezina; Natalia Roslyakova; Pavel A Kosintsev; Pavel F Kuznetsov; David Anthony; Guus J Kroonen; Kristian Kristiansen; Patrick Wincker; Alan Outram; Ludovic Orlando
Journal:  Nature       Date:  2021-10-20       Impact factor: 49.962

Review 8.  Animal domestication in the era of ancient genomics.

Authors:  Laurent A F Frantz; Daniel G Bradley; Greger Larson; Ludovic Orlando
Journal:  Nat Rev Genet       Date:  2020-04-07       Impact factor: 53.242

9.  Tracking Five Millennia of Horse Management with Extensive Ancient Genome Time Series.

Authors:  Antoine Fages; Kristian Hanghøj; Naveed Khan; Charleen Gaunitz; Andaine Seguin-Orlando; Michela Leonardi; Christian McCrory Constantz; Cristina Gamba; Khaled A S Al-Rasheid; Silvia Albizuri; Ahmed H Alfarhan; Morten Allentoft; Saleh Alquraishi; David Anthony; Nurbol Baimukhanov; James H Barrett; Jamsranjav Bayarsaikhan; Norbert Benecke; Eloísa Bernáldez-Sánchez; Luis Berrocal-Rangel; Fereidoun Biglari; Sanne Boessenkool; Bazartseren Boldgiv; Gottfried Brem; Dorcas Brown; Joachim Burger; Eric Crubézy; Linas Daugnora; Hossein Davoudi; Peter de Barros Damgaard; María de Los Ángeles de Chorro Y de Villa-Ceballos; Sabine Deschler-Erb; Cleia Detry; Nadine Dill; Maria do Mar Oom; Anna Dohr; Sturla Ellingvåg; Diimaajav Erdenebaatar; Homa Fathi; Sabine Felkel; Carlos Fernández-Rodríguez; Esteban García-Viñas; Mietje Germonpré; José D Granado; Jón H Hallsson; Helmut Hemmer; Michael Hofreiter; Aleksei Kasparov; Mutalib Khasanov; Roya Khazaeli; Pavel Kosintsev; Kristian Kristiansen; Tabaldiev Kubatbek; Lukas Kuderna; Pavel Kuznetsov; Haeedeh Laleh; Jennifer A Leonard; Johanna Lhuillier; Corina Liesau von Lettow-Vorbeck; Andrey Logvin; Lembi Lõugas; Arne Ludwig; Cristina Luis; Ana Margarida Arruda; Tomas Marques-Bonet; Raquel Matoso Silva; Victor Merz; Enkhbayar Mijiddorj; Bryan K Miller; Oleg Monchalov; Fatemeh A Mohaseb; Arturo Morales; Ariadna Nieto-Espinet; Heidi Nistelberger; Vedat Onar; Albína H Pálsdóttir; Vladimir Pitulko; Konstantin Pitskhelauri; Mélanie Pruvost; Petra Rajic Sikanjic; Anita Rapan Papeša; Natalia Roslyakova; Alireza Sardari; Eberhard Sauer; Renate Schafberg; Amelie Scheu; Jörg Schibler; Angela Schlumbaum; Nathalie Serrand; Aitor Serres-Armero; Beth Shapiro; Shiva Sheikhi Seno; Irina Shevnina; Sonia Shidrang; John Southon; Bastiaan Star; Naomi Sykes; Kamal Taheri; William Taylor; Wolf-Rüdiger Teegen; Tajana Trbojević Vukičević; Simon Trixl; Dashzeveg Tumen; Sainbileg Undrakhbold; Emma Usmanova; Ali Vahdati; Silvia Valenzuela-Lamas; Catarina Viegas; Barbara Wallner; Jaco Weinstock; Victor Zaibert; Benoit Clavel; Sébastien Lepetz; Marjan Mashkour; Agnar Helgason; Kári Stefánsson; Eric Barrey; Eske Willerslev; Alan K Outram; Pablo Librado; Ludovic Orlando
Journal:  Cell       Date:  2019-05-02       Impact factor: 66.850

10.  Why and when was lactase persistence selected for? Insights from Central Asian herders and ancient DNA.

Authors:  Laure Segurel; Perle Guarino-Vignon; Nina Marchi; Sophie Lafosse; Romain Laurent; Céline Bon; Alexandre Fabre; Tatyana Hegay; Evelyne Heyer
Journal:  PLoS Biol       Date:  2020-06-08       Impact factor: 8.029

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.